Gene Franean1_5853 details

Gene Information

Locus tag: Franean1_5853
Symbol:
ID: 5674176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7100804
End bp: 7103263
Gene Length: 2460 bp
Protein Length: 819 aa
Translation table: 11
GC content: 72%
IMG OID: 641244703
Product: glycosyl transferase family protein
Protein accession: YP_001510105
Protein GI: 158317597
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0744] Membrane carboxypeptidase (penicillin-binding protein)
TIGRFAM ID:
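
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(7100804, 7103263)
print(length, protein_length_aa(length))  # prints: 2460 819
```

Both values match the Gene Length and Protein Length fields reported for this record.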


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.698953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.762762
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGCC GGCCGCGGGG TGGTGTCACC CACCCGGACG AGACAGGTGG TCCCGCCGGG 
CGGTCCCGCG GTGGGCCGAA CGGGCCCGGG CGGCTCGGGC GGTCGGTGCC GCGTCCCCCG
GGGAGCCGTT CACGCGGCAC GTCGGCGCCG CTCGGCAGCG CCGACGACCT GCCGTTCGGG
CCGCCGACGG GCGGAATTCG CCGCCCAGTC GACCGGCGGC GGCACCTCAC CCGGGCGGGG
CTCATCGCGA CCGCCATGGG CGTCGCGGTC GGCCTCATGG CGTTGCCGTT CCTGAGCGCC
GCGGGGCTGT TCGCGAAGGC GTCCGCCGAC CACTTCCTGG AACTTCCGGC GGACCTCGTC
ATCCCGCCGC CGCCACAGAG CTCCAAGATC CTCGCGGCGG ACGGATCCGT GATCGCCACG
CTCACCGGTA CAGAGAACCG GGAGGTCGTC ACCGGCGACC AGATCCCGAA GATCATGCGG
GAGGCGATCG TCTCGATCGA GGACGCCCGC TTCTACTCTC ACAACGGTGT CGACCCCCAG
GGGGTGTTGC GGGCCGCGGC CCGCAACAGC GAGGCCGGCT CCACCACCCA GGGCGGCTCC
ACACTGACCC AGCAGTACGT GAAGAACGTG CTGCTGCAGG ACGCCACCAC TCCCGAGGAG
CGCGACGCGG TCGCCGGCGA CTCGCTGGAC CGCAAGCTGC GCGAGCTCCG CTACGCGATC
GCGATCGAGC AGCGGTTCAG CAAGGACGAG ATCCTCACCC GGTACCTGAA CATCGCCTAC
TTCGGCGACG GCGCGTACGG CGTGGGGACG GCGGCCGAGC ACTACTTCGG CATTCCCATC
TCCGAGGTCA GCCTCGAGCA GGCCGCCCTG CTCGCCGGGC TCGTCCAGAG CCCGTCGCGC
TACGACCCGG TGAACCACCC GGAGGCCGCA CTCGAGCGCA GGAACATGGT GCTCACCCGG
ATGGCCGACG AGCACTACGT CAGCCGCACC CAGGCGGACG CCGCCAGGCA GATGCCGATC
GACGTCATCC GCGACCAGCC GACCACGCAG GACTCCTGTG AGATCTCGGT GGCGCCGTTC
TTCTGCGACT ACATCCGCAC CCGGCTGCGC GCCAACCCGG CGCTGGGCGC CACCGCCGAG
GAGCGTGACC GGCGCATCCA CGAGGGTGGC CTGGTCATCA AGACCACGCT GAACCCGCAG
GTGCAGCAGG CGGCGCAGAT GGCGGTCAAC TCGACCATCC CGCCGGACAA CCGGGTGTCC
GCGACCGAGG TCGTCATCCA GCCGGGCACC GGCGCGATCC TCGGGATGGC GGTCAACCGC
GTCTACGGGA CCAACGAGGC GGCGAACCAG ACGAAGGTCT CGCTGCCGAT GGAGCCCACG
TTCCAGCCCG GGTCGACCTT CAAGACGTTC ACCGTGGCGG CCGCACTCGA GCAGGGGTAC
GGCGTGGGGA CGGCGTGGTA CTCCCCCGCC TGCTACTCGA CCGACGCTTT CCCGCTCGAC
CGCGGCGAGG GGGACTGCCC CAAGGGCTAC CAGAACGCCG ACCCGGCCGA AGCCGGCATC
TACCGGATGG ACGACGCGAC CTGGGATTCG GTCAACACCT ACTTCATCCA GCTCGAGGAG
GAGGTCGGGG TGCCGGCGGT CGTCGAGATG GCGACCAGGC TCGGCGTCTC CCCCGACCAC
CTCAAGGACA TCCCGCCGAC CAAGGGCGAC GTCACCATCG GCGGTGAGTA CGTCAGCCCG
ATGGACATGG CGGTCGCCTA CTCCACGCTG GCGGCCGGCG GCGTCCGGTG CGAGCCGCGC
TTCGCCACCT CGGTGGTCGA CGCCGGTGGC CAGCAGATCG ACGTGGGCAA CACGCCGAAG
TGCGAGCGGG TGCTGTCCCA GGGCGTCGCG GACACCACGA CGAGCATCCT GGCCGGGGTG
CTCACCAAGG GCACCGGCGG GAACGCCCGG TTCGACCACC CCGCGGCGGG CAAGACCGGC
ACCAACGACG GCTTCTCCAG CGCCTGGTTC GTCGGCTACA CCCCGCAGAT CGCGGCGGCG
GTGGCCGTTG GCGACCCGCA CGGCGCGGTC GCGCACCCGC TGCGCAACGT CGTCGCGGCC
GGCCGCACCT GGCCGCACGT GTTCGGCGGC GACCTGCCGG CGATCATCTG GGGCAGTTCG
ATGCGGGGCG CGCTGGCCGG GCAGCCCGTG CAGCCGCTGC CCGGCGCGGA CCGCGACGTG
GCCCGGGGAA CCAGGGGCGG CCGACTCATG AACACCCCAC CACCGGCACC CACCCCGACG
CTGCCGAGCC TGGAAGATCT GCTGCCAGGC GCGGGCACCG GCCAGACCCC GGGGTTCGGG
CAGGGCGGCG GTGGCGGTGG TCAGGGCCCG GGGCAGGTCG TCGTGCCGCA GACCAACCAG
GGCACCGGCC AGGGGCAACA ACAGCAGGGC CAACCCGGCC GCAACAACCA GGGCCGCTAG
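
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes GC content as a percentage, and checks for a terminal stop codon under translation table 11 (whose stop codons are TAA, TAG and TGA). The function names are hypothetical, and only the final line of the sequence is used as sample input to keep the example short.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS_TABLE_11


# Final line of the gene sequence, copied from the record.
last_line = "GGCACCGGCC AGGGGCAACA ACAGCAGGGC CAACCCGGCC GCAACAACCA GGGCCGCTAG"
seq = clean_sequence(last_line)
print(len(seq), ends_with_stop(seq))  # prints: 60 True
```

Passing the full gene sequence to `clean_sequence` and `gc_percent` in the same way would allow a comparison with the GC content field of the record.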
 
Protein sequence
MAGRPRGGVT HPDETGGPAG RSRGGPNGPG RLGRSVPRPP GSRSRGTSAP LGSADDLPFG 
PPTGGIRRPV DRRRHLTRAG LIATAMGVAV GLMALPFLSA AGLFAKASAD HFLELPADLV
IPPPPQSSKI LAADGSVIAT LTGTENREVV TGDQIPKIMR EAIVSIEDAR FYSHNGVDPQ
GVLRAAARNS EAGSTTQGGS TLTQQYVKNV LLQDATTPEE RDAVAGDSLD RKLRELRYAI
AIEQRFSKDE ILTRYLNIAY FGDGAYGVGT AAEHYFGIPI SEVSLEQAAL LAGLVQSPSR
YDPVNHPEAA LERRNMVLTR MADEHYVSRT QADAARQMPI DVIRDQPTTQ DSCEISVAPF
FCDYIRTRLR ANPALGATAE ERDRRIHEGG LVIKTTLNPQ VQQAAQMAVN STIPPDNRVS
ATEVVIQPGT GAILGMAVNR VYGTNEAANQ TKVSLPMEPT FQPGSTFKTF TVAAALEQGY
GVGTAWYSPA CYSTDAFPLD RGEGDCPKGY QNADPAEAGI YRMDDATWDS VNTYFIQLEE
EVGVPAVVEM ATRLGVSPDH LKDIPPTKGD VTIGGEYVSP MDMAVAYSTL AAGGVRCEPR
FATSVVDAGG QQIDVGNTPK CERVLSQGVA DTTTSILAGV LTKGTGGNAR FDHPAAGKTG
TNDGFSSAWF VGYTPQIAAA VAVGDPHGAV AHPLRNVVAA GRTWPHVFGG DLPAIIWGSS
MRGALAGQPV QPLPGADRDV ARGTRGGRLM NTPPPAPTPT LPSLEDLLPG AGTGQTPGFG
QGGGGGGQGP GQVVVPQTNQ GTGQGQQQQG QPGRNNQGR