Gene Franean1_3392 details

Gene Information

Locus tag: Franean1_3392
Symbol:
ID: 5671763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4018172
End bp: 4020376
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 69%
IMG OID: 641242280
Product: MMPL domain-containing protein
Protein accession: YP_001507700
Protein GI: 158315192
COG category: [R] General function prediction only
COG ID: [COG2409] Predicted drug exporters of the RND superfamily
TIGRFAM ID:
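
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4018172
end_bp = 4020376

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")     # 2205 bp
print(protein_length, "aa")  # 734 aa
```

A non-zero remainder would indicate a frameshift or a coordinate error; here the gene length divides evenly into 735 codons.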


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0160817
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGGTGC TGGTCACCGT CGGCTTCGGC GCGGCCAGGG CACCCGCCGC CCCGGACGAC 
GCGTTCTCCA TGCCAGGTAC CGAATCCCAG CGGGCGTTCG ACCTCCTAGA CGAACGCTTC
CCCGGCACCG GAGCGGACGG GGCGGTCGCC CGCATCGTCT TCGTCGCCCC ACCAGGCCAG
ACCCTGACCA CACCCGAAAA CCGCACCACT GTTGAACGAG TCGTCGCGGA GGCCGCCGCC
AGCCCACAGG TCGCCAACGC CGTGAACCCC TTCCAAAGCG GGGCGGTCAG CACGGACGCG
GCGACCGCCT ACGCCACGGT CTCCTACACG GCGATCAGTG ATGATCTCAC CGACGCCACC
AAAAACAGCC TGCAAGGCGC GGTCACAGAC GGCCAGACAG CGGGCCTGAC CGTCGAGCTA
GGCGGAGACG CGCTGACAAC CCGGCCAGGG GCGGGCGGAG TGACCGAGGC CGTCGGAATC
GCGATCGCCG CACTCGTCCT GCTGATCACC TTCGGATCCC TGGCCGCAGC CGGGCTACCG
CTACTGACCG CGATCGTCAG CGTCGGCGTC GGAATCGCCT CGATCATGGC CCTGGCCAGC
ACCCTCGGCC TGTCCTCAAC CACCAGCACC CTCGCGATGA TGCTCGGCCT CGCCGTCGGC
ATCGACTACG CCGTGTTCAT CGTCTCCCGC TACCGGGAAG AACACGCCCG CGGGCTGGAA
CCCCAGGACG CCACAGCGGT CGCGACCGGC ACCGCCGGGT CATCGGTAGT GTTCGCCGGG
CTCACCGTGG TGATCGCGCT GGCCGGCCTG TTCATCGTCG GAGTCCCAAC CCTGACGAAA
ATGGGCCTGG CCGCCGCGGG CACCGTCGGT ATCGCCGTCG GAGTCGCGCT GACCCTCGTC
CCGGCGCTGC TCGGGTTCTT CCCCCGCGCC GTGCTCCCCC GCTCCACACG CAAGAGCACC
ACGCGCAGCA CCACAAGTAG GTTCGCGCGC AGAGCCACGA GGAAGACCGA GCACCGCGGG
CCCAACGCGG GCACCCGCTG GGCGAACCTG ATCCTGCGCC GCCCGCTGCC CGTCCTCATC
CTCTCCGTAC TCGCCCTGGG GGCGATCGCC CTGCCCGTCC TGGACCTGCG CCTGGGCACG
GCCGGCGACG AGGCGAAGCC CACCTCCACC ACCGAACGCC GCGCCTACGA CGACCTCGCC
GCGGGCTTCG GGCCAGGCTT CAACGGCCCA CTGACCATCG TCGTCGACGC GACAGGTTCC
GACAACGCGC AGACAGCGGT CACCACGATC ACCCAGAAGA TCAGCGCAAC ACCCGGTGTC
GTCTCCGCCT CGGCCGCCCG GTTCAACACC GCGGGCGACA CAGCGGTATT CACCGCGGTG
CCGGCCACCG GACCGAGCGA GGCAGCAACC AAGGACCTCG TCCACACCAT CCGCGCGCAA
CGCGCCACGG TCACCGCCGC CACCGGCGCG ACCTTCCAGG TCACCGGCAC CACCGCCGTG
AACATCGACA TCGCCCAGAA GGTCCAGGAC GCACTCATCC CCTACCTCGC CATCGTGGTG
GGCCTGGCGT TCCTGCTCCT GCTCGTGCTG TTCCGCTCGG TACTCGTCCC ACTCAAAGCC
GCCCTCGGGT TCCTACTCTC CGTCCTGGCT GCCCTCGGAG CAGTCGTCGC GGTCTTCCAA
TGGGGCTGGC TCGCTGGGCT CATCGGCCTC CACCAAACCG GACCCATCAT GAGCATGATG
CCGATCTTCA TGGTCGGTAT CGTCTTCGGC CTCGCCATGG ACTACGAGGT CTTCCTCGTC
GCCCGCATCC GCGAGGCCCA CGTCCACGGC GAGAACGCCC GGGACGCGAT CACCTCCGGG
TTCGGGTACA GCGCCCGCGT CGTGGTCGCC GCCGCACTGA TCATGATGGC GGTCTTCGCC
GGCTTCATCG GCACCAGCGA ACCGATCATC AAAATGATCG GGTTCGGCCT GGCCACCGCG
GTCCTACTCG ACGCCTTCGT CGTCCGCATG ACCATCGTCC CCGCCGTCCT CGCCCTTCTC
GGAGAGAAGG CATGGTGGAT CCCACGCCAC CTCGACCGGG TCCTGCCCCA CATCGACGTC
GAGGGCGAGA CGCTGAACCG GCCCACCGCC GTGGCACCGG CCGTTGCGGC ACCGGCCACC
GGCCGCGAGG AACCCGTCGC CCTGGAATCC GCGTCCGCAC GATAA
 
Protein sequence
MGVLVTVGFG AARAPAAPDD AFSMPGTESQ RAFDLLDERF PGTGADGAVA RIVFVAPPGQ 
TLTTPENRTT VERVVAEAAA SPQVANAVNP FQSGAVSTDA ATAYATVSYT AISDDLTDAT
KNSLQGAVTD GQTAGLTVEL GGDALTTRPG AGGVTEAVGI AIAALVLLIT FGSLAAAGLP
LLTAIVSVGV GIASIMALAS TLGLSSTTST LAMMLGLAVG IDYAVFIVSR YREEHARGLE
PQDATAVATG TAGSSVVFAG LTVVIALAGL FIVGVPTLTK MGLAAAGTVG IAVGVALTLV
PALLGFFPRA VLPRSTRKST TRSTTSRFAR RATRKTEHRG PNAGTRWANL ILRRPLPVLI
LSVLALGAIA LPVLDLRLGT AGDEAKPTST TERRAYDDLA AGFGPGFNGP LTIVVDATGS
DNAQTAVTTI TQKISATPGV VSASAARFNT AGDTAVFTAV PATGPSEAAT KDLVHTIRAQ
RATVTAATGA TFQVTGTTAV NIDIAQKVQD ALIPYLAIVV GLAFLLLLVL FRSVLVPLKA
ALGFLLSVLA ALGAVVAVFQ WGWLAGLIGL HQTGPIMSMM PIFMVGIVFG LAMDYEVFLV
ARIREAHVHG ENARDAITSG FGYSARVVVA AALIMMAVFA GFIGTSEPII KMIGFGLATA
VLLDAFVVRM TIVPAVLALL GEKAWWIPRH LDRVLPHIDV EGETLNRPTA VAPAVAAPAT
GREEPVALES ASAR
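
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, applied only to the first 30 and last 15 bases of the gene sequence above. The `translate` helper and its `has_start` parameter are hypothetical names introduced for illustration, and the codon table is the standard one, which table 11 shares for amino acid assignments.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; any of them initiates with Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq, has_start=True):
    """Translate a DNA fragment, stopping at the first stop codon.

    Hypothetical helper: whitespace is ignored, and when has_start is
    True a permitted start codon in first position is read as 'M'.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


head = translate("GTGGGGGTGC TGGTCACCGT CGGCTTCGGC")
tail = translate("GCGTCCGCAC GATAA", has_start=False)
print(head)  # MGVLVTVGFG
print(tail)  # ASAR
```

Both fragments agree with the listed protein sequence: the first ten residues are MGVLVTVGFG, and the last four residues before the TAA stop codon are ASAR.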