Gene Francci3_2901 details

Gene Information

Locus tag: Francci3_2901
Symbol:
ID: 3903965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3415734
End bp: 3417299
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 71%
IMG OID: 637880222
Product: hypothetical protein
Protein accession: YP_481988
Protein GI: 86741588
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
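
The start, end, gene length and protein length fields in the record above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3415734
END_BP = 3417299


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1566 521
```

Both values agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields listed in the record.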


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0623331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.318501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACAT CATCCGCACC GTTGCGCTGG ACCACCCGAT GCGTCATCGC GGGCGGTGGG 
CCGGCGGGGA TGATGCTCGG TCTGTTGCTG GCCCGCGCAG GGGTGGACGT CATCGTGCTG
GAAAAGCACG ATGATTTTGC CCGTGACTTC CGCGGTGACA CGATTCACCC GTCCACGATG
GCAGTGATGG CGGAGCTGGG CCTGCTGACG GACTTCCTTC GCATCCCGCA CACCCGGGCC
GCCACGCTGG CTTTGGATAT GGCGGGCCGG CGCCGGACCG TTGTGGACTT CCGGCACCTG
CGGACGCCCT GCCCGTTCAT CGCCCTGATG CCCCAATGGG ACTTTCTCAC GTTTCTGGCC
GAGCGGGCCG GTGCCTATCC GACGTTCCGC CTGGCGATGA GCACCGAGGC GACCGACCTG
GTCCGGGCGA ATGGACGGGT CGTGGGCGTG CGCGCGGCCG GCCCCCTCGG GGAGGTCGAG
ATCCGGGCGG ATCTGACGGT GGCCGCCGAC GGCCGGCACT CGACCCTGCG GTCCCGTGCC
GGCCTGCCGG TGCGGGAGCG CGGCGCTCCC TTCGACGTCC TCTGGTTCCG GCTGCCGAAA
GACATGGGTG ACAGGTCCGC GAGCGGCCGC CGGGCGGCGA GGGACGGGAA CGGGAACGAG
GAGGGGAACG GGCGTGGGGA GGAGAAGGGG AACGAGCGTG GGGAGGAGGG AAACGGGGAC
GGATTCACCC TGGCGCACCT CCGCAAGGGC CACGCCCTGA TCACCCTGGA TCGACGCGAC
TACTGGCAGT GCGGCATGGT GGTCCGGAAG GGGTCGGCGC AGCGGCAGCC AAGGACGGCC
GGCGGGCTGG CGGCGTTCCG TGCGCAGATC ACCACCGCGG CGCCGGCGCT GTCCGGTGCC
GTTGACGACC TCACCGACTG GGACCAGGTG AAGACCCTGG TGGTGCAGGT CGACCGGCTT
CGCCGATGGT TCCAGCCGGG TCTGCTCTGC ATCGGCGACG CCGCCCACGC GATGTCCCCG
GCGGGCGGCG TCGGGGTGAA CTACGCCGTC CAGGACGCGG TGGCGACGGC GAACCTGATG
GCCGTGACGC TGCGGGCCGG GCCACCCGAG CCGGCCGAGC TGCGGCGGGT GCAGCGCCGG
CGGACCTGGC CGGTCGTGCT CATGCAGATG ATCCAGGTCC GGCAGGGCGC CTTCCTGGTA
CGCCTGTTGG GTGACGACGA GCGGCCAGCG CACGGCGGCA GCCCGTCGCG CCAGGTCGCG
CGTGCCCCCC TGACGAACGC GACGACGAAC GCGACGGCAA GGACGATGAC GGGGGCGGTG
CGGGCCGGGA TGTCCAACCT GGTGACGGCC ATGATGTCCT ATGGGGTGAC CGCCACGGCG
GCGCCGCGGA TCGGGCGGGT GCTCGGGCGG GTACTCGGGC GGCTACTCGG GCGAGTCATC
GGCATCGGAT TTCGGCCCGA ACACGTCCGC ACCCCCGACG TGTTCGCCGA GGATGCCGGA
TACACCAAGA ATGCCGGATA CGCCAAGAAT GCCGGGTACG CCGAGGGCGT CGAGCGGGCG
AGATGA
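
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such a block-formatted sequence could be cleaned and its GC percentage computed is shown below; the helper names are hypothetical, and the example uses only the first line of the sequence, so its result is not expected to equal the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGGGAACAT CATCCGCACC GTTGCGCTGG ACCACCCGAT GCGTCATCGC GGGCGGTGGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field.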
 
Protein sequence
MGTSSAPLRW TTRCVIAGGG PAGMMLGLLL ARAGVDVIVL EKHDDFARDF RGDTIHPSTM 
AVMAELGLLT DFLRIPHTRA ATLALDMAGR RRTVVDFRHL RTPCPFIALM PQWDFLTFLA
ERAGAYPTFR LAMSTEATDL VRANGRVVGV RAAGPLGEVE IRADLTVAAD GRHSTLRSRA
GLPVRERGAP FDVLWFRLPK DMGDRSASGR RAARDGNGNE EGNGRGEEKG NERGEEGNGD
GFTLAHLRKG HALITLDRRD YWQCGMVVRK GSAQRQPRTA GGLAAFRAQI TTAAPALSGA
VDDLTDWDQV KTLVVQVDRL RRWFQPGLLC IGDAAHAMSP AGGVGVNYAV QDAVATANLM
AVTLRAGPPE PAELRRVQRR RTWPVVLMQM IQVRQGAFLV RLLGDDERPA HGGSPSRQVA
RAPLTNATTN ATARTMTGAV RAGMSNLVTA MMSYGVTATA APRIGRVLGR VLGRLLGRVI
GIGFRPEHVR TPDVFAEDAG YTKNAGYAKN AGYAEGVERA R
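
The protein sequence above corresponds to the gene sequence read under translation table 11. The sketch below shows one way to reproduce the translation. It is an assumption-laden illustration: it uses the codon-to-amino-acid assignments shared by the standard code and table 11, does not model the alternative start codons that table 11 permits, and stops at the first in-frame stop codon. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGAACAT CATCCGCACC GTTGCGCTGG"))  # prints: MGTSSAPLRW
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (`AGATGA`) translate to `R` followed by a stop, consistent with the last residue listed.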