Gene Francci3_2294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2294 
Symbol 
ID3904828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2673362 
End bp2674585 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content71% 
IMG OID637879625 
Productmonooxygenase, FAD-binding 
Protein accessionYP_481391 
Protein GI86740991 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.539275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00707471 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACGACG CAATTGTGGT GGGTGCCCGG TGCGCCGGTT CCCCGCTGGC CATGCTGCTG 
GCCCGGCGCG GCCACCGGGT GCTGCTGGTC GACCGCGCGA CCTTTCCGGC GGACACGCTG
TCCACCCACT ATCTGCCCCA GGCCGGGGCA CACCAGCCGG CCGGTTGGGG GCTGCTTGAC
CGACTTGTCC GAACCGGGTG TCCGCCGATC ACCGAGATGA CGCTGTCCTG GGAGGACAGC
GTCATCACCG GCGGCGTCGA TCCGGTGGAC GGCATCAGCG CGGCCTACGC ACCCCGGCGC
ACCGTGCTGG ACGCGATGCT GCTGGACGCC GCGCTCGACG CCGGCGTCGA GGTCCGCCTG
GGCTACCCGG TGACCGACGT ACTGGTAGCC GACGGCCGCG CGGTGGGTAT CCGCGGTGGC
ACCCGGCGCT CCGGCGGCGC CGCGGCCGAG GATCGGGCCG CCATTGTCAT CGGCGCGGAC
GGCCCCCGCT CCACGATCGC GGCGACGATG AACGCCGCCT TCTACAACGT CGTACCGGCC
GCCAGCTTCA TCTACTACTC GTACTGGAGC GGCCTCGACC GGCAGCACAG CGCCCGCTTC
CGCAACGGGG CCCAGATCGG CTGCTGGCCG ACTAACGACG GCCTGACCGT GGTTGCGGTT
ATGCGGCGGC GGGAGCGCCT GGCCGAGTTC CGGGCCGACG TGCCGGGAAA CTTCCTCGGT
GTCGTGCGTG CCGTCTTCCC CGAGTTGGCC GACGAGCTGG CCACCCGCGG CCGGCGCGAG
GAACGCTTCC ACGGCTCCCT CTACCCCGAC AACTACTACC GCGCAGGCCA CGGGCCGGGC
TGGGCGCTGG TCGGCGACGC CGGTTACCAC CGCGATCCCG TGACCGCCCA GGGCATGCTC
GACGCCTTCA CCCAGGCGGA CCTGCTGGCG GCCGCGGTGG ACCGCGGCCT GTCCGGGCAG
CAGCCGATGA ACGCCGCCCT GGCGGATTTC CAGCGGCATC GCGACGAGGC GACCGCCGCG
TCCTACCGGC TTGCCTGCAC GGTAGGCGAA CTGGCGTTCC CCCCCGAACT GGCCGCGCTG
CTGGTGGCCG CGGCGAACAG TCCAGAAACC CGGAAAAAGT TTCTCGGTAT GGTTGCCGGC
TTCGTTCCGC TGGCGGAGTT CTTCGCCCCG GCGCACCTGA CCGAACCGAA ATGCTCTCCA
ACGAATCCTG ACCCTGAACC CTGA
 
Protein sequence
MYDAIVVGAR CAGSPLAMLL ARRGHRVLLV DRATFPADTL STHYLPQAGA HQPAGWGLLD 
RLVRTGCPPI TEMTLSWEDS VITGGVDPVD GISAAYAPRR TVLDAMLLDA ALDAGVEVRL
GYPVTDVLVA DGRAVGIRGG TRRSGGAAAE DRAAIVIGAD GPRSTIAATM NAAFYNVVPA
ASFIYYSYWS GLDRQHSARF RNGAQIGCWP TNDGLTVVAV MRRRERLAEF RADVPGNFLG
VVRAVFPELA DELATRGRRE ERFHGSLYPD NYYRAGHGPG WALVGDAGYH RDPVTAQGML
DAFTQADLLA AAVDRGLSGQ QPMNAALADF QRHRDEATAA SYRLACTVGE LAFPPELAAL
LVAAANSPET RKKFLGMVAG FVPLAEFFAP AHLTEPKCSP TNPDPEP