Gene Franean1_4864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4864 
SymbolcobD 
ID5673204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5836690 
End bp5837688 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content81% 
IMG OID641243719 
Productcobalamin biosynthesis protein 
Protein accessionYP_001509135 
Protein GI158316627 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1270] Cobalamin biosynthesis protein CobD/CbiB 
TIGRFAM ID[TIGR00380] cobalamin biosynthesis protein CobD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0377956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.399913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCG CCCGCACGGT GGGGCTCGTC CTCGGCGCGG CCCTCGACGC GGTGCTCGCC 
GACCCTGCCC GCGGCCATCC CGTGGCCGGT TTCGGGCGGG TCGCCGGCGG CCTCGAGCGG
GCCGTCCACC GTGACAGCCG GCTGGTCGGC GCGGCGTACG CCACGGTCCT CGTCGCCGGC
ACGGGCGCCG CGGCCGCCGG CGCGGAACGG GCGCTGGCCG GCCGGCACCC GGGACGGCGG
GCCGCGCTGG CCCGCGCGGG CCTGACCGCC GCCACGACCT GGACGGTGCT CGGCGGCACC
TCGCTGCGGC GCCAGGGCCG GGCCCTCGGC GGCGAGCTGG AACGCCGCGA CCTGCGCGCG
GCCCGGACGC GGCTGCCGTC GCTGTGCGGG CGGGACCCGT CGGCCCTCGA CGCCGGCGGG
CTCGCGCGTG CCGGGGTGGA GTCGGTGGCG GAGAACACCT CGGACGCCGT CGTCGCCCCG
CTGCTGTGGG GGGCCGTCGC CGGGCTGCCC GGGCTGGTGG CCTACCGCGC GGCGAACACC
CTCGACGCGA TGGTCGGCTA CCGCGACGCC CGGCACGGCC GGTTCGGCTG GGCCGTGGCC
CGGACCGACG ACGCGGCGAA CCTGCTGCCG GCCCGGATGT GCGCGCTGCT CACCTGCGCC
TGCGCCCCGG TCGTCGGCGG TTCCCCGGCG GAGGCGTTCC GGGTGATGCG CCGCGACGGC
CGATCGCATC CGAGCCCGAA CGCGGGCGTG GTCGAGGCCG CGTTCGCCGG TGCGCTCGGC
CTGCGTCTCG GTGGTGAGCT GCGGTACCCG CACGGGGTCG AGCACCGTCC CGAGCTGGGG
TCCGGGCGTC CGGCCGAAGC CGGCGACCTG GCGGCGGCGG CCCGTCTTTC CGGCGCGGTC
AGCGCGGCCT CGGTCGTGGT GTGCGCCGGT GCGGTCGCCG CCCTCGACAC GTTGCGGGCG
CGCCGGGCGG GCTCCCGCGG CCCGCGGGAG GCGGGATGA
 
Protein sequence
MSRARTVGLV LGAALDAVLA DPARGHPVAG FGRVAGGLER AVHRDSRLVG AAYATVLVAG 
TGAAAAGAER ALAGRHPGRR AALARAGLTA ATTWTVLGGT SLRRQGRALG GELERRDLRA
ARTRLPSLCG RDPSALDAGG LARAGVESVA ENTSDAVVAP LLWGAVAGLP GLVAYRAANT
LDAMVGYRDA RHGRFGWAVA RTDDAANLLP ARMCALLTCA CAPVVGGSPA EAFRVMRRDG
RSHPSPNAGV VEAAFAGALG LRLGGELRYP HGVEHRPELG SGRPAEAGDL AAAARLSGAV
SAASVVVCAG AVAALDTLRA RRAGSRGPRE AG