Gene Ava_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1053 
SymbolcbiD 
ID3678605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1281999 
End bp1283111 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content45% 
IMG OID637716389 
Productcobalt-precorrin-6A synthase 
Protein accessionYP_321572 
Protein GI75907276 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1903] Cobalamin biosynthesis protein CbiD 
TIGRFAM ID[TIGR00312] cobalamin biosynthesis protein CbiD 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.301385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000449274 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTAGCG GATATACTTT ACCTGTTTTT GCTTGTGCTG GTGCGATCGC TGCTTTACAC 
TGGTTACGCC AGCGTCAGTC TCTACAAGTT GGGTTGGTAG ATTTAATTGA GCCTGCACAA
ATGGCTGAAG TACCTATTGA GCAGGTGGCA GGATTATCAG AAAATATGGC CTTGGCAATT
ACCCGCAGTG ATCCTGGTGA TAATATTGAC TTGACTAAAA ACACCCCGAT TTGGGCTGTG
GTGGAATGGG GCCAAGGAGG GGGTGAACAG GTAACTATCA AGGGCGGGGA AGGGATTGGT
AAGCAGGTTA ATGCTGATAA CCGTGCGGCT ATCTATAGCT ATGCTCAAAG ATTGTTGCAA
GCCAACTTAA CTCGATTATT AGCCCCAGAG GAAAGCATTA TCGTCACTAT CATTTTACCG
GAGGGGCGAT CGCTCGCTGT TCGGACTTCT AATTCCGCCT TTGGGGTTGT CGAGGGATTA
TCCCTACTAG GAACCACAGG TATTTCTCAA CCTTTGAGTT CACCAGATCA GTTAGACGCG
TTTCGTAGCG AATTGCAACA CAAAGCTAGT TTGTATGCAA GTCTGGTATT CTGCATTGGC
GAGAATGGTT TAGATTTGGC GCGAAAAATC GGTATTAATG CTGAGAAATT AGTAAAAACT
GCTAATTGGT TAGGGCCGAT GTTGGTAGAA GCTGAGGCCT TGGGTGTTAA GGAAATCTTA
TTGTTTGGCT ATCATGGCAA GTTGATGAAA CTAGCCGGGG GCATTTTTCA CACCCACCAC
CACTTGGCTG ATGGACGACG GGAAGTTTTG GCAACACACT GTGCTTTGGG GGGTTTAAGT
AAACAAGATA TAGAAATAGT GTTTCACGCC CCAACGGCTG AAGCTGCACT CAAGCACTTA
AAAGCGTTAG ATAGTTCCAC AGGTAGTGAT TGGGTAAATC AAGTTTATAG TGCGATCGCC
GAAACTATCG ATTCTCGTTG CCAAGAATAT ATGCAAAGCC ATAGCAGCAG AGGCACAGCA
GCCACAATCT GCGGCTCAAT TCTCTTTGAC CGCGATCGCA AAATTATCGT GAAGAGCAAA
ACTGCTTGTA ACTTAATGGG AAATTTATGT TAA
 
Protein sequence
MRSGYTLPVF ACAGAIAALH WLRQRQSLQV GLVDLIEPAQ MAEVPIEQVA GLSENMALAI 
TRSDPGDNID LTKNTPIWAV VEWGQGGGEQ VTIKGGEGIG KQVNADNRAA IYSYAQRLLQ
ANLTRLLAPE ESIIVTIILP EGRSLAVRTS NSAFGVVEGL SLLGTTGISQ PLSSPDQLDA
FRSELQHKAS LYASLVFCIG ENGLDLARKI GINAEKLVKT ANWLGPMLVE AEALGVKEIL
LFGYHGKLMK LAGGIFHTHH HLADGRREVL ATHCALGGLS KQDIEIVFHA PTAEAALKHL
KALDSSTGSD WVNQVYSAIA ETIDSRCQEY MQSHSSRGTA ATICGSILFD RDRKIIVKSK
TACNLMGNLC