Gene Nwi_0404 details


Gene Information

Locus tag: Nwi_0404
Symbol:
ID: 3676989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 450614
End bp: 452521
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 66%
IMG OID: 637711944
Product: cobalt chelatase, CobT subunit
Protein accession: YP_317023
Protein GI: 75674602
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase)
TIGRFAM ID: [TIGR01651] cobaltochelatase, CobT subunit
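
The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically related, and the relationship can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(450614, 452521)
print(length, protein_length_aa(length))
```

With the coordinates listed in the record, this reproduces the stated 1908 bp and 635 aa.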


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.167561
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.564584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG CATCACCCAG CAACAGGTTT CGTGCCTCGC CGAAGGAGGC GCCGACAGAG 
CCGTTCAAGC GCGCGGTCAC CTCCTGCCTG CGCGCGATCT CCCGGCGGCC GGAGCTTGAG
GTAACCTTTG CCATCGAGCG GCCCGGCCTG TCGCCAGGCA AGGCGCGGCT GCCCGAGCCC
GGACGCAAGA TGAGCCGGCG GGACGCGGCG ATCGTGCGCG GGCATGCGGA TTCCATCGCG
CTAAAGCTCG CGTGTCACGA TCCGAAGGTG CATCGCAAGC TCATGCCGGG CAATCCGCAG
GCGCGCGGCG TCTTCGACGC GGTCGAGCAG GCCCGCGTCG AGGCCATCGG CTCCCGCCGG
ATGACGGGCG TCGCCAGGAA TCTCACCGCG ATGCTCGATG ATCACTTCCA TCGCGGCAAG
TATGACGAGA TCACCGACCG GGCCGACGCG CCGTTGTCGG ATGCGCTGGC GATGATGGTG
CGCGAGCGGC TGACCGGCCT GCCGCCGCCC GCCGCGGCGT CCAGGCTGGT CGACCTGTGG
CGCCCGTTCC TGGAAGACAA GATCGGCGCC CGGCTGGACC AGCTCAGCCA TTTTACCGAG
GATCAGGCGA AGTTCGGCGA CCTCGTGCAC GATCTGCTGT CTGAGCTTGA TCTCGGTGAC
GACAGCCGCG CCGATACGGA GAAGGAGGAG AACGAGGACG ACAACCGCGA GGGCGAGAAC
GATCAGTCCG GTACGGAAGG CTCGCCCGAC AGCGAGGCCG CCCAGGAGAT GAGCGCGGAT
CAGGCGCAGG ACATGACCGA CGACATGCCT GACGGCGCCA TGGAAAGCGC GCAGGCCTCG
GTGAGCGACA CGTTTGACGA TGGCGACCTT GCCGAGGAGG AGATGCCTGG CGAGGCGACG
CGGCCCAATG CGGGCGGCGC CAACGAGCCG CGCGGCCCGG AATATCATGC GTTCGCGCCG
AAGTTCGATG AGGTGATCGC CGCCGAGGAT CTGTGCGACC ACGATGAACT GGAGCGGCTG
CGCAGCTATC TCGACAAGCA ACTCGCGCAC TTGCAGGGCA TCGTGGCGCG GCTCGCCAAT
CGTCTTCAGC GCCGCTTGAT GGCGCAACAG AATCGCGCCT GGGAGTTCGA TCTGGAAGAA
GGCATTCTCG ATCCGGCGCG GCTGTCGCGC GTGGTGATCG ACCCTTATCA GCCGCTCTCG
TTCAAGCACG AGAAGGAGGC GACGTTCCGC GATACCGTGG TGACGCTGCT GCTCGACAAT
TCCGGTTCGA TGCGGGGGCG GCCGATCACG GTGGCCGCGA CCTGCGCCGA CATTCTGGCG
CGCACGCTGG AACGCTGCGG CGTCAAGGTC GAAATTCTCG GCTTCACGAC GCGGGCCTGG
AAGGGCGGGC AGTCGCGCGA GGCGTGGCTC GCCGCCGGCA AGCCGGTCAC GCCGGGGCGG
CTCAACGATC TCCGCCATAT CATTTACAAG GCGGCCGACG CTCCCTGGCG GCGATCGCGG
AAAAACCTCG GACTGATGAT GCGCGAGGGC CTGCTCAAGG AAAACATCGA CGGCGAGGCG
CTCGACTGGG CGCATAAACG GCTGCTGGGC CGTTCCGAGC AGCGCAAGAT CCTGATGATG
ATTTCGGACG GTGCGCCGGT CGACGATTCG ACCCTGTCGG TCAATCCGGG AAATTATCTG
GAGCGTCACC TGCGATGGGT GATCGAGGAA ATCGAAACCA GATCCCCCGT CGAACTGATC
GCCATCGGCA TCGGTCACGA CGTGACGCGC TACTATCGCC GCGCGGTGAC CATCGTGGAT
GCCGAGGAAC TCGGCGGCGC CATGACCGAA CAACTCGCCG AACTGTTCAG CGAGACCCAT
GAGCCGCCGC CAGGTCCGGC ACGGCGGCCA CGCAAGCTGC ATTCGTAG
 
Protein sequence
MSTASPSNRF RASPKEAPTE PFKRAVTSCL RAISRRPELE VTFAIERPGL SPGKARLPEP 
GRKMSRRDAA IVRGHADSIA LKLACHDPKV HRKLMPGNPQ ARGVFDAVEQ ARVEAIGSRR
MTGVARNLTA MLDDHFHRGK YDEITDRADA PLSDALAMMV RERLTGLPPP AAASRLVDLW
RPFLEDKIGA RLDQLSHFTE DQAKFGDLVH DLLSELDLGD DSRADTEKEE NEDDNREGEN
DQSGTEGSPD SEAAQEMSAD QAQDMTDDMP DGAMESAQAS VSDTFDDGDL AEEEMPGEAT
RPNAGGANEP RGPEYHAFAP KFDEVIAAED LCDHDELERL RSYLDKQLAH LQGIVARLAN
RLQRRLMAQQ NRAWEFDLEE GILDPARLSR VVIDPYQPLS FKHEKEATFR DTVVTLLLDN
SGSMRGRPIT VAATCADILA RTLERCGVKV EILGFTTRAW KGGQSREAWL AAGKPVTPGR
LNDLRHIIYK AADAPWRRSR KNLGLMMREG LLKENIDGEA LDWAHKRLLG RSEQRKILMM
ISDGAPVDDS TLSVNPGNYL ERHLRWVIEE IETRSPVELI AIGIGHDVTR YYRRAVTIVD
AEELGGAMTE QLAELFSETH EPPPGPARRP RKLHS
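
The sequence blocks above are laid out in groups of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the CDS check is a simplification: it assumes an ATG start codon (as this gene has) and the three stop codons TAA, TAG and TGA used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First and last lines of the gene sequence block, copied from above.
first_line = "ATGAGCACCG CATCACCCAG CAACAGGTTT CGTGCCTCGC CGAAGGAGGC GCCGACAGAG"
last_line = "GAGCCGCCGC CAGGTCCGGC ACGGCGGCCA CGCAAGCTGC ATTCGTAG"

print(clean_sequence(first_line)[:3])   # start codon
print(clean_sequence(last_line)[-3:])   # stop codon
```

To check the full record, the whole gene sequence block can be passed through `clean_sequence` and then to `gc_percent` and `looks_like_complete_cds`.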