Gene Nham_2896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2896 
Symbol 
ID4033202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp3188833 
End bp3190764 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content64% 
IMG OID637971342 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_578124 
Protein GI92118395 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.60996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCC GCTCCAATCC GGACACCACG CGCCCCGCCG TCACCACCGG CGCCCTGCCC 
TCCTCCCGCA AGATTTTCTC GGTGCCCGAG GCCGCACCCG ACCTGCTCGT GCCGCTGCGC
GAGATCGTCC TGAGCGAAGG CGCGGGCGAA CCGAACCTGC CGGTCTACGA CACATCGGGA
CCCTATACCG ACCCGGACGT CACCATCGAC GTCAATGCCG GTCTGCCGCG CACGCGCCTT
GCCTGGGTGA AGGAACGCGG CGGCGTCGAG GAATACGACG GCCGCGTCAT CAAGCCCGAG
GACAACGGCA ATGTCGGCGC GTCCCACGCC GCGACCGCGT TCAAGGCGCA TCACAAGCCG
CTGCGTGGTG TCGGCGATGC GCCGATCACC CAACTCGAAT TCGCACGCGC AGGCATCATC
ACCAAGGAGA TGATCTACGT CGCCGAGCGC GAGAACATCG GGCGCAAGCA GCAACTCGAA
CGCGCCGAAG CCGCGCTGGC CGACGGCGAA AGCTTCGGCG CTGCGGTGCC CACCTTCATC
ACGCCGGAAT TCGTGCGCGA GGAGATCGCG CGCGGCCGCG CCATCATCCC GGCCAACATC
AACCACGCCG AGCTTGAGCC GATGATCATC GGCCGTAATT TTCTGGTGAA GATCAACGCC
AACATCGGCA ACAGCGCGGT GACCTCCTCC GTCGAGGAGG AGGTGGATAA GATGGTGTGG
GCGATCCGCT GGGGCGCCGA CACCGTGATG GACCTCTCGA CGGGACGCAA CATCCACACC
ACGCGCGAAT GGATCTTGCG CAATTCGCCA GTACCTATTG GCACCGTGCC GATCTATCAG
GCACTGGAGA AGTGCGACGG CGATCCGGTC AAGCTGACGT GGGAGCTTTA CCGCGACACG
CTGGTGGAGC AGTGCGAACA GGGCGTCGAT TACTTCACCA TCCACGCCGG CGTGCGCCTG
CCCTACATCC ACCTCACCGC CGACCGCGTC ACCGGCATCG TCTCGCGCGG CGGATCGATC
ATGGCGAAGT GGTGCCTCGC CCACCACAAG GAGAGCTTCC TCTACACCCA CTTCGAGGAG
ATCTGCGATC TCATGCGCAA GTATGACGTG TCGTTCTCGC TGGGCGACGG CCTGCGCCCG
GGCTCGATCG CCGACGCCAA CGACCGCGCC CAGTTCGCGG AACTGGAAAC GCTCGGCGAG
CTGACGCAGA TCGCGTGGAA CAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGCCACGTG
CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTCA AAGAGTGCGG CGAAGCGCCG
TTCTATACGC TGGGCCCGCT GACCACCGAC ATCGCGCCGG GCTACGACCA CATCACATCA
GGCATCGGCG CCGCCATGAT CGGCTGGTTC GGCTGCGCGA TGCTCTGTTA TGTTACGCCG
AAGGAGCATC TCGGGCTGCC GAACCGCGAC GACGTCAAGA CCGGCGTCAT CACCTATCGC
GTCGCGGCGC ACGCCGCCGA TCTTGCCAAG GGCCATCCGG CTGCGCAACT GCGCGATGAC
GCACTGAGCC GCGCGCGGTT CGATTTCCGC TGGCAGGATC AGTTCAACCT CGGCCTCGAT
CCGGAGACGG CGGTGGCCTT CCATGACGAG ACGCTGCCGA AGGAGGCGCA CAAGGTCGCG
CATTTCTGCT CGATGTGCGG CCCGAAATTC TGCTCGATGA AGATCACGCA GGACGTGCGC
GACTACGCCG CCACACTGGG CGACAACGAG AAGGCGGCGC TTTATCCCGA GGGCAGCAAA
CTCGCCAGCG GCATGACGAT GAAAGGCGTC ATTGAAGACG GCATGACGCA GATGAGCGAG
AAGTTCAAGG AGATGGGCGG ACAGGTTTAT GTCGAAGCGG AAGCCGTGAA GGAAAGCAAC
AAGGTGTTGT GA
 
Protein sequence
MNIRSNPDTT RPAVTTGALP SSRKIFSVPE AAPDLLVPLR EIVLSEGAGE PNLPVYDTSG 
PYTDPDVTID VNAGLPRTRL AWVKERGGVE EYDGRVIKPE DNGNVGASHA ATAFKAHHKP
LRGVGDAPIT QLEFARAGII TKEMIYVAER ENIGRKQQLE RAEAALADGE SFGAAVPTFI
TPEFVREEIA RGRAIIPANI NHAELEPMII GRNFLVKINA NIGNSAVTSS VEEEVDKMVW
AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCDGDPV KLTWELYRDT
LVEQCEQGVD YFTIHAGVRL PYIHLTADRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFEE
ICDLMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTQIAWNKGC QVMIEGPGHV
PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPNRD DVKTGVITYR VAAHAADLAK GHPAAQLRDD ALSRARFDFR WQDQFNLGLD
PETAVAFHDE TLPKEAHKVA HFCSMCGPKF CSMKITQDVR DYAATLGDNE KAALYPEGSK
LASGMTMKGV IEDGMTQMSE KFKEMGGQVY VEAEAVKESN KVL