Gene Nwi_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2468 
Symbol 
ID3674685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2690618 
End bp2692558 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content63% 
IMG OID637714034 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_319073 
Protein GI75676652 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC GCTCCAATCC GGACACCACG CGCCCCGCCG TCACCACCGG TGCCCTGCCC 
TCGTCCCGCA AGATGTTCTC CGCGCCCGAC GCCGCGCCCG ATCTGCGGGT GCCGCTGCGT
GAGATCCTCC TGTCCGAGGG CGCGGGCGAG CCGAACCTGC CGGTCTATGA CACCTCCGGT
CCGTATACCG ATCCGAACGT CATCATTGAT GTGAATGCCG GGCTGCCGCG CACGCGTCTC
GCCTGGGTGA AGGAACGCGG CGGCGTCGAG GAATATGACG GCCGAGAGAT CAAGCCGGAG
GACAACGGCA ATGTCGGCGC AAGCCACGCC GCGGCGGCGT TCAAGGCGCA CCACAAGCCG
CTGAGGGGCA TCGGCGATGC GCCGATCACA CAGCTGGAGT TCGCCCGCGC CGGCATCATC
ACCAAGGAAA TGATCTATGT GGCCGAGCGC GAGAATCTCG GACGCAAGAA GCAACTCGAA
CGCGCCGAAG CCGCGCTGGC CGACGGTGAA GCCTTCGGCG CCTCCGTACC CGCCTTCATC
ACGCCGGAAT TCGTGCGCGA GGAGATCGCG CGCGGCCGTG CCATCATTCC TTCCAACATA
AACCACGCCG AACTGGAGCC GATGATCATC GGCCGCAATT TCCTGGTGAA GATCAACGCC
AATATCGGCA ACTCGGCCGT GACCTCCTCG GTTGAAGAAG AGGTGGACAA GATGGTGTGG
GCGATCCGCT GGGGCGCCGA CACGGTGATG GACCTCTCGA CCGGCCGCAA CATTCACACC
ACGCGCGAAT GGATTTTGCG CAACTCGCCC GTTCCGATCG GAACGGTGCC GATCTATCAG
GCGCTGGAGA AATGCGACGG CGACCCGGTG AAACTGACGT GGGAGCTTTA TCGCGACACG
CTGGTGGAGC AGTGCGAACA GGGCGTCGAT TACTTCACCA TCCATGCCGG CGTACGGCTG
CCCTACATCC ACCTCACCGC CGACCGCGTC ACCGGCATCG TCTCGCGCGG CGGCTCGATC
ATGGCGAAGT GGTGCCTCGC CCACCACAAG GAGAGCTTCC TCTATACGCA CTTCGAGGAA
ATCTGCGACC TCATGCGCAA GTATGACGTG TCGTTCTCGC TCGGTGACGG CCTGCGCCCC
GGCTCGATCG CGGACGCCAA CGACCGCGCG CAGTTCGCCG AACTGGAAAC GCTCGGCGAA
CTCACGCAGA TCGCATGGAA GAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGCCACGTG
CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTGA AAGAATGCGG CGAAGCGCCG
TTCTATACGC TAGGGCCGCT GACCACCGAC ATCGCGCCTG GCTATGACCA CATCACCTCG
GGCATCGGCG CCGCCATGAT CGGCTGGTTC GGCTGCGCGA TGCTCTGCTA CGTGACGCCG
AAGGAGCATC TCGGCCTGCC CAACCGCGAC GACGTGAAGA CCGGCGTGAT TACCTACAAG
ATCGCGGCGC ACGCCGCCGA CCTCGCCAAG GGCCATCCGG CCGCGCAACT GCGCGATGAC
GCATTGAGCC GCGCGCGGTT CGACTTCCGC TGGCAGGACC AATTCAACCT CGGTCTCGAT
CCTGACACCG CGGTCGCCTT CCACGACGAG ACGCTGCCGA AGGACGCGCA TAAGGTCGCG
CACTTCTGTT CGATGTGCGG ACCGAAATTC TGCTCGATGA AGATCACGCA GGATGTGCGC
GACTACGCTG CGACGCTCGG CGATAACGAG AAGGCCGCGC TCTATCCGGA CACCGCACCA
AAGGCGAATG ACGCCGCAGG CGTCCCCGAG CCGTCTTATC GCGACCAGGG CATGAAAGAG
ATGAGCGCGA GGTTCAAGGA GATGGGAGGT AACGTGTATC TAGATGCCGA GAAGGTGAAG
GAGAGTAATC GGGTGTTGTG A
 
Protein sequence
MNIRSNPDTT RPAVTTGALP SSRKMFSAPD AAPDLRVPLR EILLSEGAGE PNLPVYDTSG 
PYTDPNVIID VNAGLPRTRL AWVKERGGVE EYDGREIKPE DNGNVGASHA AAAFKAHHKP
LRGIGDAPIT QLEFARAGII TKEMIYVAER ENLGRKKQLE RAEAALADGE AFGASVPAFI
TPEFVREEIA RGRAIIPSNI NHAELEPMII GRNFLVKINA NIGNSAVTSS VEEEVDKMVW
AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCDGDPV KLTWELYRDT
LVEQCEQGVD YFTIHAGVRL PYIHLTADRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFEE
ICDLMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTQIAWKKGC QVMIEGPGHV
PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPNRD DVKTGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFDFR WQDQFNLGLD
PDTAVAFHDE TLPKDAHKVA HFCSMCGPKF CSMKITQDVR DYAATLGDNE KAALYPDTAP
KANDAAGVPE PSYRDQGMKE MSARFKEMGG NVYLDAEKVK ESNRVL