Gene Shewana3_1937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1937 
Symbol 
ID4479734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2304011 
End bp2305693 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content51% 
IMG OID639726519 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_869574 
Protein GI117920382 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCTG CTAAACATGT TAGCCACTTG GCGCCTAAAC CTATCGTCTG GACCATTGCG 
GGTTCAGACA GCGGTGGCGG CGCGGGGATC CAAGCCGACT TAGCCACCAT CAAGGATTTG
GGCGGTCATG GATGCAGCGT GATCACCACG TTAACGGCCC AAAGTTCGGT GGCGGTGGAT
TTAGTTGAGC CTGTGAGTGA GGCAATGCTA CTCACACAGC TCTCGACCCT GTTAGCCGAT
CTACCGCCTC AGGCGATTAA AATTGGCTTA CTCGCGAATC AGCAGCAACT GCATTTAGTT
GCCGATTGGT TGGCTGGCTT TAAAACCCAG TTTCCACTCG TCCCTGTTAT TCTCGACCCT
GTAATGGTTG CAAGCTGTGG GGATGAACTT GGGGATAAGA GCACGGCGAG CCAGCCACTG
GATTTTACGC CCTTTAAGGG CTTGATTAGC CTGATAACGC CGAATGTGCA GGAGTTGGCA
AAGCTTACTG TCACAACTGA CAAACAAGTG TCACCAATAC ACACAAAAGC AGCGTTCGCT
GCTGCGGCAA TGCAACTCTC TGAACAATTA GACTGCAGCG TATTAGCCAA GGGCGGCGAT
GTTGATTTTG CGGCTCAGGC AAGCGTAGGC ATTAATACAA GGGATAGCTT AAGCGGTCAC
AAAAGCGATA TCACAAGTCA TCGCAGTGCC ACAGATAATC AGCGTATGGC GGAAGATTTG
TTGATTTGTC ATCAGGTTAC AGGTTGCACA CCGCTCGACG CTAATGGCTG CTTTTGGCTC
AGCAGTGCGC GGATTAACAC GCGCCATAAC CACGGCAGTG GCTGTACTCT GTCTTCGGCC
ATCGCCTCGG TGTTAGCCTT TGGCTTTGTA TTGCAGGATG CCGTTGTGGT AGCAAAAGCC
TATGTCAATC AAGGCTTAAC TTATGCAGGG GGGATTGGCC AAGGCCCAGG GCCACTCGCG
CGTACCGCTT GGCCGCACAA TTTGACGGCG TATCCTCATA TTACTGCTTA TTCTGAAAAC
AGCTTGAGTG AATCCAGTGA TGTGCAATGC GGCGCGTTTA AGCGCCTTGA GCCTGACTTA
GGAATCTATC CTGTCGTCGA TAACCTACTG TTACTCGAAC AGTTATTAGC GGCAGGCGTG
AAGACGGTAC AGCTCAGGAT AAAGTCTAAT GCGCTAAAGT CGGACGAACT TGAGGCTCAA
ATCCAAACCG CGATTGCCTT AGGTAAACAC TATGAAGCGC AGCTTTTTAT CAATGATCAT
TGGCAGTTAG CGATAAAACA TGGCGCATTT GGGGTGCATC TTGGCCAAGA AGATCTGGCA
GTGACGGATC TTAATGCCAT TCATGCAGCA GGATTGGCGC TAGGCATCTC GAGCCACGGT
TATTTCGAGT TGCTGCGTGC CCATCAACAT GCGCCATCTT ACATCGCCCT CGGGCATATT
TTCCCAACGA CCACCAAGCA AATGCCATCG GCGCCGCAGG GATTATTTAA ACTCACTCAT
TATGTTGAGC TGTTAAATAC ACACTATCCC TTAGTGGCAA TTGGTGGCAT AGGACCTTCG
AATCTTTTGC TAGTCAAAGC GACTGGGGTG AGCAATATTG CCGTGGTGCG GGCGATTACC
GAGGCTAATG ATCCAGTAAT GGCCCTTGCC GAATTGACGC GAGCCTGGGA GTCAAGCCTA
TGA
 
Protein sequence
MTSAKHVSHL APKPIVWTIA GSDSGGGAGI QADLATIKDL GGHGCSVITT LTAQSSVAVD 
LVEPVSEAML LTQLSTLLAD LPPQAIKIGL LANQQQLHLV ADWLAGFKTQ FPLVPVILDP
VMVASCGDEL GDKSTASQPL DFTPFKGLIS LITPNVQELA KLTVTTDKQV SPIHTKAAFA
AAAMQLSEQL DCSVLAKGGD VDFAAQASVG INTRDSLSGH KSDITSHRSA TDNQRMAEDL
LICHQVTGCT PLDANGCFWL SSARINTRHN HGSGCTLSSA IASVLAFGFV LQDAVVVAKA
YVNQGLTYAG GIGQGPGPLA RTAWPHNLTA YPHITAYSEN SLSESSDVQC GAFKRLEPDL
GIYPVVDNLL LLEQLLAAGV KTVQLRIKSN ALKSDELEAQ IQTAIALGKH YEAQLFINDH
WQLAIKHGAF GVHLGQEDLA VTDLNAIHAA GLALGISSHG YFELLRAHQH APSYIALGHI
FPTTTKQMPS APQGLFKLTH YVELLNTHYP LVAIGGIGPS NLLLVKATGV SNIAVVRAIT
EANDPVMALA ELTRAWESSL