Gene Shewmr7_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr7_2096 
Symbol 
ID4258363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-7 
KingdomBacteria 
Replicon accessionNC_008322 
Strand
Start bp2473470 
End bp2475167 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content51% 
IMG OID638122764 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_738141 
Protein GI114047591 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCAG CTAAACATAT TAGCCTGGTG GCGCCTAAAC CTATCGTCTG GACCATTGCC 
GGTTCAGACA GTGGTGGCGG CGCGGGGATC CAAGCCGACT TAGCCACCAT CAAGGATTTG
GGCGGCCATG GATGCAGCGT GATCACCACG TTAACGGCCC AAAGTTCGGT GGCGGTGGCG
TTAGTTGAGC CCGTGAGTGA GGCAATTTTA CTCACACAGC TCTCGACCCT GTTAGCCGAT
CTACCGCCTC AGGCGATTAA GATTGGTTTA CTCGCAAATC AGCAGCAAGT GCACTTAGTT
GCCGATTGGT TGGCTGGTTT TAAAACCCAA TTTCCGCTCG TCCCTGTTAT TCTCGACCCT
GTGATGGTTG CCAGCTGTGG TGATGAGTTA GGTGATAAGA GCACTGCAAG TAAACCTCTG
GATTTTACTC CCTTTAAGGG CTTGATTAGC CTGATAACGC CGAATGTGCA GGAGTTGGCA
AGGCTAACTG CCGCCACAGA CAAACAAGCG TCAGCAATAC ACACAAAAGC AGAATTCGCC
GCTGCGGCAA TGCAACTCTC AGCGCAATTA GCGTGCAGCG TATTAGCCAA GGGCGGCGAT
ATTGATTTTG CGGCTCAAGC AAGTGATGGC ATTAACACCA GTGATCTCTT AAGCGATCAC
AAAAGCGATA TCACAAGTCA TCCCATCGTC ACAGATAATC AGCGTCTTGC GGAAGATCTG
CTGATTTGTC ATCAGGTTAC TGGCTGCTCA CCGCTTGACG CTAATGGCGG TTTTTGGCTG
AGCAGTGCGC GGATAAACAC GCGCCATAAC CATGGCAGTG GTTGTACTCT GTCTTCGGCC
ATCGCCTCGG TGTTAGCCTT TGGCTTTGTA TTGCAGGATG CAGTTGTGGT AGCAAAAGCC
TATGTCAATC AAGGCTTAAC TCATGCAGTG GGTGTTGGCC AAGGCCCAGG GCCACTGGCG
CGTACCGCTT GGCCGCACAA TTTAACGGCG TATCCTCACG TCACTGCTTA TTCTCAAAAC
AGCTTGAGTG AATCCAGTGA TTTGCAATGC GGCGCGTTTA ATCGCCTTGA TCCTGACTTA
GGGATTTATC CAGTTGTTGA TAACTTACTG TTACTCGAGC AGTTATTGGC GGCAGGCGTG
AAGACGGTAC AGCTCAGGAT AAAGTCTAAT GCACTAAAGT CTAATGTGCT GGCGTCGGAC
GAACTTGAGG CACAAATCCA AACCGCGATT GCCTTAGGTA AGCATTATGA TGCGCAGCTT
TTTATCAATG ATCATTGGCA GTTAGCGATA AAGCATGGCG CATTTGGGAT CCATCTCGGC
CAAGAAGATC TGGCGGTAAC GGATCTTAAC GCCATTCATG CAGCAGGACT GGCTCTTGGC
ATCTCGAGCC ACGGTTATTT CGAGTTGCTG CGTGCCCATC AACATGCGCC ATCGTACATC
GCCCTTGGGC ATATCTTCCC GACGACCACC AAGCAAATGC CATCGGCGCC GCAGGGATTA
TGTAAACTCA CTCATTATGT TGAGCTGTTA AATGCGCACT ATCCCTTAGT GGCAATTGGC
GGCATAGGAC CTTCGAATCT TGACCAAGTC AAAGCGACTG GGGTGAGCAA TATTGCCGTG
GTGCGGGCGA TTACCGAAGC AAATGATCCA GTAATGGCCT ATGCCGAATT GACTCGGGCT
TGGGAGTCAA GCCTATGA
 
Protein sequence
MTPAKHISLV APKPIVWTIA GSDSGGGAGI QADLATIKDL GGHGCSVITT LTAQSSVAVA 
LVEPVSEAIL LTQLSTLLAD LPPQAIKIGL LANQQQVHLV ADWLAGFKTQ FPLVPVILDP
VMVASCGDEL GDKSTASKPL DFTPFKGLIS LITPNVQELA RLTAATDKQA SAIHTKAEFA
AAAMQLSAQL ACSVLAKGGD IDFAAQASDG INTSDLLSDH KSDITSHPIV TDNQRLAEDL
LICHQVTGCS PLDANGGFWL SSARINTRHN HGSGCTLSSA IASVLAFGFV LQDAVVVAKA
YVNQGLTHAV GVGQGPGPLA RTAWPHNLTA YPHVTAYSQN SLSESSDLQC GAFNRLDPDL
GIYPVVDNLL LLEQLLAAGV KTVQLRIKSN ALKSNVLASD ELEAQIQTAI ALGKHYDAQL
FINDHWQLAI KHGAFGIHLG QEDLAVTDLN AIHAAGLALG ISSHGYFELL RAHQHAPSYI
ALGHIFPTTT KQMPSAPQGL CKLTHYVELL NAHYPLVAIG GIGPSNLDQV KATGVSNIAV
VRAITEANDP VMAYAELTRA WESSL