Gene Shewmr4_1882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1882 
Symbol 
ID4252456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2241762 
End bp2243459 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content51% 
IMG OID638118493 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_734013 
Protein GI113970220 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCTG CTAAAGATAT TAGCCAGGTG GCGCCTAAAC CTATCGTCTG GACCATTGCG 
GGTTCAGACA GCGGTGGCGG CGCGGGGATC CAAGCCGACT TAGCCACCAT CAAGGATTTG
GGCGGCCATG GATGCAGCGT GATCACCACG TTAACGGCCC AAAGTTCGGT GGCGGTGGAG
TTAGTTGAGC CTGTGAGTGA GGCAATGTTA CTCACACAGC TCTCGACCCT GTTAGCCGAT
CTACCGCCTC AGGCGATTAA GATTGGTTTA CTCGCAAATC AGCAGCAACT GCACTTAGTT
GCCGACTGGT TGGCTGGTTT TAAAACCCAA TTTCCACTCG TCCCCGTTAT TCTCGACCCT
GTGATGGTTG CAAGCTGTGG TGATGAGTTA GGTGATAAGA GCACTGCAAG TAAACCTCTG
GATTTTACTC CCTTTAAGGG CTTGATTAGC CTGATAACGC CGAATGTGCA GGAGTTGGCA
AGGCTAACTG CCGCCACAGA CAAACAAGCG TCAGCAATAC ACACAAAAGC AGAATTCGCC
GCTGCGGCAA TGCAACTCTC AGCGCAATTA GCGTGCAGCG TATTAGCCAA GGGCGGCGAT
ATTGATTTTG CGGCTCAAGC AAGTGATGGC ATTAACACCA GTGATCTCTT AAGCGATCAC
AAAAGCGATA TCACAAGTCA TCATATCGCC ACAGATAATC AGCGTTTGGC GGAAGATTTG
TTGATTTGTC ATCAGGTTAC TGGCTGTTCA CCGCTTGACG CTAATGGCGG TTTTTGGCTG
AGCAGTGTGC GGATAAACAC GCGCCATAAC CACGGCAGTG GTTGTACTCT GTCTTCGGCC
ATCGCCTCGG TGTTGGCCTC TGGCTTTGTA TTGCAGGATG CGGTTGTGGT AGCAAAAGCC
TATGTCAATC AAGGCTTAAC TTATGCAGAG GGGATTGGCC AAGGCCCAGG GCCACTGGCG
CGTACCGCTT GGCCGCACAA TTTAACGGCG TATCCTCACG TCACTGCTTA TTCTCAAAAC
AGCTTGAGTG AATCCAGTGA TTTGCAATGC GGCGCGTTTA ATCGCCTTGA TCCTGACTTA
GGGATTTATC CAGTTGTTGA TAACTTACTG TTACTCGACC AGTTATTGGC GGCAGGCGTG
AAGACGGTAC AGCTCAGGAT AAAGTCTAAT GCGCTAAAGT CTAATGTGCT GGCGTCGGAC
GAACTTGAGG CACAAATCCA AACCGCGATT GCCTTAGGTA AGCATTATGA TGCGCAGCTT
TTTATCAATG ATCATTGGCA GTTAGCGATA AAGCATGGCG CATTTGGGAT CCATCTCGGC
CAAGAAGATC TGGCGGTAAC GGATCTTAAC GCCATTCATG CAGCAGGACT GGCGCTTGGC
ATCTCGAGCC ACGGTTATTT CGAGTTGCTG CGTGCCCATC AACATGCGCC ATCGTACATC
GCCCTTGGGC ATATCTTCCC GACGACCACC AAGCAAATGC CATCGGCGCC GCAGGGATTA
TGTAAACTCA CTCATTATGT TGAGCTGTTA AATGCGCACT ATCCCTTAGT GGCAATTGGC
GGCATAGGAC CTTCGAATCT TGACCAAGTC AAAGCGACTG GGGTGAGCAA TATTGCCGTG
GTGCGGGCGA TTACCGAAGC AAATGATCCA GTAATGGCCT ATGCCGAATT GACTCGGGCT
TGGGAGTCAA GCCTATGA
 
Protein sequence
MTPAKDISQV APKPIVWTIA GSDSGGGAGI QADLATIKDL GGHGCSVITT LTAQSSVAVE 
LVEPVSEAML LTQLSTLLAD LPPQAIKIGL LANQQQLHLV ADWLAGFKTQ FPLVPVILDP
VMVASCGDEL GDKSTASKPL DFTPFKGLIS LITPNVQELA RLTAATDKQA SAIHTKAEFA
AAAMQLSAQL ACSVLAKGGD IDFAAQASDG INTSDLLSDH KSDITSHHIA TDNQRLAEDL
LICHQVTGCS PLDANGGFWL SSVRINTRHN HGSGCTLSSA IASVLASGFV LQDAVVVAKA
YVNQGLTYAE GIGQGPGPLA RTAWPHNLTA YPHVTAYSQN SLSESSDLQC GAFNRLDPDL
GIYPVVDNLL LLDQLLAAGV KTVQLRIKSN ALKSNVLASD ELEAQIQTAI ALGKHYDAQL
FINDHWQLAI KHGAFGIHLG QEDLAVTDLN AIHAAGLALG ISSHGYFELL RAHQHAPSYI
ALGHIFPTTT KQMPSAPQGL CKLTHYVELL NAHYPLVAIG GIGPSNLDQV KATGVSNIAV
VRAITEANDP VMAYAELTRA WESSL