Gene GSU0605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0605 
Symbol 
ID2687129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp639288 
End bp640760 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID637125272 
Productthiamine-phosphate pyrophosphorylase/phosphomethylpyrimidine kinase 
Protein accessionNP_951663 
Protein GI39995712 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTCTA ACGGTCACAC GCTTCGGCTC GTCATAAACC GGGACAAGCA CGACTCGGTC 
ATCCGGGGGC TCTACCTGGT GACCGATCAC GACGACAACC TCATCCCGCG GGTTGAGGCG
GCCATCGACG GCGGCGCCCG AGTGGTCCAG TACCGCAACA AGAATCAGGA CCGGGAGAGC
CGGCTCGCCC TCGGCCTGGA GCTGCGGGAG CTGTGCCGCA GGCGGAGCAT CCCCTTCATC
GTGAACGACG ACCTGGAGAT GGCCGTGAGC CTCAAGGCCG ACGGGCTTCA CCTGGGCCAG
GGTGACGGCG ATCCCCGCGA AGCGCGACGC GTTCTCGGCC CCGGCAAGAT CATCGGCGTG
TCCACCCACA CCCTGAGCGA AGCCTTGGAG GCCCAGGCGG CCGGGGTGGA CTACATCGGC
CTCGGCGCCA TGTTCCCCTC CCGGAGTAAG GAGGTCGAGC ATGTGGCCGG ATCGGAGCTG
CTCGCGGCCA TCCGGAGTTC CATCAGCATC CCCATTGTCG CCATCGGCGG CATTACCCGC
GATAACGGCG CCAGCGTCAT TGATGCCGGT GCCGATGCCG TGGCAGTCAT ATCGGCGGTT
CTGTCCCATC CCGATCCGGC CCTGGCGGCC ACCGAGATAG CACTGCTCTT CAACCGGCGC
GCACCGTTTC CGCGCGGCTC CGTCCTCACC GTGGCCGGCA GCGATTCGGG GGGAGGCGCG
GGCATCCAGG CAGACCTGAA AACCGTCACC CTGCTCGGGA GCTATGGCTC GTCGGTCCTC
ACGGCCCTGA CAGCCCAGAA CACCCGGGGG GTCAGCGGCA TCCACGGCGT ACCGCCGGCG
TTCGTTGCCG ACCAGCTCGA CGCGGTCTTC TCCGACATTC CCGTGGATGT GGTCAAGACC
GGCATGCTCT TCTCGGCGGA AACCATCGTC GCAATCGCCG CGAAGCTTAC CGAGTACCGG
CGGCGAATGG TAGTGGTCGA TCCCGTTATG GTGGCCAAGG GGGGGGCGAA CCTGATCGAC
CGCGGAGCGG TAAGCGTGCT CAAGGAGCGG CTCTTTCCCC TCGCCTACCT CGTTACCCCC
AATATACCCG AGGCCGAGCG GCTCACCGGT GCAAACATCT CCGACGAAGA ATCGATGCGG
GAGGCGGCCC GCCGCCTGCA CCGGCTGGGG GCACGCAACG TTCTTCTCAA GGGCGGCCAC
CTGCTGGCCG GCGACTCGGT GGACATCCTC TTCGACGGGG CAGCCTTCCA CCGCTTCGTC
TCACCGCGAA TCCTCTCGAA AAACACCCAC GGCACCGGCT GTACCTTCGC CTCGGCCATT
GCCACCTATC TGGCCCAGGG CGACCCCCTG CGCGAAGCCA TCGCCCGGGC CAAACGTTAC
ATCACCGCTG CCATCCGCCT TGCCCAGCCC TTGGGACGCG GACACGGACC GGTCAACCAT
ATCCTCGCCG CGGAGGACGT CAGGGACCGG TGA
 
Protein sequence
MASNGHTLRL VINRDKHDSV IRGLYLVTDH DDNLIPRVEA AIDGGARVVQ YRNKNQDRES 
RLALGLELRE LCRRRSIPFI VNDDLEMAVS LKADGLHLGQ GDGDPREARR VLGPGKIIGV
STHTLSEALE AQAAGVDYIG LGAMFPSRSK EVEHVAGSEL LAAIRSSISI PIVAIGGITR
DNGASVIDAG ADAVAVISAV LSHPDPALAA TEIALLFNRR APFPRGSVLT VAGSDSGGGA
GIQADLKTVT LLGSYGSSVL TALTAQNTRG VSGIHGVPPA FVADQLDAVF SDIPVDVVKT
GMLFSAETIV AIAAKLTEYR RRMVVVDPVM VAKGGANLID RGAVSVLKER LFPLAYLVTP
NIPEAERLTG ANISDEESMR EAARRLHRLG ARNVLLKGGH LLAGDSVDIL FDGAAFHRFV
SPRILSKNTH GTGCTFASAI ATYLAQGDPL REAIARAKRY ITAAIRLAQP LGRGHGPVNH
ILAAEDVRDR