Gene TM1040_3021 details

Gene Information

Locus tag: TM1040_3021
Symbol:
ID: 4076594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3188663
End bp: 3190258
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 58%
IMG OID: 638008350
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_615015
Protein GI: 99082861
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
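
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3188663
end_bp = 3190258
gene_length_bp = 1596
protein_length_aa = 531

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the terminal stop codon encodes no residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under that assumption the span is 1596 bp, which is 532 codons, or 531 residues plus one stop codon, in agreement with the listed gene and protein lengths.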


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.487247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCCCA AAGTACTCAT CTCCGACAAA CTCTCGGACG CCGCCGTTCA GATCTTCCGT 
GACCGCGGCA TTGACGTGGA TTTCCTACCC GATGTGGGCA AGGACAAGGA AAAGCTGGCT
GAGATCATTG GCCAATATGA TGGCCTCGCG ATCCGCTCCG CCACCAAAGT GACGCCGACC
ATTCTGGAAA ACGCCACCAA CCTAAAGGTG ATTGGCCGCG CCGGGATCGG CACCGACAAC
ATCGACAAGG AAGCCGCGTC GAAAAAGGGT GTGATCGTCA TGAACACTCC CTTCGGCAAC
ATGATCACCA CCGCCGAACA TGCGATTGCG ATGATGTTCG CCGTTGCGCG ACAGGTCCCC
GAGGCCTCTG CCTCGACCCA TGCCGGGAAA TGGGAAAAGT CCAAATTCAT GGGCGTTGAG
CTGACCGGCA AGACCCTCGG TGTTATTGGT GCGGGCAACA TTGGCGGCAT CGTCTGCGAT
CGCGCCCGTG GGCTGAAGAT GAAGGTTGTG GCCTATGATC CGTTCCTATC CGAGGAAAAG
GCCAAGAAGA TGCAGGTCGA AAAGGTCGAG CTGGACGAAC TGCTCGCGCG CGCTGATTTC
ATCACGCTTC ATGTTCCGCT GACCGAGCAG ACCAAGAATA TCCTGAGCCG CGAAAACATC
TCCAAGACCA AGAAAGGCGT GCGCATCATC AACTGTGCCC GTGGCGGTCT GGTGGATGAA
GAAGCCCTGG CCGAAGCCCT GACCTCCGGC CATGTGGCTG GCGCGGCATT TGATGTATTC
TCTGTGGAGC CTGCGAAGGA AAACCCGCTC TTCAACCTGC CCAACGTGGT CTGCACCCCG
CACCTCGGCG CCGCGACGAC CGAGGCGCAG GAAAACGTGG CGCTGCAAGT CGCAGATCAG
ATGGCGAACT ACCTCTTGAC CGGTGCCGTT GAAAACGCGC TCAACATGCC ATCGGTGACG
GCCGAGGAAG CCAAGATCAT GGGGCCGTGG ATCAAACTGG CCGATCATCT CGGGGCCTTT
GCGGGTCAGA TGACCGACGA GCCGATTACT GCGATCAATG TGCTCTACGA TGGTGTGGCA
GCGGATATGA ACCTCGACGC GCTGAACTGC TCGGTGGTTG CAGGCATCAT GAAAAAGGTG
AACCCGGATG TGAACATGGT CTCGGCACCC GTTGTGGCCA AAGAACGCGG CATCCAGATC
TCTACCACCA ATCAGGACAA GTCCGGCTCT TTTGATGGCT ATATCAAGGT GACTGTCGTC
ACGGCAAAAC GCGAGCGCTC TGTCGCAGGC ACCGTGTTCT CGGATGGCAA ACCGCGCTTC
ATCCAGATCA AGGGCATCAA TATCGATGCC GAAGTGGGCG CGCATATGCT CTACACCACC
AACCAGGACG TGCCGGGTAT CATCGGCACC TTGGGTCAGA CTTTGGGCGA TATGGACGTC
AACATCGCGA ACTTTACGCT TGGCCGAAAC GAGGTCGGCG GCGAAGCGAT CGCGCTCCTT
TATGTGGATG CGGAGGTCCC CGCAGACGCG CGTGCCAAGC TGGCAGAGAC CGGTTACTTC
ACCCAGATCA AGCCGCTGGC CTTTGACGTC GCCTGA
 
Protein sequence
MAPKVLISDK LSDAAVQIFR DRGIDVDFLP DVGKDKEKLA EIIGQYDGLA IRSATKVTPT 
ILENATNLKV IGRAGIGTDN IDKEAASKKG VIVMNTPFGN MITTAEHAIA MMFAVARQVP
EASASTHAGK WEKSKFMGVE LTGKTLGVIG AGNIGGIVCD RARGLKMKVV AYDPFLSEEK
AKKMQVEKVE LDELLARADF ITLHVPLTEQ TKNILSRENI SKTKKGVRII NCARGGLVDE
EALAEALTSG HVAGAAFDVF SVEPAKENPL FNLPNVVCTP HLGAATTEAQ ENVALQVADQ
MANYLLTGAV ENALNMPSVT AEEAKIMGPW IKLADHLGAF AGQMTDEPIT AINVLYDGVA
ADMNLDALNC SVVAGIMKKV NPDVNMVSAP VVAKERGIQI STTNQDKSGS FDGYIKVTVV
TAKRERSVAG TVFSDGKPRF IQIKGINIDA EVGAHMLYTT NQDVPGIIGT LGQTLGDMDV
NIANFTLGRN EVGGEAIALL YVDAEVPADA RAKLAETGYF TQIKPLAFDV A
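
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is an illustration only, not the tool used to produce this record: it builds the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code) and translates the first 30 bases and the final line of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the complete last line of the gene sequence.
first_blocks = "ATGGCCCCCA AAGTACTCAT CTCCGACAAA"
last_line = "ACCCAGATCA AGCCGCTGGC CTTTGACGTC GCCTGA"

print(translate(first_blocks))  # MAPKVLISDK
print(translate(last_line))     # TQIKPLAFDVA*
```

The two outputs correspond to the first ten and the last eleven residues of the protein sequence, followed by the stop codon TGA. The last line can be translated on its own because the 1560 bases preceding it are a multiple of three, so it begins in frame.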