Gene GSU1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1607 
SymbolglyA 
ID2685603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1757872 
End bp1759119 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content58% 
IMG OID637126287 
Productserine hydroxymethyltransferase 
Protein accessionNP_952658 
Protein GI39996707 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.203798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTC TCGAAACCTT TGACCCGCAG GTAGCTGAGG CGATCCGCCA TGAAACCGAA 
CGGCAGGAGT ACAACCTGGA GTTGATCGCT TCCGAAAACT TTGTTTCCGA GGCGGTACTG
GAAGCCCAGG GCTCGGTGAT GACCAATAAG TATGCCGAGG GATACCCCGG TAAGCGCTAC
TATGGTGGAT GCCACCATGT GGACGTGGTG GAAAATCTCG CTATTGAGCG GGCCAAGGAG
CTTTTCGGTG CCGATCATGC CAACGTCCAG CCCCATTCGG GCTCCCAGGC AAATATGGCG
GTCTATTTTT CGGTGCTCAA GCCCGGCGAC ACCATTCTTG GGATGAATCT GTCCCACGGC
GGCCACCTGA CCCACGGCAG CCCCGTGAAC TTCTCCGGCC GTTTCTTCAA CGTGGTTCCC
TACGGCGTGT CCCAGGAGAC CGAAACGATC GACTTCAATG AGGTGGAGCG TCTTGCCCTT
GAGCATAAGC CGAAGATGAT AGTTGTGGGG GCAAGCGCCT ATCCCCGAAC CATCGATTTT
GCCGCCTTCC GCATCATTGC CGATAAGGTC GGCGCGGTTA TCATGGTTGA TATGGCTCAC
ATTGCGGGCC TGGTTGCGGC CGGTCTCCAT CCGAGCCCTG TTCCCTACGC TGAATTCGTG
ACCACCACTA CCCATAAGAC CCTCAGAGGT CCCCGCGGCG GGATGATCCT GTGCCGTGAG
GAGTACGCCA AGACGCTCAA TTCCAACATC TTCCCCGGTA TCCAGGGGGG GCCGCTCATG
CATGTCATCG CGGCCAAGGC CGTTGCCCTC AAGGAGGCCC TCCAGCCCGA GTTCAAAGCG
TATCAGGCCC AGATCGTGAA AAATGCCAAG GCCCTTGCCG ACGAGCTGGT AAAGCGCGGG
TTCCGGCTTG TGTCCGGCGG CACCGATAAC CATCTGATGC TGGTTAACCT AACCGGCACC
GAACTGACCG GCAAGGTGGC GGAAGAGTCT CTGGATAAGG CCGGCATCAC GGTGAACAAG
AACACGGTGC CTTTCGAGAC CCGTTCACCC TTTGTCACCT CCGGTTTCCG GATCGGCACT
CCCGCAGCCA CTACCCACGG TCTCAAGGAA GCTGAAATGG CCGACGTGGC GGGCTTTATC
GCAGAGGCCC TGGCCAACGT GGACAATGAT GCCAAACTCG CCGAGATTAA GGGGAGGGTC
AATGTGCTTA TGAAACGCTT CCCCCTCTAT GCTCACCGTC TTTCATAA
 
Protein sequence
MSILETFDPQ VAEAIRHETE RQEYNLELIA SENFVSEAVL EAQGSVMTNK YAEGYPGKRY 
YGGCHHVDVV ENLAIERAKE LFGADHANVQ PHSGSQANMA VYFSVLKPGD TILGMNLSHG
GHLTHGSPVN FSGRFFNVVP YGVSQETETI DFNEVERLAL EHKPKMIVVG ASAYPRTIDF
AAFRIIADKV GAVIMVDMAH IAGLVAAGLH PSPVPYAEFV TTTTHKTLRG PRGGMILCRE
EYAKTLNSNI FPGIQGGPLM HVIAAKAVAL KEALQPEFKA YQAQIVKNAK ALADELVKRG
FRLVSGGTDN HLMLVNLTGT ELTGKVAEES LDKAGITVNK NTVPFETRSP FVTSGFRIGT
PAATTHGLKE AEMADVAGFI AEALANVDND AKLAEIKGRV NVLMKRFPLY AHRLS