Gene GSU1695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1695 
SymbolthrC 
ID2685570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1853254 
End bp1854639 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content60% 
IMG OID637126376 
Productthreonine synthase 
Protein accessionNP_952746 
Protein GI39996795 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTACA TCAGCACCAG AGGAACCATT CAGCCCATTC GCTTCAAGGA CGCGGTCATG 
ATGGGGCTCG CCACCGATGG GGGGCTTCTT CTGCCGGAAA CAATACCGGC CATCGACCGG
GACACGCTCG CGGCATGGAA GACGTTGCCG TTCCAGGAAC TCGCCTTCCG GATCATCTCC
CGCTACGCCG ACGACATCCC GGCCGATGAC CTGCGGAGTC TCATTGAGCG CTCCTATGCA
ACCTTTGATC ATCCCGACGT GACGCCGGTC GTGGAGCGGG GCGGCCTCCA CATCCTCGAA
CTCTTCCACG GACCCACGCT TGCATTCAAG GACGTGGCTC TTCAGTTCCT GGGCAACCTC
TTCGAATATC TGCTGCAGGA GCGGAACGAA CGGATGAACA TCGTCGGCGC CACGTCCGGC
GACACGGGAA GTGCCGCCAT TTACGGCGTG CGGGGGAAGG AGAACATCAA CATCTTCATC
CTCCATCCCC ACGGCAAAAC CTCGCCGGTC CAGGCGCTAC AGATGACCAC GGTACTCGAT
CCCAACGTGC ACAACATTGC CGTGCGCGGC ACCTTTGACG ATTGCCAGAA CATCGTCAAG
AGCCTGTTCA ACGACCTCCC CTTCAAGGAA CGCTACGCTC TCGGCGCCGT CAACTCCATC
AACTGGGCCC GGGTACTGGC CCAGGTGGTC TATTACTTCC TCTCTTACTT CCGCGTGGCA
AAGACCATTG GCGACGAAGT TGTCTTCTCG GTTCCCACGG GCAACTTCGG CGATATTTTT
GCCGGCTACG TGGCCAAGCG AATGGGACTG CCCATTGCCA GACTGCTCCT GGCCACCAAC
GAAAACAACA TTCTCGCCCG CTTCATCAAT GACGGAGACT ATTCGCTGAG TGCCGTGGTG
CCCACTGTGT CGCCGTCCAT GGACATCCAG CTGGCTTCAA ACTTCGAACG CTATGTCTAC
TATCTCTTCG GAGAAGACCC TGCCAGGGTC CGCGAGGCAT TCGCCACGCT TCCCGCCCGG
GGCCGGATCG TCTTCTCCGA TGCCGAAATG GAGCACGTGC GCACCGAATT CCTTGCCTGT
TCGGTTAACC AGCAGGAGAC CGTCGACACC ATCGCCTCCT TCAACCGTGA AACCGGCTAC
CTCCTTGATC CCCACACGGC AGTGGGTGTC CGGGCTGCCC GGCAACTGGT GACCGACGGC
ACGCCGGTCA TCTGCCTTGC CACTGCTCAC CCGGCGAAGT TCGCCGATGC CGTGGTGCGC
GCAGTAGGGT TCGAACCGCC GCGTCCTCCA TCGCTCATGG GAATTGAAGA CCTGCCGAGC
CGGTGCGAGG TGCTTGACGC ACGTATCGAG CAGATCAGGA CCTTCATCGA GGAGAAGGCC
CGCTAA
 
Protein sequence
MRYISTRGTI QPIRFKDAVM MGLATDGGLL LPETIPAIDR DTLAAWKTLP FQELAFRIIS 
RYADDIPADD LRSLIERSYA TFDHPDVTPV VERGGLHILE LFHGPTLAFK DVALQFLGNL
FEYLLQERNE RMNIVGATSG DTGSAAIYGV RGKENINIFI LHPHGKTSPV QALQMTTVLD
PNVHNIAVRG TFDDCQNIVK SLFNDLPFKE RYALGAVNSI NWARVLAQVV YYFLSYFRVA
KTIGDEVVFS VPTGNFGDIF AGYVAKRMGL PIARLLLATN ENNILARFIN DGDYSLSAVV
PTVSPSMDIQ LASNFERYVY YLFGEDPARV REAFATLPAR GRIVFSDAEM EHVRTEFLAC
SVNQQETVDT IASFNRETGY LLDPHTAVGV RAARQLVTDG TPVICLATAH PAKFADAVVR
AVGFEPPRPP SLMGIEDLPS RCEVLDARIE QIRTFIEEKA R