Gene GSU1122 details


Gene Information

Locus tag: GSU1122
Symbol:
ID: 2686869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1206510
End bp: 1207637
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 59%
IMG OID: 637125791
Product: HD domain-containing protein
Protein accession: NP_952175
Protein GI: 39996224
COG category: [R] General function prediction only
COG ID: [COG3481] Predicted HD-superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
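
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1206510, 1207637)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.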


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCAAGA TTTTCATAGC GAGCATCCAT GACCGCGATC TGGTCGATTC CGTGTTTCTC 
GTGAAGGAAA AGATCATGGC CATGGCCAAG AACGGCAAGC CCTACATGAC GCTTCGGCTC
ATGGACAAAA GCGGCGAGAT AGAGGGGCGT GTCTGGGACA ACGTTGACCA GCTGTCGGCC
TCCTTCGATA AGGACGACTT CGTTGGCGTC CGCTCCAAGG CATCGGTCTA CCTGGGCAAA
ATGCAGCTTA TCATTTCCGA GCTGGTGCGG GTTCCCGAAG ACAGGGTCAA CCTGGCGGAC
TTTCTCCCCG AATCGGACCG CTCCATTGCC GAGATGGAGA GCGAGCTCAA GGCCCTGGTG
GAAACCTTTT CCGATCAGCA CCTGAAAGCG CTGATGAAGG CCTTTTTCGA CGATTCTTCC
TTCATGGAGC TCTACCGGAC CGCGCCGGCC GCCAAGGGGA TGCACCACGT CTATCTGGGT
GGACTGCTGG AGCACTCACT GGCCGTGTCC CGCCTGGTTG ACGCCATCGT CCCCCTCTAC
GCGGATCTCA ACCGCGATCT GCTGGTGGCG GGTGCCCTGT TGCACGACGT GGGCAAGGTG
CGGGAGATGA CGTACCTGCG TTCCTTCGAC TACACCGACG AGGGGAAACT CATCGGCCAT
ATCACCATCG GCGTGGAGAT GCTCCAGGAG CGGATTTCGA CCATTCCCGG CTTCCCGCCG
GAGCTGGGGA TGCTGCTCAA GCACATGCTG CTGTCCCACC ATGGTCAGTA CGAATACGGT
TCCCCCAAGC GCCCCAAGAC TGTCGAGGCA ACGATTCTCA ACTACCTGGA CGATCTGGAC
TCCAAGATCA ACGGGATCAG GACCCATATC CGCAAGGAAA GCGAAAACCT GGGGCGCTGG
ACCTCCTATC ACCGGCTCTA TGACCGCTAC TTCTACAAGG AGAGCTACAG CGGCGAGGAG
GAATACCGGG AAGGGGCGGA TGAGCTCATG GTGCTCGAGC CGGAGCCGGT ATCGCTGCCG
GCTGCCCCTC GGGCCGCGGA GGCCGAGCGC AAAAGCGGCA ACACCGCCCG AAAGGGGTTC
AGCAACAATC CGTTCGAGAC CCTGCAAAAG AATCTGGATC TGTTCTGA
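
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the whitespace used to group the sequence into blocks of ten and counts G and C bases. Running it on the full sequence gives a value that can be compared with the reported 59%; the demo here uses only the first line.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a short demo.
first_line = "TTGAGCAAGA TTTTCATAGC GAGCATCCAT GACCGCGATC TGGTCGATTC CGTGTTTCTC"
print(round(gc_content(first_line), 1))
```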
 
Protein sequence
MSKIFIASIH DRDLVDSVFL VKEKIMAMAK NGKPYMTLRL MDKSGEIEGR VWDNVDQLSA 
SFDKDDFVGV RSKASVYLGK MQLIISELVR VPEDRVNLAD FLPESDRSIA EMESELKALV
ETFSDQHLKA LMKAFFDDSS FMELYRTAPA AKGMHHVYLG GLLEHSLAVS RLVDAIVPLY
ADLNRDLLVA GALLHDVGKV REMTYLRSFD YTDEGKLIGH ITIGVEMLQE RISTIPGFPP
ELGMLLKHML LSHHGQYEYG SPKRPKTVEA TILNYLDDLD SKINGIRTHI RKESENLGRW
TSYHRLYDRY FYKESYSGEE EYREGADELM VLEPEPVSLP AAPRAAEAER KSGNTARKGF
SNNPFETLQK NLDLF
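
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below is a hand-written illustration under that assumption; `translate_cds` is a hypothetical helper, not part of any library, and the codon table is built from the standard amino acid assignments that table 11 shares with the standard code.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order (first base varies slowest, third fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the initiating codon as methionine."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above.
print(translate_cds("TTGAGCAAGA TTTTCATAGC GAGCATCCAT"))  # MSKIFIASIH
```

The output matches the first ten residues of the protein sequence listed above.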