Gene GSU1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1697 
Symbol 
ID2687393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1856595 
End bp1858529 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content61% 
IMG OID637126378 
Producthypothetical protein 
Protein accessionNP_952748 
Protein GI39996797 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCCG TTAAGACCGT TATCACCATT CTCGCCTACC TCGTGGCGGC TCTCGGGTAT 
CTCCCCGTCG TACTCTATGC GGACTGGCCA GCGCGACTCG CGGCTCCGCT GGCCTTGGCG
CTGGGCATTG CATTTGATCG CCGGGGACGG CATCCTCTGG CCGGCATTCC GGCAACACTA
TTCACGGTCG TTGCCTTTGC GGCGTACTTA CTCCAGTGGA GTCGCTCCAA TCCGGCAGCG
CCGGTCGTGA ACTTCCTCGT CATGCTTCTG TCGGTGCGGC TGATCAACGA AAAGAGTCCG
CGAAACCTTC TTCAGATATT TGCGCTTTCC CTTTTTCTCC TGGCAGGGTC ATCGCTCTTC
AGCCTGAGCG CCCTGTTTCT GTTCTATCTG ACGCTCCTGC TGGCCCTTAT CGCTATTGCG
CTCGTTCTTC TGGCCTTTCA TAGCGTCGAC GAAGGTATCA GTCTCACCTC CCGCGCACTC
AGGCGCGTGG TCACGGCGGC CCTTGCCATG CCCGCCGCGT CACTGCCGCT ACTGCTCGTT
TTTTTCGCCA TCCTGCCGCG GACCCAGTTT CCGCTCCTCA GTTTTCTGAA TGCCCCGGGG
GAGAAAACAA CAGGGCTCAG TGAGCGCGTG GAGCCGGGCA CGTCCTCCAG CGTAGCCGAT
GTCCACACCG TTGCGTTCAG GGCCGAATGC GAGCGGTTGG AGCGGAATGA GTTGTATTGG
CGGGGACTGG TCCTGGACAC GGCTACCCCG GCTGGATGGG TGCGCGGGAC GACCCCGACC
GCGGATTCGC CTGCCCAGCC ACAGGGAAGA GCCGTGCGTC AGGTGATCTA TCCCGAACCC
TCGCGCACCA CGTACCTGGT GGGGCTCAAC GTGCCGATTC GCATGGACGG AATCCGAAAC
AGGCAAGCCA ATGATTTTAC CTTTGTCAAT CGGGGACGTC CGGGGGGGCG GATCAGGTAT
GAGGTGCAGT CGGTGGTGTC CGACACTATT GCCGTCAGGG GGACGGTGGA CCGAAACCGC
TATTTGAAAC TGCCCGAACG GATCTCCGGC AGAACAATTG ACCTGGCACG CCGGTTGACT
GCCGGCGCAG CAAACGATGC GGCCAAGCTA GAAAGGATCG AGGCGTGGTT CCGTGATGCC
GGATTCTCTT ACGCTACCAC AGGGCTTCCG CTGTCGGACG ATCCGGTCGA TGCCTTTCTC
TTTGGCAGCA AAAGAGGACA CTGCGAGTTT TTTGCGTCAT CGTTCGCCCG GCTGTTGCGG
GTGGCGGGCG TACCGGCGAG GCTGGTGGGG GGCTACTACG GCGGGGAATA CAACGAACTG
GCCGGCTATT ATCTGGTTAC CGAAGACCGC GCCCATGTCT GGGTGGAAGC GTTCATAGAG
GGACGAGGTT GGGTAATGGT CGACCCAAGC GCCTTTGCCG TCAACTTCGC TCGAGTGGGC
GAAGCATCCA GAGCCGGGAT GCTTGGCACA ATGACCCGCG TGGTCGACTC GCTCTCCTAT
CTCTGGATTC AGACAGTTAT CACCTATGAC CTCCAGAAAC AGATCGAGCT GGTCCGCACG
GCGAACAGCC GGTTCAAAGA CCTCCGCATG CCGGGGGATG TGCGTCTGGT CGCGGCGGCG
ACAATCGGAG CACTGGCGGC GTTGGGGTGC GTCTTGGCTA TTCGCCGTCA CAGGCCTGCC
GGCAGGGAGA AACGGCTGCT GAAACGTCTC CGTGCAAAGC TCACCCGTTT CTACGGCCTT
CCCCGGGATC CGGATCCCTC GCAGGGACTG TCTGAAGTCG TCGCAGGACT GAACGATCCG
GCGGCAACGG AGTTCGTCGC CATTTACGGT GGGGCGGTTT ACCACGACCG GCGTCTCACC
CCTGATGAGG TCCGCCGCCT TGATCGATTA CTTGAAGAGA TCGGCAGGAG GCACCATCGT
CGTAGCAGCC CATAG
 
Protein sequence
MVSVKTVITI LAYLVAALGY LPVVLYADWP ARLAAPLALA LGIAFDRRGR HPLAGIPATL 
FTVVAFAAYL LQWSRSNPAA PVVNFLVMLL SVRLINEKSP RNLLQIFALS LFLLAGSSLF
SLSALFLFYL TLLLALIAIA LVLLAFHSVD EGISLTSRAL RRVVTAALAM PAASLPLLLV
FFAILPRTQF PLLSFLNAPG EKTTGLSERV EPGTSSSVAD VHTVAFRAEC ERLERNELYW
RGLVLDTATP AGWVRGTTPT ADSPAQPQGR AVRQVIYPEP SRTTYLVGLN VPIRMDGIRN
RQANDFTFVN RGRPGGRIRY EVQSVVSDTI AVRGTVDRNR YLKLPERISG RTIDLARRLT
AGAANDAAKL ERIEAWFRDA GFSYATTGLP LSDDPVDAFL FGSKRGHCEF FASSFARLLR
VAGVPARLVG GYYGGEYNEL AGYYLVTEDR AHVWVEAFIE GRGWVMVDPS AFAVNFARVG
EASRAGMLGT MTRVVDSLSY LWIQTVITYD LQKQIELVRT ANSRFKDLRM PGDVRLVAAA
TIGALAALGC VLAIRRHRPA GREKRLLKRL RAKLTRFYGL PRDPDPSQGL SEVVAGLNDP
AATEFVAIYG GAVYHDRRLT PDEVRRLDRL LEEIGRRHHR RSSP