Gene GSU1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1984 
Symbol 
ID2688142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2174883 
End bp2176355 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content56% 
IMG OID637126675 
Productpolysaccharide chain length determinant protein, putative 
Protein accessionNP_953033 
Protein GI39997082 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGT CCGAATTCGA CTACCGGCAC TATCTCACGC TTATCGTCGC CCGTAAGCGC 
CTTTTTGTCG TGGTGGCCCT CGCGGTCATG GCTGCCGCAG TGGTCTACAG CTACGTCCTT
CCCAAGAAGT ATGAGGCCAG GAGCACCGTA TTCATCGAGA AAAATGTCAT CAGCGAACTG
GTCAAGGGGA TTGCCGTAAC TCCTTCCATG GAACAGGCCA TCAAGGGCCT TAGCGAGGCA
ATAACCAGCC GTACGCTCGT CACCAAGGTG ATCAATGATC TCGACCTCGA CGTGACCACC
AAAAGCAATG CCGAGATCGA GGCCCGGGTT CGCGCCATCC AGCAGAACAT AACCATCAAA
CTCAAGGGCG ACAACATTTT CACCATCTCC TACGTGGACA AGGATCCCCG AGTGGCCCGC
GATTTCGTCA ATACCCTGGT CCGGCGCTAC GTGGAGGAAA ACATATCCTC CAAGCGGGAA
GAATCCTACG GTGCCATCAA ATTCCTTTCC GAGCAGATCG ACACCTTCCG CGGCAAACTC
GAAGAAGCCG AAGGCGAACT CAACCGGTAC AAAACCGAGA AAGGGGGGGT CATCGCTATC
GACGAAGCAA AGCTGTTCGA GGAGATCAAC GTCGCTCAGC AGAAACTCTA TGATATCCAG
CTGCGCAGGC GCCACCTGGA AGGACTTCGA CCGGTCACGC GCAAGGCGGG AGACCCCCTC
CAGGTAAAGC TCGTCGCCCT CCAGAAGCAG CTTGAAGAGC TCCGTGTCTC TTACACCGAC
AGCTACCCCG AGGTGCTCCG CGTCAGGGGA GAAATCGAGA CCCTCAAGGA GCAGATGAAA
AATCGTTCGC CCCAGCAGGA AACGGTCGTT GACCCCCAGG AGTACGAGAA GGCCGAAGCG
GAACTCCAGG CGCTCAAGGT CAGTGAAGAC GGCCTCAAAC GCTACATCGC CACCAATCAG
GCACTGCTCA GGAATATCCC TTCGGCCAAG GCCGGGCTTG AAAAGCTCGA ACTGGAAAAG
AAGAACCGCA AGGACCTTTA CGACCAGCTC ATGGCGCGTC ATGGCCAATC CGAGGTGTCG
AAGCAGATGG AAGTCCAAGA CAAGACAACC ACCTTCCGCA TCGTCGACCC GGCGGTCACG
CCGTCAAGTC CCATTAGCCC CAACCGGGTC AAGATCATGC TGATGGGGAT TGTCGCTGGA
CTCGGCTGCG CCCTGGGACT CATCGTCCTC CTCGACCGGC TTAATAATTC AGTGCATTCG
GTTGATGCGC TGAAGGAAAC CAGGATTCCC GTCCTGGCCG TCATCCCCAA GATTCGCACC
ATCGAGGCGG TTGCCCGGGA GCGGAGCGAG AACATCAGGG TATTTGCTGC GGCTGGGGCG
TGTTTCATGG TTATTCTCGC GTTTCTCGTC ATGGAACTGC TGAATATCTC ACCCGTGGAT
CGGATCATCA GCAGGTTGCA TGGAATGCTG TAA
 
Protein sequence
MQQSEFDYRH YLTLIVARKR LFVVVALAVM AAAVVYSYVL PKKYEARSTV FIEKNVISEL 
VKGIAVTPSM EQAIKGLSEA ITSRTLVTKV INDLDLDVTT KSNAEIEARV RAIQQNITIK
LKGDNIFTIS YVDKDPRVAR DFVNTLVRRY VEENISSKRE ESYGAIKFLS EQIDTFRGKL
EEAEGELNRY KTEKGGVIAI DEAKLFEEIN VAQQKLYDIQ LRRRHLEGLR PVTRKAGDPL
QVKLVALQKQ LEELRVSYTD SYPEVLRVRG EIETLKEQMK NRSPQQETVV DPQEYEKAEA
ELQALKVSED GLKRYIATNQ ALLRNIPSAK AGLEKLELEK KNRKDLYDQL MARHGQSEVS
KQMEVQDKTT TFRIVDPAVT PSSPISPNRV KIMLMGIVAG LGCALGLIVL LDRLNNSVHS
VDALKETRIP VLAVIPKIRT IEAVARERSE NIRVFAAAGA CFMVILAFLV MELLNISPVD
RIISRLHGML