Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1984 |
Symbol | |
ID | 2688142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2174883 |
End bp | 2176355 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637126675 |
Product | polysaccharide chain length determinant protein, putative |
Protein accession | NP_953033 |
Protein GI | 39997082 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAGT CCGAATTCGA CTACCGGCAC TATCTCACGC TTATCGTCGC CCGTAAGCGC CTTTTTGTCG TGGTGGCCCT CGCGGTCATG GCTGCCGCAG TGGTCTACAG CTACGTCCTT CCCAAGAAGT ATGAGGCCAG GAGCACCGTA TTCATCGAGA AAAATGTCAT CAGCGAACTG GTCAAGGGGA TTGCCGTAAC TCCTTCCATG GAACAGGCCA TCAAGGGCCT TAGCGAGGCA ATAACCAGCC GTACGCTCGT CACCAAGGTG ATCAATGATC TCGACCTCGA CGTGACCACC AAAAGCAATG CCGAGATCGA GGCCCGGGTT CGCGCCATCC AGCAGAACAT AACCATCAAA CTCAAGGGCG ACAACATTTT CACCATCTCC TACGTGGACA AGGATCCCCG AGTGGCCCGC GATTTCGTCA ATACCCTGGT CCGGCGCTAC GTGGAGGAAA ACATATCCTC CAAGCGGGAA GAATCCTACG GTGCCATCAA ATTCCTTTCC GAGCAGATCG ACACCTTCCG CGGCAAACTC GAAGAAGCCG AAGGCGAACT CAACCGGTAC AAAACCGAGA AAGGGGGGGT CATCGCTATC GACGAAGCAA AGCTGTTCGA GGAGATCAAC GTCGCTCAGC AGAAACTCTA TGATATCCAG CTGCGCAGGC GCCACCTGGA AGGACTTCGA CCGGTCACGC GCAAGGCGGG AGACCCCCTC CAGGTAAAGC TCGTCGCCCT CCAGAAGCAG CTTGAAGAGC TCCGTGTCTC TTACACCGAC AGCTACCCCG AGGTGCTCCG CGTCAGGGGA GAAATCGAGA CCCTCAAGGA GCAGATGAAA AATCGTTCGC CCCAGCAGGA AACGGTCGTT GACCCCCAGG AGTACGAGAA GGCCGAAGCG GAACTCCAGG CGCTCAAGGT CAGTGAAGAC GGCCTCAAAC GCTACATCGC CACCAATCAG GCACTGCTCA GGAATATCCC TTCGGCCAAG GCCGGGCTTG AAAAGCTCGA ACTGGAAAAG AAGAACCGCA AGGACCTTTA CGACCAGCTC ATGGCGCGTC ATGGCCAATC CGAGGTGTCG AAGCAGATGG AAGTCCAAGA CAAGACAACC ACCTTCCGCA TCGTCGACCC GGCGGTCACG CCGTCAAGTC CCATTAGCCC CAACCGGGTC AAGATCATGC TGATGGGGAT TGTCGCTGGA CTCGGCTGCG CCCTGGGACT CATCGTCCTC CTCGACCGGC TTAATAATTC AGTGCATTCG GTTGATGCGC TGAAGGAAAC CAGGATTCCC GTCCTGGCCG TCATCCCCAA GATTCGCACC ATCGAGGCGG TTGCCCGGGA GCGGAGCGAG AACATCAGGG TATTTGCTGC GGCTGGGGCG TGTTTCATGG TTATTCTCGC GTTTCTCGTC ATGGAACTGC TGAATATCTC ACCCGTGGAT CGGATCATCA GCAGGTTGCA TGGAATGCTG TAA
|
Protein sequence | MQQSEFDYRH YLTLIVARKR LFVVVALAVM AAAVVYSYVL PKKYEARSTV FIEKNVISEL VKGIAVTPSM EQAIKGLSEA ITSRTLVTKV INDLDLDVTT KSNAEIEARV RAIQQNITIK LKGDNIFTIS YVDKDPRVAR DFVNTLVRRY VEENISSKRE ESYGAIKFLS EQIDTFRGKL EEAEGELNRY KTEKGGVIAI DEAKLFEEIN VAQQKLYDIQ LRRRHLEGLR PVTRKAGDPL QVKLVALQKQ LEELRVSYTD SYPEVLRVRG EIETLKEQMK NRSPQQETVV DPQEYEKAEA ELQALKVSED GLKRYIATNQ ALLRNIPSAK AGLEKLELEK KNRKDLYDQL MARHGQSEVS KQMEVQDKTT TFRIVDPAVT PSSPISPNRV KIMLMGIVAG LGCALGLIVL LDRLNNSVHS VDALKETRIP VLAVIPKIRT IEAVARERSE NIRVFAAAGA CFMVILAFLV MELLNISPVD RIISRLHGML
|
| |