Gene Information |
Locus tag | GSU2241 |
Symbol | |
ID | 2687517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2455686 |
End bp | 2456696 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637126934 |
Product | capsular polysaccharide biosynthesis protein I |
Protein accession | NP_953290 |
Protein GI | 39997339 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
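The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2455686, End bp 2456696.
length = gene_length(2455686, 2456696)
print(length, protein_length(length))  # prints "1011 336"
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields in the record.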
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCGA TACTCGTCAC CGGAGCAGCC GGGTTCATCG GTTTTCATCT TACGAAACGC CTTCTTGACC GGGGCGATCG CGTGGTGGGG CTCGACAACC TCAACGACTA TTATGACGTG AACCTGAAGC TGGACCGACT CCGCCAGTTG GAGGGGCGCG AGGGATTCAG CTTTGTGCGG ACCAGTCTGG CAGACCGGCC GGCCCTGGAG GATCTCTTTG CCGGCCAGCG TTTCGATGTG GTGGTGAACC TGGCTGCCCA GGCCGGGGTC CGCTACTCCA TCACCAACCC CCACGCTTAC GTGGACAGTA ATCTGGTCGG CTTCATCAAC ATTCTGGAGG GGTGCCGGCA TCACGGGGTG AAGCACTTGG TCTACGCATC GTCCAGCTCC GTCTATGGTG CCAATACGGC AATGCCGTTT TCGATCCACC ACAACGTGGA TCATCCGGTT TCCCTGTATG CCGCCACCAA GAAGGCCAAC GAGCTCATGG CCCACACCTA TTCGAGCCTC TACGGGCTGC CCACCACGGG CCTGCGCTTC TTCACGGTCT ACGGCCCCTG GGGGCGCCCC GACATGGCGC TCTTCCTCTT TACCAAGGCA ATCCTCGAAG GCCGGCCCAT CGATGTCTAT AATTTTGGCA AAATGCAGCG TGATTTCACT TATGTAGACG ACATTGTCGA GGGGGTGACG CGGGTCATGG ACCGCACGCC GGAGCCCAAC CCTGCCTGGA GCGGGGCCCG ACCCGATCCC GGCACGAGCT ACGCTCCCTA TCGCATCTAC AACATCGGCA ACAACAACCC GGTCGAGCTT CTCGCGTTCA TTGAAGCCAT CGAACAGAAC CTGGGGATCA CTGCGCAGAA GAATCTGCTT CCCCTGCAGG CGGGTGACGT GCCCGCCACC TACGCCGACG TGGATGACCT GATGAACGAC GTGGGGTTCA AGCCGGCCAC TCCCATCGGG GAGGGGATAG AGCGGTTCGT CGAGTGGTAC CGGGGATACT ACGGCGTCTG A
|
Protein sequence | MSSILVTGAA GFIGFHLTKR LLDRGDRVVG LDNLNDYYDV NLKLDRLRQL EGREGFSFVR TSLADRPALE DLFAGQRFDV VVNLAAQAGV RYSITNPHAY VDSNLVGFIN ILEGCRHHGV KHLVYASSSS VYGANTAMPF SIHHNVDHPV SLYAATKKAN ELMAHTYSSL YGLPTTGLRF FTVYGPWGRP DMALFLFTKA ILEGRPIDVY NFGKMQRDFT YVDDIVEGVT RVMDRTPEPN PAWSGARPDP GTSYAPYRIY NIGNNNPVEL LAFIEAIEQN LGITAQKNLL PLQAGDVPAT YADVDDLMND VGFKPATPIG EGIERFVEWY RGYYGV
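The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how the two could be cross-checked is shown below. It assumes the displayed gene sequence is already the coding strand (it begins with ATG even though the record lists the minus strand) and builds the codon assignments of translation table 11, which are the same as the standard genetic code, by hand. The helper names are hypothetical; a library such as Biopython could be used instead.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence from the record.
print(translate("ATGTCTTCGA TACTCGTCAC CGGAGCAGCC"))  # prints "MSSILVTGAA"
```

The ten residues produced from the first 30 nucleotides match the first block of the protein sequence; passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.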