Gene Information |
Locus tag | GSU1855 |
Symbol | |
ID | 2688583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2027271 |
End bp | 2028491 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637126545 |
Product | capsule polysaccharide export protein, putative |
Protein accession | NP_952905 |
Protein GI | 39996954 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | |
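The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon; neither is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2027271, 2028491)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions, the coordinates reproduce the listed Gene Length of 1221 bp and Protein Length of 406 aa.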
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0722635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAATC TACAAAATGA ATGTTGTAAC GCTAACGATA GCAGCAACAA TTTTGAAGCT GAGAGAATTT GCATTCTAGA ATACCTAGAG GTTGTTTTAT GTGGCTGGAA GCTGATAACA ATATTGACCG TTTTGGCATT TGTTTTGTCA TTGGTTATTA GTATTAGTTT GCCAAATGTC TACAAAGCAA CAACGCGCAT TCTTCCGCCA CAGCAAGATT CTGGATTGCT AGGTCTTATG TTAGGGCAAG CAGGTTCTTT TGGTGGTGTA GGACCATTGG CGACAGATCT GCTGGGTAAA GCGAATCCTG CAGATATGTA CGTTAGTATT ATGACAAGTG AGGCTATAAG TGATAAGATT ATTGATAAAT TCAATTTAAT GAAGGTATAC AACGAGAAAT ACCGCATTGA TGCATACAAA GTACTTGATA ATAATATTGA TATTCTTGCT GGTAAGAAGG ATGGTATAAT TACAATATCT GTTGAGGACG AAGATCCGCA GCGAGCTGCC AATATTGCTA ATGCTTATGT AAATGAACTG AGTAATCTTT TAGTTCAACT AAATTCGAAG GAGTCAGCGC AAAATAGAGT ATTTTACGAA GAGAGACTTG CTAAGGCAAA AGTTGATTTA GCTCGATCGG AAGACAATCT AAAGCAATTT CAGACAAAAT ATAAAGTCGT GTCAGTAGCT GATCAGGCAC AAGCAGCAAT TTCTAATATT GCTCAACTGA ATGCTCAGTT GATCGCCCAA GAAATTCAAT TATCAAATCT TAGAAAACAA TTTACTGAAG ATAGTCTAGA GGTAAAAAAC GTTAAATCAA CCATTAATGG ACTGAGGAAT CAAATATCTC GAATTGAAAG TCGTGAAGCC GGAGGTGCTA TCCCGAATAT GGGGTTGATT CCTGAGTTAG GACAACAGTA TTTTCGCCTT ATGCGAGAAT TTAAAGTCCA AGAAGCGATT GTAGATTTAC TTACAAAACA ATTTGAGTCA GCAAAACTCG GCGAACTTAA GGATTCATAT TATGTACAAA TAATCCAAAC TGCACGGGTT CCTGACAAAA AATCTAAGCC AAAACGTAGT ATGATAGTTG CTATGTCCAC AGTTATAACA TTTGGCTTTG GTATCCTATC AGTAATAATT ATGCATATGC TATCGAAACT ACCAAGTAAC GAGCTTGTTG TATGGAACAG GATAAAGTCA AGAGTAGTTA AAAGAATATA A
|
Protein sequence | MANLQNECCN ANDSSNNFEA ERICILEYLE VVLCGWKLIT ILTVLAFVLS LVISISLPNV YKATTRILPP QQDSGLLGLM LGQAGSFGGV GPLATDLLGK ANPADMYVSI MTSEAISDKI IDKFNLMKVY NEKYRIDAYK VLDNNIDILA GKKDGIITIS VEDEDPQRAA NIANAYVNEL SNLLVQLNSK ESAQNRVFYE ERLAKAKVDL ARSEDNLKQF QTKYKVVSVA DQAQAAISNI AQLNAQLIAQ EIQLSNLRKQ FTEDSLEVKN VKSTINGLRN QISRIESREA GGAIPNMGLI PELGQQYFRL MREFKVQEAI VDLLTKQFES AKLGELKDSY YVQIIQTARV PDKKSKPKRS MIVAMSTVIT FGFGILSVII MHMLSKLPSN ELVVWNRIKS RVVKRI
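The sequences above are printed in space-separated groups of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and check that a gene sequence looks like a complete CDS. The function names are hypothetical. It assumes the stop codons of translation table 11 (TAA, TAG, TGA), and it deliberately does not check the start codon, because table 11 permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove whitespace from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(grouped: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(grouped)
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The first two groups of the gene sequence above, used as a small demo.
demo = "ATGGCTAATC TACAAAATGA"
print(len(clean_sequence(demo)), gc_fraction(demo))
```

To compare against the GC content and Gene Length fields, pass the full Gene sequence value to `gc_fraction` and `clean_sequence`.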