Gene GSU1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1855 
Symbol 
ID2688583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2027271 
End bp2028491 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content37% 
IMG OID637126545 
Productcapsule polysaccharide export protein, putative 
Protein accessionNP_952905 
Protein GI39996954 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0722635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAATC TACAAAATGA ATGTTGTAAC GCTAACGATA GCAGCAACAA TTTTGAAGCT 
GAGAGAATTT GCATTCTAGA ATACCTAGAG GTTGTTTTAT GTGGCTGGAA GCTGATAACA
ATATTGACCG TTTTGGCATT TGTTTTGTCA TTGGTTATTA GTATTAGTTT GCCAAATGTC
TACAAAGCAA CAACGCGCAT TCTTCCGCCA CAGCAAGATT CTGGATTGCT AGGTCTTATG
TTAGGGCAAG CAGGTTCTTT TGGTGGTGTA GGACCATTGG CGACAGATCT GCTGGGTAAA
GCGAATCCTG CAGATATGTA CGTTAGTATT ATGACAAGTG AGGCTATAAG TGATAAGATT
ATTGATAAAT TCAATTTAAT GAAGGTATAC AACGAGAAAT ACCGCATTGA TGCATACAAA
GTACTTGATA ATAATATTGA TATTCTTGCT GGTAAGAAGG ATGGTATAAT TACAATATCT
GTTGAGGACG AAGATCCGCA GCGAGCTGCC AATATTGCTA ATGCTTATGT AAATGAACTG
AGTAATCTTT TAGTTCAACT AAATTCGAAG GAGTCAGCGC AAAATAGAGT ATTTTACGAA
GAGAGACTTG CTAAGGCAAA AGTTGATTTA GCTCGATCGG AAGACAATCT AAAGCAATTT
CAGACAAAAT ATAAAGTCGT GTCAGTAGCT GATCAGGCAC AAGCAGCAAT TTCTAATATT
GCTCAACTGA ATGCTCAGTT GATCGCCCAA GAAATTCAAT TATCAAATCT TAGAAAACAA
TTTACTGAAG ATAGTCTAGA GGTAAAAAAC GTTAAATCAA CCATTAATGG ACTGAGGAAT
CAAATATCTC GAATTGAAAG TCGTGAAGCC GGAGGTGCTA TCCCGAATAT GGGGTTGATT
CCTGAGTTAG GACAACAGTA TTTTCGCCTT ATGCGAGAAT TTAAAGTCCA AGAAGCGATT
GTAGATTTAC TTACAAAACA ATTTGAGTCA GCAAAACTCG GCGAACTTAA GGATTCATAT
TATGTACAAA TAATCCAAAC TGCACGGGTT CCTGACAAAA AATCTAAGCC AAAACGTAGT
ATGATAGTTG CTATGTCCAC AGTTATAACA TTTGGCTTTG GTATCCTATC AGTAATAATT
ATGCATATGC TATCGAAACT ACCAAGTAAC GAGCTTGTTG TATGGAACAG GATAAAGTCA
AGAGTAGTTA AAAGAATATA A
 
Protein sequence
MANLQNECCN ANDSSNNFEA ERICILEYLE VVLCGWKLIT ILTVLAFVLS LVISISLPNV 
YKATTRILPP QQDSGLLGLM LGQAGSFGGV GPLATDLLGK ANPADMYVSI MTSEAISDKI
IDKFNLMKVY NEKYRIDAYK VLDNNIDILA GKKDGIITIS VEDEDPQRAA NIANAYVNEL
SNLLVQLNSK ESAQNRVFYE ERLAKAKVDL ARSEDNLKQF QTKYKVVSVA DQAQAAISNI
AQLNAQLIAQ EIQLSNLRKQ FTEDSLEVKN VKSTINGLRN QISRIESREA GGAIPNMGLI
PELGQQYFRL MREFKVQEAI VDLLTKQFES AKLGELKDSY YVQIIQTARV PDKKSKPKRS
MIVAMSTVIT FGFGILSVII MHMLSKLPSN ELVVWNRIKS RVVKRI