Gene GSU2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2241 
Symbol 
ID2687517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2455686 
End bp2456696 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content60% 
IMG OID637126934 
Productcapsular polysaccharide biosynthesis protein I 
Protein accessionNP_953290 
Protein GI39997339 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCGA TACTCGTCAC CGGAGCAGCC GGGTTCATCG GTTTTCATCT TACGAAACGC 
CTTCTTGACC GGGGCGATCG CGTGGTGGGG CTCGACAACC TCAACGACTA TTATGACGTG
AACCTGAAGC TGGACCGACT CCGCCAGTTG GAGGGGCGCG AGGGATTCAG CTTTGTGCGG
ACCAGTCTGG CAGACCGGCC GGCCCTGGAG GATCTCTTTG CCGGCCAGCG TTTCGATGTG
GTGGTGAACC TGGCTGCCCA GGCCGGGGTC CGCTACTCCA TCACCAACCC CCACGCTTAC
GTGGACAGTA ATCTGGTCGG CTTCATCAAC ATTCTGGAGG GGTGCCGGCA TCACGGGGTG
AAGCACTTGG TCTACGCATC GTCCAGCTCC GTCTATGGTG CCAATACGGC AATGCCGTTT
TCGATCCACC ACAACGTGGA TCATCCGGTT TCCCTGTATG CCGCCACCAA GAAGGCCAAC
GAGCTCATGG CCCACACCTA TTCGAGCCTC TACGGGCTGC CCACCACGGG CCTGCGCTTC
TTCACGGTCT ACGGCCCCTG GGGGCGCCCC GACATGGCGC TCTTCCTCTT TACCAAGGCA
ATCCTCGAAG GCCGGCCCAT CGATGTCTAT AATTTTGGCA AAATGCAGCG TGATTTCACT
TATGTAGACG ACATTGTCGA GGGGGTGACG CGGGTCATGG ACCGCACGCC GGAGCCCAAC
CCTGCCTGGA GCGGGGCCCG ACCCGATCCC GGCACGAGCT ACGCTCCCTA TCGCATCTAC
AACATCGGCA ACAACAACCC GGTCGAGCTT CTCGCGTTCA TTGAAGCCAT CGAACAGAAC
CTGGGGATCA CTGCGCAGAA GAATCTGCTT CCCCTGCAGG CGGGTGACGT GCCCGCCACC
TACGCCGACG TGGATGACCT GATGAACGAC GTGGGGTTCA AGCCGGCCAC TCCCATCGGG
GAGGGGATAG AGCGGTTCGT CGAGTGGTAC CGGGGATACT ACGGCGTCTG A
 
Protein sequence
MSSILVTGAA GFIGFHLTKR LLDRGDRVVG LDNLNDYYDV NLKLDRLRQL EGREGFSFVR 
TSLADRPALE DLFAGQRFDV VVNLAAQAGV RYSITNPHAY VDSNLVGFIN ILEGCRHHGV
KHLVYASSSS VYGANTAMPF SIHHNVDHPV SLYAATKKAN ELMAHTYSSL YGLPTTGLRF
FTVYGPWGRP DMALFLFTKA ILEGRPIDVY NFGKMQRDFT YVDDIVEGVT RVMDRTPEPN
PAWSGARPDP GTSYAPYRIY NIGNNNPVEL LAFIEAIEQN LGITAQKNLL PLQAGDVPAT
YADVDDLMND VGFKPATPIG EGIERFVEWY RGYYGV