Gene GSU1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1959 
Symbol 
ID2688250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2148200 
End bp2149210 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content57% 
IMG OID637126650 
Producthypothetical protein 
Protein accessionNP_953008 
Protein GI39997057 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGGCGC ACTGTAGCGC CATTGCCCGG ACCATGGTCT ACCCCAGCTT CTTTTGCAGC 
GGTGACTGGT TAAAGGTTGC GGCGGAGAAT CTGTGCCCGG GGGACGAAGC GTATATCCTT
CTGGCCAAAG ATGCCGACGA CATCCGGGGG GTGCTCCCCC TGGTCAGAAA GCGCAACGCC
CTCGGAGGTA CCGACCTGCA CTACCTGGGC TCGGATTTTT ATCCCGATCC GCTTGGCCTG
ATCTGTTCGC CTGCCGATCG GGCCGATTGT GCCGCTGCCC TGAGAAACCA TCTACTCAAT
GCCCCCGATT GGGACCGATT GATACTGGAC TTTCTGCTGG AGGACGAACC GGCTGATTGG
ACTCTGCCCG GCAAGCCGGT TTCAGTGGCG CCGTTTAAAG TGCTGCCACG GGACTTTTCC
GAGCTGCTGG GAGAGTTCAA GAAGAAAAAG CGCTACAACC TGAGAGCGAT GGTGAAAAAG
CTCCTTGATG CCGGTGGAGA GCTTGCTGCT TCTTCGGGGC CGGAATCGAA CATAGCGTAT
CTGGACGCCT TGTTTTTCCT GCATGAGAAA AGGGCGAGCG AAAGGTCGCT GGACAGCAGC
TTCACTGGAC CGAGGGTGCA ATCGCTCCAC CGAGCCCTTG TTGCTGCGTC GGATAATGTG
AGGTTTTTCG GACTCAGGCT CAATGGGCAG ATGATAGCTG TGATCTACGG CTTCGAGTTC
TGCAATCGCT TTTTCTACTA CCAGGTGGCC CACGACCCGG ACCATGGCCA CCTCAGCCCG
GGGACGGTGC TGCTCTATCT CGTGATTGAA GCCTGTTGCG TCAACGGACT GACGGAATTC
AACTTCCTCC AGGGAGATGA AACCTATAAA GGCATCTGGA CCAACGATTC GAGGATACTC
TATCGCTGCG TACTCAATCG AAGCACGTGG CGCTCCCGTG TGTTCAGCGC CGTGGAGGAA
TCCAGAGGTT ACGTCAAGCG GGCAATGGGG TTGATGTCTC GTGGGAATTA A
 
Protein sequence
MWAHCSAIAR TMVYPSFFCS GDWLKVAAEN LCPGDEAYIL LAKDADDIRG VLPLVRKRNA 
LGGTDLHYLG SDFYPDPLGL ICSPADRADC AAALRNHLLN APDWDRLILD FLLEDEPADW
TLPGKPVSVA PFKVLPRDFS ELLGEFKKKK RYNLRAMVKK LLDAGGELAA SSGPESNIAY
LDALFFLHEK RASERSLDSS FTGPRVQSLH RALVAASDNV RFFGLRLNGQ MIAVIYGFEF
CNRFFYYQVA HDPDHGHLSP GTVLLYLVIE ACCVNGLTEF NFLQGDETYK GIWTNDSRIL
YRCVLNRSTW RSRVFSAVEE SRGYVKRAMG LMSRGN