Gene GSU1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1970 
Symbol 
ID2686195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2161156 
End bp2162208 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content60% 
IMG OID637126661 
Productpolysaccharide biosynthesis protein, putative 
Protein accessionNP_953019 
Protein GI39997068 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.237085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGCA TGAAATTAGG AAATGTTCCG GTTGGAGCGG AGAATCCCCC CTATGTGATT 
GCCGAAATCG GTTCCAACCA TAACGGAGAC ATGAACCTCT GCCGTCAACT GATCGATGCC
GCGGCCGAGG CCGGCGCCCA TGCCGTGAAG TTCCAGTCGT GGACGGAAAA CTCGCTCATC
GCCCGGGAAG AATATGAGCG AAATACCGAG TATTCCGACA AAAAGCGTCA CTTCGGTTCG
CTCAGGGATA TGGTTAAGAC CTATCAGCTC ACGACAGACC AGCATAGGGA GGCCCATGCC
TACTGCCGCG AACGGGGGAT AGCCTTTTGC TCCACCCCCT TTTCGCCCGA AGAGGCCGAT
CTGCTGGAAT CGCTGGATGT GCCCTTCTTT AAGATCGCAT CTATGGATAT CGTCCATCTG
CCGTTTCTGA AGTACGTGGC GCGCAAGCGG CGGCCGATGC TGCTCTCGAC CGGCATGGCG
ACCCTGGCGG AAATCGAGGC GGCCGTTGAA ACCGTGCGTG CCGAAGGTAA CGATCAGGTC
GTGCTGCTGC ACTGTGTATC AATCTATCCG CCCGACTACG AGATGATCCA TCTGCGCAAC
ATGGCCACGC TCCAGCAGGC GTTTGATGTC CCGGTCGGGT TCAGCGACCA CACCATGGGC
ACGGCCATCC CCCTCGCGGC CATCGCCTTG GGAGCCTGCG TCATCGAGAA GCACTTCACC
CTCGACCAGG ACATGGAGGG ATGGGATCAC GCCATCTCCG CCACGCCGAC CGATTTGCGG
ACCATCGTGG AGGAGGGGAG AAACATCCGG ATAGCGCTCG GCGGCGGCAA GCGAATCGTC
ACCGAAGCCG AGATGGAAAA GCGCAAGAAA TTCAGGCGGA GCCTGGTGAC GCGCCGCGCC
CTTCCTCAGG GGCATGTGCT CGTGGAAGCG GATCTGGACG CCAAGCGTCC CGGTACCGGC
ATTTCTCCCG CCGAGATGAC CTATGCCCTT GGCCGCAGAC TGGCCCGGGA CATGCAGGAA
GATGACGTTA TCCGCTGGGA GGATCTGGTG TGA
 
Protein sequence
MNSMKLGNVP VGAENPPYVI AEIGSNHNGD MNLCRQLIDA AAEAGAHAVK FQSWTENSLI 
AREEYERNTE YSDKKRHFGS LRDMVKTYQL TTDQHREAHA YCRERGIAFC STPFSPEEAD
LLESLDVPFF KIASMDIVHL PFLKYVARKR RPMLLSTGMA TLAEIEAAVE TVRAEGNDQV
VLLHCVSIYP PDYEMIHLRN MATLQQAFDV PVGFSDHTMG TAIPLAAIAL GACVIEKHFT
LDQDMEGWDH AISATPTDLR TIVEEGRNIR IALGGGKRIV TEAEMEKRKK FRRSLVTRRA
LPQGHVLVEA DLDAKRPGTG ISPAEMTYAL GRRLARDMQE DDVIRWEDLV