Gene Information |
Locus tag | GSU1970 |
Symbol | |
ID | 2686195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2161156 |
End bp | 2162208 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637126661 |
Product | polysaccharide biosynthesis protein, putative |
Protein accession | NP_953019 |
Protein GI | 39997068 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2089] Sialic acid synthase |
TIGRFAM ID | |
|
|
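The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates (a common convention that this record does not state) and that the CDS ends in a single stop codon that is not translated; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2161156, 2162208)
print(length)                  # 1053
print(protein_length(length))  # 350
```

Under these assumptions the computed values agree with the Gene Length (1053 bp) and Protein Length (350 aa) fields.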
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.237085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA TGAAATTAGG AAATGTTCCG GTTGGAGCGG AGAATCCCCC CTATGTGATT GCCGAAATCG GTTCCAACCA TAACGGAGAC ATGAACCTCT GCCGTCAACT GATCGATGCC GCGGCCGAGG CCGGCGCCCA TGCCGTGAAG TTCCAGTCGT GGACGGAAAA CTCGCTCATC GCCCGGGAAG AATATGAGCG AAATACCGAG TATTCCGACA AAAAGCGTCA CTTCGGTTCG CTCAGGGATA TGGTTAAGAC CTATCAGCTC ACGACAGACC AGCATAGGGA GGCCCATGCC TACTGCCGCG AACGGGGGAT AGCCTTTTGC TCCACCCCCT TTTCGCCCGA AGAGGCCGAT CTGCTGGAAT CGCTGGATGT GCCCTTCTTT AAGATCGCAT CTATGGATAT CGTCCATCTG CCGTTTCTGA AGTACGTGGC GCGCAAGCGG CGGCCGATGC TGCTCTCGAC CGGCATGGCG ACCCTGGCGG AAATCGAGGC GGCCGTTGAA ACCGTGCGTG CCGAAGGTAA CGATCAGGTC GTGCTGCTGC ACTGTGTATC AATCTATCCG CCCGACTACG AGATGATCCA TCTGCGCAAC ATGGCCACGC TCCAGCAGGC GTTTGATGTC CCGGTCGGGT TCAGCGACCA CACCATGGGC ACGGCCATCC CCCTCGCGGC CATCGCCTTG GGAGCCTGCG TCATCGAGAA GCACTTCACC CTCGACCAGG ACATGGAGGG ATGGGATCAC GCCATCTCCG CCACGCCGAC CGATTTGCGG ACCATCGTGG AGGAGGGGAG AAACATCCGG ATAGCGCTCG GCGGCGGCAA GCGAATCGTC ACCGAAGCCG AGATGGAAAA GCGCAAGAAA TTCAGGCGGA GCCTGGTGAC GCGCCGCGCC CTTCCTCAGG GGCATGTGCT CGTGGAAGCG GATCTGGACG CCAAGCGTCC CGGTACCGGC ATTTCTCCCG CCGAGATGAC CTATGCCCTT GGCCGCAGAC TGGCCCGGGA CATGCAGGAA GATGACGTTA TCCGCTGGGA GGATCTGGTG TGA
|
Protein sequence | MNSMKLGNVP VGAENPPYVI AEIGSNHNGD MNLCRQLIDA AAEAGAHAVK FQSWTENSLI AREEYERNTE YSDKKRHFGS LRDMVKTYQL TTDQHREAHA YCRERGIAFC STPFSPEEAD LLESLDVPFF KIASMDIVHL PFLKYVARKR RPMLLSTGMA TLAEIEAAVE TVRAEGNDQV VLLHCVSIYP PDYEMIHLRN MATLQQAFDV PVGFSDHTMG TAIPLAAIAL GACVIEKHFT LDQDMEGWDH AISATPTDLR TIVEEGRNIR IALGGGKRIV TEAEMEKRKK FRRSLVTRRA LPQGHVLVEA DLDAKRPGTG ISPAEMTYAL GRRLARDMQE DDVIRWEDLV
|
| |
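The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full 1053 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces/newlines that separate the 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_two_blocks = clean_sequence("ATGAACAGCA TGAAATTAGG")
print(len(first_two_blocks))          # 20
print(gc_fraction(first_two_blocks))  # 0.35
```

Applied to the full gene sequence, the result can be compared with the GC content field in the Gene Information table.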