Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3813 |
Symbol | |
ID | 8139187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4383800 |
End bp | 4384720 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871432 |
Product | Capsular polysaccharide biosynthesis protein-like protein |
Protein accession | YP_003023590 |
Protein GI | 253702401 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 102 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCAGG TGATGAACGG CCGCAGCTTC AGCGAACCCA ATCCCTTGTT CGTGAAGGAC GTGTTCCTTT TCGGGCCGGA GGGGATACCC CTGCAGGGGA AATGCGTGGT GAACCGTTTC GCCAACTTCG CGGACGACCA GCTCCTGATC TGGCTCTTGC ACCACGAAAA CCGCGAACTG GAAGGGCTCG ACGGGGTGGT CATGCCGCTT CACGGCGACT GGAGCTGCAA CTTCTGGCAC TGGTGCAACG AGACGCTCCC CATGGCGCTT GCGGCCCACC AGGGGGGCTT CAACGGCAGG TACCTGGTCC CACCGGTCCC CTTTGCGGCG GAATCCCTTC GGCTTATCGG CGTGCGGCCG GACCGCATCG TCGTGGCGAA AGAGTGCGAC TACCACCTGG AGTGCATGTG CCTGGTCCCG AAGCTTTTGG GGGGCAACCC CGCTGGCATG CCCGCAAGAC TCAAGGTGCG CGAGCTTTTC AGGTCCCTCT TCGCACAGGG GGCGCCGCGG CACCGGCTCT ACTTAAGCCG CAACGGCCAC CCCGACAACC TGCGCAAGGT GCTGCACGAG GATGCACTGC TCGCCATGCT TGGCAAATAC GGCTTCACCA TGCTGCGCCT GGAGGAAATG CCGCTCGCGG AGCAGCTCGC CTACAGCTGC AACGCCGAGG CTATCGTCGC ACCCCATGGT GCCGGCGTCA CCCACTGCGC TTTCATGCCG GAAAAGTCGC TGGTGATCGA GTTCTTCGCA CCGACCTATA TCAACCCCTG CATGCTCCTT CATTGCGACT GCCTGAAGCA CCGGTACTAC CAGGTCACCA GCCCTTGCCT GTACGCAGGG TACCGGCACG ACATGGACAT CGAGGTGCAC GGCCAGATAC TGGGAATGAC TCTGGAAAGG GAGCTCACGC AGAGGCAGTA G
|
Protein sequence | MYQVMNGRSF SEPNPLFVKD VFLFGPEGIP LQGKCVVNRF ANFADDQLLI WLLHHENREL EGLDGVVMPL HGDWSCNFWH WCNETLPMAL AAHQGGFNGR YLVPPVPFAA ESLRLIGVRP DRIVVAKECD YHLECMCLVP KLLGGNPAGM PARLKVRELF RSLFAQGAPR HRLYLSRNGH PDNLRKVLHE DALLAMLGKY GFTMLRLEEM PLAEQLAYSC NAEAIVAPHG AGVTHCAFMP EKSLVIEFFA PTYINPCMLL HCDCLKHRYY QVTSPCLYAG YRHDMDIEVH GQILGMTLER ELTQRQ
|
| |