Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0449 |
Symbol | |
ID | 8135758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 541831 |
End bp | 542949 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644868067 |
Product | putative capsular polysaccharide biosynthesis protein |
Protein accession | YP_003020287 |
Protein GI | 253699098 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 104 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCACG AGCACGAAAA GGCAGTCAAA GAGAAAAGAC GTCAAGCGCA GAAGCTATCT CTGGAGCTGC TAGGTCCCAT CGTCGGCAAG ATGCCGAAGT CGAGCATCGA GTCGCTGGAG AAACTCTCGT ACCTTCCGAA GATCCAACGG CATATCGCCG AAATCATCCA AGTCGCCGAT TCCTGCATAG TAAAGCACGA CCTGAAGGGC TCCTTCCCTC CATTCATCAG ATCACAGGCG GCCTTCGGCA AACGTAACCT CTACCTCTTG AAAGACGCCG TAGTCTCCCC CGATTCGGGC ATGGTCTGGA TCGGGAACAG GATTCTCGAG GAGAGCGTGG GGTCCCTGCG CCGCATCATG GACTGGGGGG ACATGCTGCA CGAACCGCTT CTTCCGGTGA GCGAGCTCAA GTCGCAGGAA CCTGTCGTCG TCTGCCATCC CGCCGCAGGG TACTACCATT GGCTCCTGGA GGTCCTGCCC AACCTTCTTT TCGCCATCTC CAGGTTCCCC CAGGTCAAGA TCGTCCTTCC CGAGAACTCC CCTGCCTACG TCTTCGACGG GTTGTCCATG ATCCTCGGCC CCGAGGGAGC CGACCGGTTC ATCTTCTGCT CCACTCCGCT GAAGGTAAGG AGCCTGGTGA TGCCGCAGTA CCACACCGCA CCCGAATTCA CCAGTCCACA GGCCATTGAC CTCCTCAAGT CGCAGGTGAA GCCGAAGGTC GTTGCCCGTG AGACCTCCGG CGCGCCCGCA TCCGGAACCA AGCTCTACAT CTCGCGGCGC AAAAGCAGGA GGCGCCGGTT GTTAGGCGAG GAGGAGCTTG AGAGGAAGCT CCAGGAAAAG GATTTCAGCA TCCTGCACCT GGAGGACTTT TCTTTCCAGG AGCAGATCCG CATCTTCCAC CACGCCGAGA CGGTAGTGTC GACCCATGGC GCCGGCCTGA GCAATCTCGT CTGGTGCGAG CCCCCGTGTA AGGTGATCGA GATATTTCCC AGGAATTACA TCCTCGACTG TTTCGCCTGG CTCAGCTTCA GCCAAGGTTT CGACTATCGC TACGTCATCT GCAGCACCGG GCACAAGATA GACGACGAAG CCATGGCCGG CGTGCTGGGG CAGCTGTAA
|
Protein sequence | MKHEHEKAVK EKRRQAQKLS LELLGPIVGK MPKSSIESLE KLSYLPKIQR HIAEIIQVAD SCIVKHDLKG SFPPFIRSQA AFGKRNLYLL KDAVVSPDSG MVWIGNRILE ESVGSLRRIM DWGDMLHEPL LPVSELKSQE PVVVCHPAAG YYHWLLEVLP NLLFAISRFP QVKIVLPENS PAYVFDGLSM ILGPEGADRF IFCSTPLKVR SLVMPQYHTA PEFTSPQAID LLKSQVKPKV VARETSGAPA SGTKLYISRR KSRRRRLLGE EELERKLQEK DFSILHLEDF SFQEQIRIFH HAETVVSTHG AGLSNLVWCE PPCKVIEIFP RNYILDCFAW LSFSQGFDYR YVICSTGHKI DDEAMAGVLG QL
|
| |