Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3517 |
Symbol | |
ID | 8138889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4058415 |
End bp | 4059428 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871136 |
Product | hypothetical protein |
Protein accession | YP_003023296 |
Protein GI | 253702107 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.0000347699 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCGGCCTG GCCCTGCGGC GCTCCCCAAG TTTTCCATCA TCATACCGGT GAAGCCGGGG GGCGAGGTGC GCGCCCTGTC CGGCCTGAAC CAGGCGACCT ACCCGGAAGA GCTGTTCGAG GTGCTGATCG CCTACGGCCG TCAGCCCAGC GTGCAGAGAA ACGTCGCCGC CCGCGACGCG AAGGGAGAGA TCCTTTATTT CCTGGACGAC GACTCCCTGG TCGCCCCGGG GTTTCTCGAG CGCGCGGCGT CACACTACCG GGAGCCGAAG GTCGCGGCGG TTGGCGGGCC TTCTCTTACC CCCGCCAACG ATTCCCCCCT GCAAAGAGCG ATAGGGACCG CCTTCACCTC CCCCGTCGGC GGGGGAGGAG TCCGCAACCG CTACCGCAAA AACGGTAGCG CCCGGTACAG CAACGACAGC GAACTCATCC TGTGCAATTT GAGCTTCAGG CGCGATATCT TCCTGACCCA CGAGGGGTTG GATGAGCGGC TCTATCCCAA CGAAGAGAAC GAGCTGATGG ACCGACTGCA ACAGGAGGGT CACCTGCTGG TGCACGACCC CGAGCTGGCC ATCGTGCGCA GCCAGCGCAA CACCTATCGC GCCTATGTGA GGCAGATGTA CGGCTACGGC CGGGGACGCG GAGAGCAGAC CCTGATATCG GGGCAGTTGA AGCCTGTCTC CCTGGTGCCG TCGCTGTTTC TGATCTACCT CCTGTCGCTC CCATTTCTCG GTGGGGGCGT ATTTTTGCTG CCGCTTCTTT GCTACCTGGC GGTTGTCGCG GCGGCCTCCG TCGCCGGGAG CATTTCCGGT CGCGACCTGG CGCTTTTGCC GAGGCTGTTG CTGGTCTTTC CGACGCTGCA CCTGGTCTAC GGCGCCGGTG TCCTGCGCGG CCTGACGCGC CCCCGTTATC GTGGGGGGAG GCAGACCCAC TGGGAAGTCG AAGTCAGGCG GGTGAAGGCA TTCTCGGAAC CTGTAATTAA CCGTTCAACA ACGGTTCTCA ACGTTGAACG TTGA
|
Protein sequence | MRPGPAALPK FSIIIPVKPG GEVRALSGLN QATYPEELFE VLIAYGRQPS VQRNVAARDA KGEILYFLDD DSLVAPGFLE RAASHYREPK VAAVGGPSLT PANDSPLQRA IGTAFTSPVG GGGVRNRYRK NGSARYSNDS ELILCNLSFR RDIFLTHEGL DERLYPNEEN ELMDRLQQEG HLLVHDPELA IVRSQRNTYR AYVRQMYGYG RGRGEQTLIS GQLKPVSLVP SLFLIYLLSL PFLGGGVFLL PLLCYLAVVA AASVAGSISG RDLALLPRLL LVFPTLHLVY GAGVLRGLTR PRYRGGRQTH WEVEVRRVKA FSEPVINRST TVLNVER
|
| |