Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3505 |
Symbol | |
ID | 8138877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4046018 |
End bp | 4047001 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644871124 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003023284 |
Protein GI | 253702095 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 5.6726e-25 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAAAGT CAGAGAGGCA GGAGTCTGTA AAGTTATCCG TTCTGGTGCC AGTGTACAAT TGGGATGTCT CCTTATTAGT GCGCAAACTT ATTGAAGAAG CGGACTCCTC AGTACTGTGG GCAAAGATTG AGATCATAGT TATCGATGAT CACTCGACAG ATCCGGTTAC GACAGATACT AGCAAGATCT TAGAATGTGA GAACCAAAGG TCCGGATTCC ATTACTCCCG TCTGCCTCAA AACGTCGGTA GATCAGCTGT ACGAAACTTG CTGGCAGCGA AGGCGAAAGG CGAATTCCTC CTGTTTCTGG ATTGCGATGT TGCTCCGGAT TCTAAACATT TTCTTGCGTC GTATCTTGAG TTTGCTGAGA AGGGCAGCCA CGATGTAATC TGTGGCGGTA GAAGCTATAA CTTACGAGTG ATGACGGATG AAGAGTATGA CTACTACGTC TACTTTGGGA ATGTAAAAGA GGTTAAATCA GCAGCCGAGA GAAATATTAT GCCATGGAGG CACCTTCTGA CTTCGAATGT GATGGTGCGT AAGAAGGCTT TAGAAGAAAC CCCCTTCAAC GAAAACTTTG TGGGCTATGG GTATGAAGAC ATTGAGTGGG GCGTCCGCCT GGCACAGGCA TACAGCATTC TGCACATCGA TAACACTGCC TCTCATCTTG GTTTAGTCAC CAAACAAAAA GCCTACGAAA AAATGCGTGA GTCTGTGTCC AACTACCTGC TCCTTAGGGA CCTTTACCCA CTCGCATTTA ATGTCTCTGC CATAAGCAAA GTGGTACGCC TGCTGGAGTC TGTTCCTGCG CCGCTCCTGG GTGTGATGGA CCGGCTTCTG AAAAACATGT TCCTGTCCAG CGGCAGCAAC CGCCTCGCTT TTCTCTTTTT TCAGCTCGAT TTCGCGGTGC TTCTGGCGTG CACCCTAAGG GCGCGGCAGC GTGACCTGCT GGCCCCAAAG CCGGCGCAAG GGGGGAAGCG TTGA
|
Protein sequence | MGKSERQESV KLSVLVPVYN WDVSLLVRKL IEEADSSVLW AKIEIIVIDD HSTDPVTTDT SKILECENQR SGFHYSRLPQ NVGRSAVRNL LAAKAKGEFL LFLDCDVAPD SKHFLASYLE FAEKGSHDVI CGGRSYNLRV MTDEEYDYYV YFGNVKEVKS AAERNIMPWR HLLTSNVMVR KKALEETPFN ENFVGYGYED IEWGVRLAQA YSILHIDNTA SHLGLVTKQK AYEKMRESVS NYLLLRDLYP LAFNVSAISK VVRLLESVPA PLLGVMDRLL KNMFLSSGSN RLAFLFFQLD FAVLLACTLR ARQRDLLAPK PAQGGKR
|
| |