Gene Information |
Locus tag | GM21_2231 |
Symbol | |
ID | 8137570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2602328 |
End bp | 2603443 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644869846 |
Product | glycosyl transferase family 9 |
Protein accession | YP_003022038 |
Protein GI | 253700849 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02195] lipopolysaccharide heptosyltransferase II |
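
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; both assumptions agree with the values listed here, but the record itself does not state them. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a complete CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2602328, 2603443)
print(length)                  # 1116, matching "Gene Length"
print(protein_length(length))  # 371, matching "Protein Length"
```

Because the gene lies on the minus strand, the coding sequence reads from End bp back toward Start bp on the replicon; the length calculation is unaffected.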
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.00858313 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGGGCGG TGCCCCAGCC AAGGTCTTGC CTTCGTGGCG TCTCCCTCAC CCTGTCCCTC CCCCAGAGGG GGAGGGGACC CGAAGTTGCC CGGAATTGCT CGAATAACCT TCCCAAAAGG GGGGTGCCGG ATTCCTCGTT TCCCCCCCGA CGTTTCCTGG TCATCCGTCC CGGCGGCATC GGCGACGCGG TCCTCCTGGT CCCGGCGTTG ACGGCGCTGC AAAAAGCTTT TCCCGGCTGC CGCATAGACG TTCTCGCCGA AAGCCGCAAC GCCGCCGCCT TTCTCATGTG CCCCGGGCTG AACTGGGTGT ACCGCTACGA TTGTCTTTCC GACATGGCGG CTTTGCTCCG CACCCCTTTC GACGTGGTGA TCGACACCGA GCAGTGGTAC CGCCTCTCCG CGGTCATCGC CAGGGTGGTC CGCGCCCGGC GCTCCATCGG TTTTTGCAGC AACGAAAGGG GGAGGCTCTT CACCGACCCC GTGCCTTACC CCTTGCAGGA TTACGAACTC CTCTCCTTCT TCAAGCTCCT AGCCCCGCTC AAGGTGCAGC CTCCCCCGGA ACTGCCGGCT CCCTTTCTTG AACTCCCCGC CGGGGCGAAG GAAGGGGCGC GGCGACTTTT GGCCCCGCTG GCCGGACAGA AATTCGTCGC CATCTTCCCC GGAGCGAGCG TTGCCGAGAA ACAATGGGGG AGGGAGAACT TCCGGCAGGT GGCGGAGAGC CTTTTCGCGG CGGGGATCGC GGTGGTTGTA GTCGGCGCAG ACGACGCCCG CGCCTCGGGC GACTTCATTG CCCGCGGCGG TCTTGCCCTG AACCTGGCGG GGAAGGGGGG GCTCATGGAA AGCGCCGCCG TCCTCGCAAA GGCGCGAGTC CTATTAAGCG GCGACTCGGG GCTGTTGCAC ATCGCCGCGG GGCTTGGCAC CGCGACCGTT TCGCTTTTCG GTGCCAGCGA CGCAGCCAAG TGGGCTCCCA AGGGCGAACG GCACGCCGTA TTCAGTTCGT CGCTTTCCTG CGCCCCCTGC TCCAGTTACG GAACCATCCG CTGCAGCGCG GGCGCCCGCT GTCTCGATGC CGCGCCGTCA GAAGTGACCG CCGCGCTTTT GAGGCTGTGG GAATAG
|
Protein sequence | MRAVPQPRSC LRGVSLTLSL PQRGRGPEVA RNCSNNLPKR GVPDSSFPPR RFLVIRPGGI GDAVLLVPAL TALQKAFPGC RIDVLAESRN AAAFLMCPGL NWVYRYDCLS DMAALLRTPF DVVIDTEQWY RLSAVIARVV RARRSIGFCS NERGRLFTDP VPYPLQDYEL LSFFKLLAPL KVQPPPELPA PFLELPAGAK EGARRLLAPL AGQKFVAIFP GASVAEKQWG RENFRQVAES LFAAGIAVVV VGADDARASG DFIARGGLAL NLAGKGGLME SAAVLAKARV LLSGDSGLLH IAAGLGTATV SLFGASDAAK WAPKGERHAV FSSSLSCAPC SSYGTIRCSA GARCLDAAPS EVTAALLRLW E
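
The relationship between the gene sequence, the protein sequence, Translation table 11 and the GC content field can be illustrated with a short, dependency-free sketch. It assumes the amino acid assignments of NCBI translation table 11 (identical to the standard code) and its set of alternative start codons, which is why the leading GTG is read as M rather than V. The helper names `clean`, `translate` and `gc_content` are hypothetical, and the whitespace handling assumes the 10-base grouping used in the sequence fields above.

```python
from itertools import product

# NCBI codon ordering: base 1 varies slowest, base 3 fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11 (assumption labeled above).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("GTGAGGGCGG TGCCCCAGCC AAGGTCTTGC"))  # MRAVPQPRSC
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_content` on the same input can be compared with the reported 66%.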