Gene Information |
Locus tag | GM21_3184 |
Symbol | |
ID | 8138536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3696282 |
End bp | 3697610 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644870789 |
Product | O-antigen polymerase |
Protein accession | YP_003022969 |
Protein GI | 253701780 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
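The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. Both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(3696282, 3697610)
print(length)                  # 1329, matching "Gene Length"
print(protein_length(length))  # 442, matching "Protein Length"
```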
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 111 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTAGGTA TGCTTGCCTT GCTTGCCTGT CTGACTTTAG TTGCCTGGCT TTTCATCCGG GACAACAGGG TTCGCCCGAT GCCGTCCCCA GAGTCGTGGC TCACGCTTGC CTGGTTCTTT ATCGTGGGCA CCAGGCCGCT CTCGGCATGG TTTTCAATCC CGGAAGAGGA TTCCTCCGAC GCCTTCCTGG AGGGAAGCCC GCTGGACCGG TACTCCCTGC TGGTGCTGAT CCTGTGCGGC TGCCTCGTCC TGTTGCACAA GCGGCCGCAG TGGCGGAATG TGCTTCGTTC CAACCTCTGG TTCACCGGCT TCATCGCCTA CTGCGCTATC AGCGTCATCT GGTCCGACTA CCCTTTTGTC AGCTTCAAGA GATGGGTGCG TGAGTTTGGC AACCTGGTGA TGGTCGTACT CATCCTCACT CAGGACGACC CTGCCAAGAC CTGCCGGGCG CTGCTGGCAA GGTTCGCTTA CCTGGTGATT CCGCTTTCCG CAGTTCTCAT AAGTTATTTT CCCTCCCTTG GCACCTACTA CAGCAGCGAC CTTGCCGGCA TCGCCTATTG CGGGGTCGCC ATCCACAAGA ACATGCTAGG CAGCATCATG TTCATCTCGG CAGTTTACCT TGCCTGGGAA CTCATCTACG TGCCGGACGC CAGGGAGACC AAGGCATGGG ACCTGACCCT GCTTGCCGCG CTCTGGTTGA TGGCGGTATG GCTCATGCTG GTGGCAAGCA GCTCCACAGC TCTCATCTGT CTGGCACTGG GGTCAGCGAT GCTGCTTATG TTGAAGCTTT CCTTTGCCAG AAAGCAGGTC CGGCACCTCG GTGTGTACAG TCTGCTCGGG GCCAGCCTGC TTGTGACGCT ATTCTCACTT CAAGGAGCGG TGGAGATGAT CACGGGGGCG GTGGGGCGGG ACCTGACCTT CACCGGGCGC ACCGAGCTCT GGGCTGACGT CCTCAGGGAG CCCATAAACC CGCTAGTTGG CACCGGATAC CAGAGCTTCT GGCTTGGAGC CCGTGCCGAT GATCTGTGGG AGCGCTACCT TTTCCATCCG CGGCAATCCC ACAACGGTTA TCTGGAAACC TACCTGAACG GCGGGCTGCT TGGTCTGTTC CTGCTGCTCG CGGTGATCGC GTCCATCGGG AAACGCCTGA AAGGGGGGCT TCTATCCGGT AACAATTTCG CTGTCCTGCT CTTTTCCTTC TGGGTGGCGG GGCTTTTTTA CAATTTCACC GAAGCACGCT TCGTAGGTCC CAATCTGATT TGGATCATGC TGAGTCTCGC CGCGTTGTAC CAGCCAGAGA AGGGAGAGTC GCTGCAAACG GCGGGGTAG
Protein sequence | MLGMLALLAC LTLVAWLFIR DNRVRPMPSP ESWLTLAWFF IVGTRPLSAW FSIPEEDSSD AFLEGSPLDR YSLLVLILCG CLVLLHKRPQ WRNVLRSNLW FTGFIAYCAI SVIWSDYPFV SFKRWVREFG NLVMVVLILT QDDPAKTCRA LLARFAYLVI PLSAVLISYF PSLGTYYSSD LAGIAYCGVA IHKNMLGSIM FISAVYLAWE LIYVPDARET KAWDLTLLAA LWLMAVWLML VASSSTALIC LALGSAMLLM LKLSFARKQV RHLGVYSLLG ASLLVTLFSL QGAVEMITGA VGRDLTFTGR TELWADVLRE PINPLVGTGY QSFWLGARAD DLWERYLFHP RQSHNGYLET YLNGGLLGLF LLLAVIASIG KRLKGGLLSG NNFAVLLFSF WVAGLFYNFT EARFVGPNLI WIMLSLAALY QPEKGESLQT AG
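The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the codon assignments of translation table 11, which for sense and stop codons are the same as the standard genetic code. The helper names are hypothetical, and only the first 30 bases of the record are used in the example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these assignments with the standard code; it differs
# only in which codons may act as alternative starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGTTAGGTA TGCTTGCCTT GCTTGCCTGT"))  # MLGMLALLAC
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.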