Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3191 |
Symbol | |
ID | 8138543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3704054 |
End bp | 3705487 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644870796 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_003022976 |
Protein GI | 253701787 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 104 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGC TAGAGATATT GACACTGCTG TTCAAGCGGA AAAAGGCAAT CATCGGGATT TTCCTGCTGC TTTTTATCTC TGCCGCGGCC TATACGCTGG CCCAGGATCC GACCTATGAA GCCAAAGCAA GCATCCTTGT GAAGATGTTT CGTGAGGACC CTTCCAGGCC TGGGATGGGA GCTGACGCGA ACAACCTGCC TCGTATAGTG AGCCAGGACG AGGTGGTCAA TGCGGAGATC CAGATCCTGA CCGGTCGTGA ACTGGCGGAA AAAGTGATGG GGACGCTGAA GATGGAGCGG ATCTATCCCC ACCTTGCCTC AGGGGAACTG CTGCCGGCCG CCCGTATGGA CCAGGCCGTG CAAACCTTTG CCCAGAGCCT GCAAGTGCAG GGGGTGAGGA AATCCAATGT AATTGCCGTC TCTTTTCAGC ATAAAAACCC CGAAATGGCC GCCAAGGCTG TGAACCTTTT GATCGAGGTC TTCAAGGAGA AGCATCTTGC TGTGCACAGC GACCCGCAAT CATCTTTCAT TGCGAGCCAA TTGGCCTCCT TCGAGGGGAA GCTCAAGGAA TCGGAGAAGC AATTGCAGGA TTACCAGCAA CGTACTGGGG TCTATTCGAT CGACGAGCAA AAAACCCTGC TGTTGAGGCA GCGCACCGAG TTGGATTCAG CCTACAGGCA GGCTGTCACG AACGTTCGGG AAAACCAGGA TAAGATCGCA TCCCTGAAGC TGCAGATGAA ATACATCACC GACAACAAGG ACAGGTACAC CCAGACTGAA AGGGACCGTA TCATCATTGA GGCCAAGTCA AAGTTGTTGG AATTGCAGCT CAAGGAACAA GAGCTCAAGA TGAAGTACAC CGACAAAAAC AAGCTCCTTG CCGACACCAA GAAGGAATTG GAGCTTGTCA GCAAGTTCCT CAAGGAACAG GAAGAAATCA TCATACGGAA GGTGAAGACG GCGAACCCGG TTTACCAGAG CATGGAGACG GATCTTTTCC GCGTGCAGGC TGACCTGAAG TCGCAAACGG CAAGGGCCGA GGCGCTTAAG GCCCAGTTGA GGCAGCTTGA TGCGGAAATA GCTACACTTG ACCGGAGCCA GAACCAGATC CAGGATCTGA AGCGGCAGAT AGCGTTGAAC GAAAAAAATT ACATGACTTA CATGGAGAGG AACGAGGATG CACGCATTTC CGATGCAATG AACCGTCTAA AGTTGTCGAA TATCAGCGTA ATCCAGCAGG CAGTGGCACC GGCAAAGCCG ATCAAGCCCA ATAAATCGTT GTCACTTGCC TTGGGTATGG TCTTCGGGAT GGCCGCGGGG CTCCTGTATG CCTATGCAGC GGAAAGACTC AGCCAGACAT TCACGGATCC CAAAAGTGTG GAAAAGTACC TCGAACTGCC GGTTCTCGTG ACAGTCCCGC TAAAAAAGGA TTAA
|
Protein sequence | MSLLEILTLL FKRKKAIIGI FLLLFISAAA YTLAQDPTYE AKASILVKMF REDPSRPGMG ADANNLPRIV SQDEVVNAEI QILTGRELAE KVMGTLKMER IYPHLASGEL LPAARMDQAV QTFAQSLQVQ GVRKSNVIAV SFQHKNPEMA AKAVNLLIEV FKEKHLAVHS DPQSSFIASQ LASFEGKLKE SEKQLQDYQQ RTGVYSIDEQ KTLLLRQRTE LDSAYRQAVT NVRENQDKIA SLKLQMKYIT DNKDRYTQTE RDRIIIEAKS KLLELQLKEQ ELKMKYTDKN KLLADTKKEL ELVSKFLKEQ EEIIIRKVKT ANPVYQSMET DLFRVQADLK SQTARAEALK AQLRQLDAEI ATLDRSQNQI QDLKRQIALN EKNYMTYMER NEDARISDAM NRLKLSNISV IQQAVAPAKP IKPNKSLSLA LGMVFGMAAG LLYAYAAERL SQTFTDPKSV EKYLELPVLV TVPLKKD
|
| |