Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2054 |
Symbol | |
ID | 8137390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2380013 |
End bp | 2381173 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869669 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003021864 |
Protein GI | 253700675 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 4.01978e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTGC ACACGATATC CATCAACAAT CTGAAGCGCC GCAAGGCCAA GATGGCTTTC CTCACCATCG GCCTCATGGT CGGGATCGCC ACCATCGTCA CCCTGGTGAC CCTCACCAAC TCCATGTCCA CCGATATCGA AAGAAAAATG GAGGAGTTCG GCGCCAACAT CCTGGTCACC CCCCAGAGTA ACGGCCTCGC CATGAACTAC GGCGGCATAA GCCTGGGCGG GATCACCTTC GACCAGCGCG AGATCAAGGA AGAAGACCTG GCCCAGATCC GCAAAATAAA GAACCAAAAG AACATCGCGG TCATCTCGCC CAAGGTGCTG GGCGGGATCA AGGTCGGCAG CCAGGACGTG CTGCTGGTCG GCGTCGACTT CGCCAGCGAA CTGAAGATGA AGCAGTGGTG GCAGATCTTC GGCGACGCCC CGAAGGGAGA CAACGAGCTC CTTTTGGGAA GCGACGCCTC CAACGTCCTC GATGCCGGCT CCGGCGACAG CATCCAGATC AAGGGGGAGA CCTTCAAGGT CGCCGGCGTC CTGAACCAGA CCGGCTCGCA GGACGACTCG CTGGTCTTCG CCTCGCTCCC CAAGGCCCAA AAGCTCCTGG GCAAGGAAGG GAAGATAACC ATGGCCGAGG TCGCCGCACA CTGCTCGGGC TGCCCCATAG GGGACATGGT GACCCAGATC GCCGAGAAGC TCCCCGACAC CAAGGTCTCC GCCATCCAGC AGGTGGTCGA GGGGCGGCTG AAGGCGCTGG ACCACTTCAA GCGCTTCTCC TACGCCATGG CTGCGGTCGT CGTCTTCATC GGCTCGCTCA TCGTCTTCGT CACCATGATG GGTAGCGTCA ACGAGCGCAC CACCGAGATC GGCGTGTTCC GCGCCATCGG TTTCCGCAAA AGCCACATCA TGCGCATCAT CCTCCTGGAA GCCGCGCTGG TGAGCCTCCT GGCTGGGCTT TTGGGTTACG CCGCCGGGAT GGGCGGGGCC AAGCTGGCGC TTCCCTTCAT GGCCGAAACG AAAAACGCGC ATCTGGTCTG GGACAGCACC GTCGCCTTTG GTTCGGTGGG ACTTGCCGTA CTGCTCGGCC TTCTGGCGAG CCTTTACCCC GCGCTTCACG CCAGCAAGAT GGATCCGACC GAGGCCCTCA GGGCTCTTTA A
|
Protein sequence | MKLHTISINN LKRRKAKMAF LTIGLMVGIA TIVTLVTLTN SMSTDIERKM EEFGANILVT PQSNGLAMNY GGISLGGITF DQREIKEEDL AQIRKIKNQK NIAVISPKVL GGIKVGSQDV LLVGVDFASE LKMKQWWQIF GDAPKGDNEL LLGSDASNVL DAGSGDSIQI KGETFKVAGV LNQTGSQDDS LVFASLPKAQ KLLGKEGKIT MAEVAAHCSG CPIGDMVTQI AEKLPDTKVS AIQQVVEGRL KALDHFKRFS YAMAAVVVFI GSLIVFVTMM GSVNERTTEI GVFRAIGFRK SHIMRIILLE AALVSLLAGL LGYAAGMGGA KLALPFMAET KNAHLVWDST VAFGSVGLAV LLGLLASLYP ALHASKMDPT EALRAL
|
| |