Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1171 |
Symbol | |
ID | 8136493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1363066 |
End bp | 1364331 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868782 |
Product | protein of unknown function DUF445 |
Protein accession | YP_003020990 |
Protein GI | 253699801 |
COG category | [S] Function unknown |
COG ID | [COG2733] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 1.23014e-26 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGGATG AGAGAAGAGC TGCACTGAAA AAGAACAAGA TGATCGCCCT AGCCCTCATG GCGGGCGCCG CCGCCCTATT CGTCGTGGCC CGTCTGCAAC GGGGAAGCAG CGGCTGGGAA TGGGTGGCCG CGTTCGCCGA GGCCGCCATG GTCGGCGCCC TTGCCGACTG GTTCGCCGTG GTGGCGCTGT TCAGGCACCC GCTGGGACTT CCCATACCCC ACACCGCCAT CATCGCCGAC AAGAGGGAAA CCATCGCCGA CAACCTGGCC CGCTTCATCC AGGAGAAGTT CCTGACCACA GAGGTCCTGG TGGGAAAGAT GAGGGAATTC GACCCCGCGC GGCAGCTTTG CCGTTATCTC ACCTCGCGGG ATAACGCGCA GGGGCTGGCA AGGGGGGTGG CTCGCATCCT CTCCGAATCG ATCGGTTTCC TGGAGGACGA GCGGGTCAGC AGGATAATCA TGGCCGCAAT GCACGACCGG ATCGGCAAAT TCGACATGGC CGGCTCCGCG GCGAGCCTGC TTGAGTCCCT GAGAAAGGAC GATCGCCATC AGGCCGTTCT GGACGAGATC CTGCGGAGGC TGGGCGCCTG GCTTGGGACC CCCGAGTCGC AGGAGAAGAT CGCCATGGCG CTGGACAACT GGGTCGACAC CGAATACCCG CTCCTCAGCA AGTTCATCCC GAACCGTCCG CAGTTCTCCA GGAACGCCGG CGAAAAGATC ATAGGTAAGG TGAGCGGCTT CCTGCACCTG GTCAACGCCG ACCCGGACCA CGAGCTGCGC CAGGAGTTCG ACCGGGCAGT GGGCGACTTC ATCGTGAAGC TGAAACACGA CTCCGGCACC AGGGAGAAGG TCGCCGAGCT GAAGCGGGAG GTGATAGATA ACGAGCAGCT CTCGCTGTAC GCCAAGAGCC TCGTTGCCGA CCTCAAGCAA TGGATGGTGG AGGACCTGGA CCGGCCATCA TCAAGCATCC GCGCCAAGAT AGCCGACGCC GCCGTTGCCC TTGGCAACAC CCTGTCCGAG AACTCCGATT TGGCCGATTC AGTGAACGAG CACCTGGAGC GCGTGGTGAG AAAGTACGCC GACAACCTGA GAGCCGGGTT CTCCCGTCAT GTAGCAGGTA CCGTAAAAGA GTGGAAGGAA GAGGAGTTTA TCGAGGAGAT CGAGCTCAGC ATAGGGAGCG ATCTGCAGTT CATCAGGATG AACGGCACGC TCGTCGGCGG CATGATAGGC CTATTGTTGC ACGCGGTGTC GCTGCTTCTA GGATAA
|
Protein sequence | MLDERRAALK KNKMIALALM AGAAALFVVA RLQRGSSGWE WVAAFAEAAM VGALADWFAV VALFRHPLGL PIPHTAIIAD KRETIADNLA RFIQEKFLTT EVLVGKMREF DPARQLCRYL TSRDNAQGLA RGVARILSES IGFLEDERVS RIIMAAMHDR IGKFDMAGSA ASLLESLRKD DRHQAVLDEI LRRLGAWLGT PESQEKIAMA LDNWVDTEYP LLSKFIPNRP QFSRNAGEKI IGKVSGFLHL VNADPDHELR QEFDRAVGDF IVKLKHDSGT REKVAELKRE VIDNEQLSLY AKSLVADLKQ WMVEDLDRPS SSIRAKIADA AVALGNTLSE NSDLADSVNE HLERVVRKYA DNLRAGFSRH VAGTVKEWKE EEFIEEIELS IGSDLQFIRM NGTLVGGMIG LLLHAVSLLL G
|
| |