Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1521 |
Symbol | |
ID | 8136850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1777163 |
End bp | 1778470 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869133 |
Product | conserved repeat domain protein |
Protein accession | YP_003021335 |
Protein GI | 253700146 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.00000177441 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTAG TCCATCTTAA TAAATACGGC GCAAGGGTTA AACGCCCGTC CTTCCTGCTG GCGCAGCTTT GCTGCGCATT GCTTTTTCTG CTTCTCATGG GAAAAGAGGC CTTTGCGGCA TACCAGGCCG ACCTCATGGT GAGGCTTGCC AACGAGGGCG ATTCCTCCTA CGCGGGCGCC GGGATATTCG AGACCACAGC GGTGATCCAG TCCAAATCCC AGGGTTCCTA TTCCGGGTAC CCGGCGCAGT TCCGGGTCCA GGTGAAGAAC GCGGGCGACC AGACGGACAG CTTCGTCCTC ACCGGTCCCG CCGCGGGGAG CGGCTTCACG GTGAGCTACC GGGACCAGGG AGGTGTGGAG CGTGCGGCCC AGTTCGCCTC AGGAGGGTAC CGGACCCAAT CCCTTGCCCC AGGCGCCTCC GTCGTGCTGC TGGTGCAGGT GACGCTGAGC CGGTTCACCC CGGGGGCGAG CTACCGCGTC CCCGTCACCG CGGTATCAGC GGGCGACCCT GCCGGGGCGG ACCAGGTGAA GACGGAGACC GTCGCCTGTG GCCTCGCCGC CGCTGTCACC GTTTCGGCGC CCCCCGACGG CTCCGGTGCG CCAGGCTCCC TGGTGCTCTA TCCCTACACC GTCACCAACG TCGGCAACGC CGTGAACAGC TTCGCTCTTT CCTTGGAGGG GGGCGCCCCT TGGCCGGGGA TCCTTTACGC GGACGACGGG GCGGGGGGAG GAATCGCAGG TGACGGGGTT AGGCAGCCCG GGGAAGAAAA CCGCTGCGTC TCCACCGGCC CGCTCCCCCC CGGCGCATCC CATCGCTTCT TCCTTGCCGT CGCCATACCC GAGTCGGGGA GCGACGGCGC ACGGGCGGAC GCTCGCCTGA CTGTCACAGG GGAGGGGGCG AGCGGTAACG ATCAGGTCAC CACCACCGCC CTGGCCGCGG TCCTCTCGCT CGTCGACGGC GTGCGCAACC TGACCAAGGG GGGGATCTTC GCCTCGGCCG TCGATGCGGT CCCAGGCGAC CTGCTCCAGT ACCGGATGGC GATCACCAAC AGCGGTTCGG CCCCGGCCAA AGCGGTGCGG GTCGAGAGCC CGCTGCCGGC CGGGTTGAAA CTGACGCCCG ATTCCATGGT GGTGACTTTA GCCGCCGATG GTGAGGGGGC GCCTTGCCCG GCGGCTCAAT GCGGCCGTGC CTGGGGAGGC GAGGGGAACA TCGTCGCCCT TTTGGGCGAG GGGGCCGGCG ACGCCGTCGG CGGCTCTCTG CCGCCGGGAA AGACCCTTTA TCTTTTTTTC AAAGCGCAGG TCGAATGA
|
Protein sequence | MTLVHLNKYG ARVKRPSFLL AQLCCALLFL LLMGKEAFAA YQADLMVRLA NEGDSSYAGA GIFETTAVIQ SKSQGSYSGY PAQFRVQVKN AGDQTDSFVL TGPAAGSGFT VSYRDQGGVE RAAQFASGGY RTQSLAPGAS VVLLVQVTLS RFTPGASYRV PVTAVSAGDP AGADQVKTET VACGLAAAVT VSAPPDGSGA PGSLVLYPYT VTNVGNAVNS FALSLEGGAP WPGILYADDG AGGGIAGDGV RQPGEENRCV STGPLPPGAS HRFFLAVAIP ESGSDGARAD ARLTVTGEGA SGNDQVTTTA LAAVLSLVDG VRNLTKGGIF ASAVDAVPGD LLQYRMAITN SGSAPAKAVR VESPLPAGLK LTPDSMVVTL AADGEGAPCP AAQCGRAWGG EGNIVALLGE GAGDAVGGSL PPGKTLYLFF KAQVE
|
| |