Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0967 |
Symbol | |
ID | 8136288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1144013 |
End bp | 1145332 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868581 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003020790 |
Protein GI | 253699601 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 0.854838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGACACAA TCTTCGTCGA ACTGCTTGTC ATCGCCATCC TCATACTGCT GAACGGATTT TTCTCCTGCG CCGAATTCGC CATCATCTCC ATCAGAAAGA GCCGCGTTGC TCAACTGGTC GCCCTGGGCG ACCAGCGTGC CGCGCTCGTG GAATCCCTGC AGAAGGATCC GCACCGCCTG CTGGCCATCG TGCAGATCGG CGTCACCGTG GTCGGATCGA CCGCCTCCGC GGTGGGGGGC GTGATAGCGG TCGACTACAT CCGCCCCATC CTGCAGCTAT CCCCCTTCGC CATGATACGC AACGCCGCCG AACCGCTCTC CCTCACCATG GTGGTAGCCG TGATCTCGTA TCTCCTGCTC ATCCTGGGAG AACTGGTTCC GAAGACCATC GGCCTGCAGT ACGCCGACCC CGTGGCGCTT CGCATCGCCA AGACCATCAC CTTCCTGGCG AGGATCGCCA GTGTCCTGGT ATCGTTGCTC AGCTACTCCA CCAGGGGGGC GCTGGCGCTG TTCCGGATCA AGGGTGAGGG AAAGGCCTTC ATGACGCGGG AGGAGGTGCA GCATATCGTC GCCGAGGGGC ATGAGAGCGG CATCTTCAGC GAGGCCGAGC ACACCTTCAT CGACAACCTC TTCGACTTCA CCCATACCGC CGTAAGGGAG GTGATGGTCC CCCGCACCAG GGTGGTCGCC TTCGACCTCA ACCTTTCCAA CGAGGAGATC CTGAACCAGG TCCTGGACAA CATGTACTCC CGTTACCCGG TGTACGTGGG GAGCATCGAG GAGACGGTCG GTTTCATCCA CGGCAAGGAC CTCTTAGGGA GGATGGTGCG CGAGCCGGAT TTCGATATCC GCTCCATCGT CCGTCCCCCC TTCTTCGTTC CGGAGGGGAA GAAGGTGAGC GAACTCTTGA AGGAGATGCA GAAGACCCGC GTGCACATGG CTTTCGTGGT GGATGAGTAC GGCAGCATCA GCGGCATAGT GACCACCGAG GACCTGCTCG AGGAGCTGGT CGGCGAGATC GAGGACGAGC ACGATGTCGG CGAGCCGAGC ACGGTGCAGA TCCTGGCCGA CGGGAGCTAC CTGGTGGATG CCTTCATCTC CGTTTCCGAT CTGGAGGACC TGCTGGAGAT GGATCTTGGC GAGGATCTTC CCTTCGACAC CCTGGCCGGG CTGATACTGG ACCGCATCGG CGGGTTTCCG GAGCAGGGCG AGAAGCTTCA GTTGGGCGAG TACACCCTCA TCTGCGAGGA AGTCACCCGC ACCGGCATCA CCAAGGTGAG AATCGGGAAA ACAGAGGGGA AATCTGGGGC TGGGGACTAG
|
Protein sequence | MDTIFVELLV IAILILLNGF FSCAEFAIIS IRKSRVAQLV ALGDQRAALV ESLQKDPHRL LAIVQIGVTV VGSTASAVGG VIAVDYIRPI LQLSPFAMIR NAAEPLSLTM VVAVISYLLL ILGELVPKTI GLQYADPVAL RIAKTITFLA RIASVLVSLL SYSTRGALAL FRIKGEGKAF MTREEVQHIV AEGHESGIFS EAEHTFIDNL FDFTHTAVRE VMVPRTRVVA FDLNLSNEEI LNQVLDNMYS RYPVYVGSIE ETVGFIHGKD LLGRMVREPD FDIRSIVRPP FFVPEGKKVS ELLKEMQKTR VHMAFVVDEY GSISGIVTTE DLLEELVGEI EDEHDVGEPS TVQILADGSY LVDAFISVSD LEDLLEMDLG EDLPFDTLAG LILDRIGGFP EQGEKLQLGE YTLICEEVTR TGITKVRIGK TEGKSGAGD
|
| |