Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1424 |
Symbol | |
ID | 8136752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1675610 |
End bp | 1676770 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644869037 |
Product | extracellular repeat protein, HAF family |
Protein accession | YP_003021240 |
Protein GI | 253700051 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0000000000000371296 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGGAC TCGCTGCAAA ACTTGCAAGG TTAGGTACTT TTTCTTTTAC GCTGCTTTCC CTGATACTTT TGATCGAAAC TGAAAGTTGG GGGAGGGATG GCGCCAGTGT GCCCTCTGAA CGAAGGATTC TCTCTGTATC AGTCCAGGAT CTTGGAACCT TCGGTGGATT ATACAGCTGG GCTTCAGATA TCAATGACAA AGGGCAAGTG GTAGGAACAA GCCAAACGTC TACAGGCGCT AGCCGCGGCT TCATCTGGCA GAACGGGGTG TTAACCGACT TGGGAACCCT GGGATATGCA ACCACAGCTG GACACATCAA TAACAAAGGA CAAGTCGTAG GTGTTAGCAA AGCCTCTGCA ACGGTTACCT CTGCTTTCAT CTGGCAGGAT GGTGTCATGA CCGACATCGG GAGTCTCGGA GGAGGGGGCA CTTCGCCTGC AGATATAAAC GATAAAGGGC AGGTGATAGG GACAAGCAGA ACCTCTCAAG GTGCCATGCA CGCATTCATT TGGCAGGAAG GAGTGATGAC CGATCTAGGA ACTCCTGACG GTGTTTACTC AACGGCACAG GATATAAACG AGCATGGGCA GGTCATAGGC CAGATTGCGT CACCAGGGTC GACCGGGCAT GGGTACATCT GGCACGACGG TATTATGACC GATCTTGGAG AGGGGTTTTA CCCGGAACGT ATTAACGAGA AGGGACAGGT TATCATCAGG GAGTTTGCTT CTTTTGGCAA CCCGCATGGC TTCCTCTGGC AAGACGGCGT GATGACCGAT TTGGGAACCT TAGGTGGAAA CGAGTCTGAC GTCATCGATA TAAACGACAA GGGGCAGGTC GTAGGCCATA GCATGACTAC TTCTGGAGAA ATGCACGTTT TCATCTGGCA TAACGGAGTG ATGACCGACT TGGGAACGAC TCAAATCGGG GGTTTTTACC CGAGAGACAT CAACGATAAA GGAGAGATCC TTGGGGTAAG GAGTCAAGCC TCGGGAATCG TTCAGCCCGT CCTTTGGCAA AAGGGCACCA TAACCGAACT GGGAACGCTT GGCGGGGAGT GCAACGCCCA CGTTCTAAAT AACCACGGGC AAGCAGTCGG AAGTAGCCAA ATTTATGCGA ATTCTTATGA GCGGCATCCC GTTGTCTGGA CGATAAAATA A
|
Protein sequence | MKGLAAKLAR LGTFSFTLLS LILLIETESW GRDGASVPSE RRILSVSVQD LGTFGGLYSW ASDINDKGQV VGTSQTSTGA SRGFIWQNGV LTDLGTLGYA TTAGHINNKG QVVGVSKASA TVTSAFIWQD GVMTDIGSLG GGGTSPADIN DKGQVIGTSR TSQGAMHAFI WQEGVMTDLG TPDGVYSTAQ DINEHGQVIG QIASPGSTGH GYIWHDGIMT DLGEGFYPER INEKGQVIIR EFASFGNPHG FLWQDGVMTD LGTLGGNESD VIDINDKGQV VGHSMTTSGE MHVFIWHNGV MTDLGTTQIG GFYPRDINDK GEILGVRSQA SGIVQPVLWQ KGTITELGTL GGECNAHVLN NHGQAVGSSQ IYANSYERHP VVWTIK
|
| |