Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3162 |
Symbol | |
ID | 8138514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3672917 |
End bp | 3674104 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644870767 |
Product | NHL repeat containing protein |
Protein accession | YP_003022947 |
Protein GI | 253701758 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 119 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAGCA ACATCACCGC CAACTGCCGT CGTCATCTTG CCAAAAGCTC CATCGTTGTG ATTGCCTTGC TGCTCTCAGC CTGTGCTTCC CAAAAGAAGG CCTTCGCGCC GGTCTTTTTC CCGCCCGAGC CGAGCCCCCC CAGAATCCAG TACCTGATGG GAATCTCGGT ATCGACCGAC GTAGAGGAGG AGAAGAAAAA CGACTTCTCC CTGGTCGTCA CCGGCAAGGG AAGCTCCGAG GCCAAGCGCA TGATCGCCAA GCCCTACGGC ATTACCAGCG CCAACGGCAA GATCTTCGTC TGCGACATCG GTGTGGGCAA CCTCGTCGTC ATCGACCCCG CAAAAAAGAC CTTCGATTAC CTGAAGGGGA ATAACGGTCT GGGCAAGCTG AAAAAGCCGG CAAACGTAGG CCTGGACCAA TACGGGAACC TCTTCGTCGC CGACGCCGCC AGGAGAGAGA TCATGGTCTA CAACGCGGAG GGGCAGTTCG TGCGCTCCTT CGGCAAAGAA GAGAACATGA AGCCCTCCGA CGTGGCGATC GACGGCGACC TGGTTTACGT CCTCGACCTC CAGAAGACCA AGAACGAGAT CAAGGTGTTC GATCGCGAGA GCGGCAAGCT GACTAACACT TTCGGCAAGC GCAGCGACAA CGCAGAGGGG GTCAACATCC CCACCAACTT CACCATGGAC CACAAGGGGG ACATCTACGT CACCAACGCC GGCAACGGCA AAGTGATGAA GTTCGACCGC GACGGGCATC TGCTTCTCAC CTTCGGGGAC CTGGGGGACA TCTCCGGCAT GTTCTCGCGT CCCAAGGGGG TGGCAGTCGA CCGTGAAAAT CGCATCTACG TCGTCGACGG CGGCAACCAG AACGTGCAGG TGTTCAACGA AAAGGGGCGC ATTCTCACTT CCTTCGGCGA TCCGGGCTTG ATGCGCGGGT CCCTCAACCT GCCGGTATCG GTGACGGTCA CCCAGGAAAA CCTGGATTAT TTCCAGAAAT TCGCCGCCCC GGGGTTCACC GTAGAATCGG TCATCCTGGT CACCAACCAG TACGGCGAGG ATAAGATCTC CGTTTACGGC ATGGGCAAGA TGGCGGGGCG CGACTACAGC GAACGCCCCC CTACCGCCAA AGCTACCGAG GGCCCGAAGG CTAAGGAAAC CGGGACGCCG GAGACCCAGA CGAAGTAA
|
Protein sequence | MLSNITANCR RHLAKSSIVV IALLLSACAS QKKAFAPVFF PPEPSPPRIQ YLMGISVSTD VEEEKKNDFS LVVTGKGSSE AKRMIAKPYG ITSANGKIFV CDIGVGNLVV IDPAKKTFDY LKGNNGLGKL KKPANVGLDQ YGNLFVADAA RREIMVYNAE GQFVRSFGKE ENMKPSDVAI DGDLVYVLDL QKTKNEIKVF DRESGKLTNT FGKRSDNAEG VNIPTNFTMD HKGDIYVTNA GNGKVMKFDR DGHLLLTFGD LGDISGMFSR PKGVAVDREN RIYVVDGGNQ NVQVFNEKGR ILTSFGDPGL MRGSLNLPVS VTVTQENLDY FQKFAAPGFT VESVILVTNQ YGEDKISVYG MGKMAGRDYS ERPPTAKATE GPKAKETGTP ETQTK
|
| |