Gene Emin_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1071 
Symbol 
ID6263834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1166106 
End bp1167134 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content43% 
IMG OID642611551 
Productankyrin 
Protein accessionYP_001875960 
Protein GI187251478 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TATATTTTTT GTTAATTTTA CCTTTAGCAG TATTTTCTTA TGCCCAGGCG 
GAAAGCCAAT CTAATCAAAT GCAGTATACT GCTAAAGACG CTAAACAGAT GACGGAAGAT
CCTTATAAAA CCTATCCCGG TTATCTATTG GGTGAAAGGG ATGGCGTAAA AGGCAGATAC
GTTTATGACA CTTCTTTAAT CCGTTCCGTA AGAGCGCAAA ACGTTGACAG CGTAAAAACG
CTTTTACGCG CCAGGGTTGA CCCTAATGAA AAAAATGACG AAGGCTTTAC CCCTTTGATA
AAAGCGGCTG AAACAGGCAA TTTGGAAATA ATACAACTGC TTGTGGAAGC GGGCGCGGAA
ATTGATAGTC CGGCTCAATA TGGCATAACG CCTTTAATGG TTGCCGCCGC CGGCGGGCAC
CACCAGGTTG TTTCTTATTT AATAAATAAA GGCGCAAGCG TGCACAGGCA GGACGTTTTG
CTTAAAACGC CTCTTGCCCA CGCCGCCGCC GGGGGCAATA AAAAGACGGT AAACATTCTT
TTAAAAGCGG GCGCCAAAAT TGAGCAAAAA GATAAAAGCG GTGAAACTCC TTTAGTTATA
GCGCTTAGAA CAGGCAACGA TGGTTCCGCC GCCGCTTTAA TAAATGCCAA CGCTGACTTA
CAGGCTCCAG CGGGAAGAGA TGTTACCGCT GATTTTTTAG CTGAAAGCTA CGCTGGAAGT
TCACAGGTGC AAAAAGCCAT AAAGCAGAAA GAAAAAGAAG CTGAAAAAGC CGCTAAAGCG
GAAGCTAAAA AGGCCGCTAA AGAAGCTAAA TCCGACGTAA GCGCGGTTAA GGCGGACGCC
TCTAAAACCA AATATATAGG TACGGAAACA ACTTTTAAAA AAGCAGAGGA TATTTCTTTT
GAAAATAATC TCAGAATATC CGGCAGCACG GAGGATATGA AGCAATATGA AAGGCCCGAC
GCCGAGATGG ACGAAGGTTT GATTATTGAA AAAGTGCAGC CCATAATTTT AGAAAAGAAG
AAAAACTAA
 
Protein sequence
MKKIYFLLIL PLAVFSYAQA ESQSNQMQYT AKDAKQMTED PYKTYPGYLL GERDGVKGRY 
VYDTSLIRSV RAQNVDSVKT LLRARVDPNE KNDEGFTPLI KAAETGNLEI IQLLVEAGAE
IDSPAQYGIT PLMVAAAGGH HQVVSYLINK GASVHRQDVL LKTPLAHAAA GGNKKTVNIL
LKAGAKIEQK DKSGETPLVI ALRTGNDGSA AALINANADL QAPAGRDVTA DFLAESYAGS
SQVQKAIKQK EKEAEKAAKA EAKKAAKEAK SDVSAVKADA SKTKYIGTET TFKKAEDISF
ENNLRISGST EDMKQYERPD AEMDEGLIIE KVQPIILEKK KN