Gene Emin_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1068 
Symbol 
ID6263434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1161342 
End bp1162577 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content37% 
IMG OID642611548 
Producthypothetical protein 
Protein accessionYP_001875957 
Protein GI187251475 
COG category[L] Replication, recombination and repair 
COG ID[COG2887] RecB family exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACACT TTATTATCAG AACTTTAATA AAGGGCAGTA AAATGGCTAC ATCTAAACTT 
TCGTTTTCCT ATTCTAAAAT GACGCTTTAC CGCGAGTGTC CGCAAAAATA TAAATTCCGT
TATATACATA AAATACCGGA AGCTCCCAAA TATTATTTCG CGTTCGGTTC GGCCATGCAT
AAAGCGCTTG AGTTTATTTA CAGCGTTAAA CAGCCGCCGT TTCCTTCCTT AGAACAAATT
TTAGATTTTT TTGACGCCGA TTGGCGCAGC ACAAGTTATC AGGACAAAGG CTATGCCAGC
ATTTCCAAAG AGCTTGAAGG ATACGACGAA GGCCGCCGCA TTTTAATATC TTACTACCAA
AAACATAAAG ACAGTTTTTT TATTCCTTTA GCCGTTGAAT TTAGAACAAC TTTAGATATT
GATAATCTTT CCCTCATAAG CATTATTGAC CGTGTTGATT ATTTTGGCAA CGGAGCGTTG
GCGATTACGG ATTATAAAAC TGGCAAAACC GTACAGCGCG AGCCCGACCA GCTTTATATG
TACCAAAAGG TAATGCAAAA CTCTCCTGTT TTAAAAAACA TCATAAATGA AAAAGAGGGC
AAAGAAACGG AAGTAAAAAT TGAAAAACTT TCCTTTTACC ATTTACCGTC ATTAAAAGTT
ATGGATTTTG AGCCCGCCCC CCAAAAAGAA ATTGATGTTT TTTGGGAAGG CGTTTTAAAA
ACCGCCGATG AAATACGCGG CAAAAACTTT AACCCAGATC CTTCCGAAAG CAAATGCCGC
TGGTGCGATT ACAAAGCAAT GTGCCCTGTT TTTACGGGTA TGGAGTTTGA GCAGTTTCAA
AAAACGGAAA AGCCTGTATT TTCAGATATC CCGGTAACAA ATGAGGATAT TTTATCTTCA
AAAATAGACG AACTTGCCGA AACGGGGCAA AAATATTCTT CTTTAAAAAA AGAAATTATT
TCCTTAATGA AACAAAACAA CTACAACCAA CATTTCGGCT CAAATTACAA AGTTGAGCTT
AAACAAAAAG AATTTTTAGA TTTTGAAGAC AAAGAAAAAG TTATAGAATT TTTAAAAGAA
AAAAATCTTA TTAAAAAAAC GCTTGTACCT ACGCAATGCT CAATAGAAGC GCTTTTGGAC
GACCCTTCCG TACCGGAAGA CGATAAAGCC CGCCTTAAAG AGCTTGGCGT TAACCGCGTT
TCGGACGAGC TTCAAATAAA AAAGGTTGAA AAATAG
 
Protein sequence
MIHFIIRTLI KGSKMATSKL SFSYSKMTLY RECPQKYKFR YIHKIPEAPK YYFAFGSAMH 
KALEFIYSVK QPPFPSLEQI LDFFDADWRS TSYQDKGYAS ISKELEGYDE GRRILISYYQ
KHKDSFFIPL AVEFRTTLDI DNLSLISIID RVDYFGNGAL AITDYKTGKT VQREPDQLYM
YQKVMQNSPV LKNIINEKEG KETEVKIEKL SFYHLPSLKV MDFEPAPQKE IDVFWEGVLK
TADEIRGKNF NPDPSESKCR WCDYKAMCPV FTGMEFEQFQ KTEKPVFSDI PVTNEDILSS
KIDELAETGQ KYSSLKKEII SLMKQNNYNQ HFGSNYKVEL KQKEFLDFED KEKVIEFLKE
KNLIKKTLVP TQCSIEALLD DPSVPEDDKA RLKELGVNRV SDELQIKKVE K