Gene Emin_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1233 
Symbol 
ID6263688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1332918 
End bp1333892 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content38% 
IMG OID642611711 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001876120 
Protein GI187251638 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACAT TACTTCAGGT TAAAGATTTG TCTGTGTTTT TTAAAACGTC CGAAGCAAAT 
ATAAAAGTAT TAAAAGAACT ATGTTACCAA CTTAATGCCG GCGAAACTTT GTCTATAGTG
GGTGAGTCCG GCTCGGGCAA AACGGTACAC GCTCTAAGTA TTTTAAGGTT AATGTCTACA
AACGCAAAAA TAACGGGTGA AATAATTTTT AAAAATGAGA ACCTGTCCGT TTTACCTGAA
AGCAAACTAA AAAATATACG GGGCAAAAAA ATAGCCATGA TTTTCCAAGA TCCTATGACA
AGCCTCAACC CCGTTATGAC AATAGGTTCG CAAATTTACG AAACACTGCT TACGCATAAA
AAAGCTACAA AAAAAAATAT AAAGGAAAAA ACTTTATCTC TCTTAAAATC AGTTGAAATA
CCTGACGCGA AAAAAAAACT TGACTCTTAC CCGCATGAAT TTTCAGGAGG ACAGAGGCAG
CGTATTATGA TAGCCATGGC CCTTGCCTGC GAACCGGACA TTTTAATAGC CGACGAACCT
ACTACCGCCT TAGACGTAAC CATACAAAAA CAAATATTAG CTCTTTTGAA AAAATTACAG
GAAGAAAGAA AAACAGCTTT AATTTTTATA ACGCATAACC TTGCCCTGGT AAACGAACTA
GGCGGAAGAG TGCTTGTTTT ATACGCGGGG CAATGCGTAG AAGAATGCAC AACCGAGCAG
CTTTTTAAAA GACCTCTTCA CCCTTATTCA CAAGGTCTTA TCGCTTGCGC AGCGGGCATA
ACACAAAAAG GCAGGTTAAA GACGATAGAA GGAACACCGC CTGCGCCGGG AACAATTTTT
GAAGGCTGTC CTTTTGAGCC AAGATGCCCT AAAAAACTGG AAAGATGCAA ATTCCAAAAT
CCGGAAATGT TTAATTTAGG ACAAAGAAAA TCTAGATGCT GGTTAAACGC TAATGAAGAA
TATCTTGGAG ATTAA
 
Protein sequence
MDTLLQVKDL SVFFKTSEAN IKVLKELCYQ LNAGETLSIV GESGSGKTVH ALSILRLMST 
NAKITGEIIF KNENLSVLPE SKLKNIRGKK IAMIFQDPMT SLNPVMTIGS QIYETLLTHK
KATKKNIKEK TLSLLKSVEI PDAKKKLDSY PHEFSGGQRQ RIMIAMALAC EPDILIADEP
TTALDVTIQK QILALLKKLQ EERKTALIFI THNLALVNEL GGRVLVLYAG QCVEECTTEQ
LFKRPLHPYS QGLIACAAGI TQKGRLKTIE GTPPAPGTIF EGCPFEPRCP KKLERCKFQN
PEMFNLGQRK SRCWLNANEE YLGD