Gene Emin_0157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0157 
Symbol 
ID6263052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp167154 
End bp168272 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content45% 
IMG OID642610621 
ProductDNA protecting protein DprA 
Protein accessionYP_001875059 
Protein GI187250577 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.835588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACAG ATTCGGAAAG ACTTGCCAGA ATAAAATTAA ACGCTTTTAC CTATTTGCGT 
ACGGATTGGG CCATGCGCAT GATAGAAGTT TTTGGCAGCG CGGAAATGAT TTTAAAAACT
TCAGCTAAGG ATTTGGCGGC GCAGGGCGGA ATGTCGGAAG ATACTGCCGC CAATTTACTT
AAAGAAGCGC ACGCGCTTGA CGCTGAGAAG GAAGCGGAAC TTACAAATAA AGCGGGCGGC
AAAATTTTGC TTTTGGAAGA TTATGAGTAT CCCCAAAGTT TAAAAGATAT TAAGGACCCG
CCTTTTGTTT TATATGTGCG CGGAACATTA GAAGCGCGCG GCCCTAAAGT GGCAATGGTG
GGCACAAGGC TTATAACGCC TTACGGAAGG AGATGCGCTA AAAAATTTGC CACGGAAATC
GCGCAGGCGG GATGCGTTGT AGTAAGCGGT TTGGCCCGCG GAGTTGACAG TGTATGCCAG
CAGGCGGTGG TTGATATTAA TAAACCCACC TGGGCTGTTG TGGGCACTGG AATAGGGCGT
TGTTACCCGG CTGAAAATAA AGCTTTGGCA AACGCTGTTT TAGAAAACGG CGGCGCCATA
ATTTCGGAAC TATCTTTTAA TAAACCGCCG AACGCTTTTC ATTTTCCCAG GCGCAACAGA
ATAATTTCTG CCCTTTCAAG CGTGGTGGTT ATTATAGAAG GTAAAGTGCG CTCAGGCGCT
TTGATTACGG CAAAACTGGC CGCTGAGCAG GGTAAAGATA TTTTAGCGGT GCCCGGCTCT
ATAGAAAGCG AACAGAGCGG CGGCCCCAAT ATGTTAATTA AAGACGGCGC GCACGCCTTG
CTTGAAACGC GCGATATTAT AGACCTTATT CCTTTTGAGG AGCGCTTTGG CCTTAATGAG
GAAGTTTTTG AAAAAGATTC GGTTCAAAAA GAAATACTTG ATTTAACTGA AACGGAAAAA
CAATTTTTAG AAGTTATCGG CCCCGGTGAA CATACGATCG ATGATATTGT TGAAGCTTTG
GCAACGGATG TTCCTTCGGC CGCGGCGGTA TTATTTGAAA TGGAGATTAA AGGCGTTTTA
ATGTGCAAAG ACGGCAAATA CAGCCGTAAC AATTTTTAA
 
Protein sequence
MITDSERLAR IKLNAFTYLR TDWAMRMIEV FGSAEMILKT SAKDLAAQGG MSEDTAANLL 
KEAHALDAEK EAELTNKAGG KILLLEDYEY PQSLKDIKDP PFVLYVRGTL EARGPKVAMV
GTRLITPYGR RCAKKFATEI AQAGCVVVSG LARGVDSVCQ QAVVDINKPT WAVVGTGIGR
CYPAENKALA NAVLENGGAI ISELSFNKPP NAFHFPRRNR IISALSSVVV IIEGKVRSGA
LITAKLAAEQ GKDILAVPGS IESEQSGGPN MLIKDGAHAL LETRDIIDLI PFEERFGLNE
EVFEKDSVQK EILDLTETEK QFLEVIGPGE HTIDDIVEAL ATDVPSAAAV LFEMEIKGVL
MCKDGKYSRN NF