Gene Emin_0398 details

Gene Information

Locus tag: Emin_0398
Symbol:
ID: 6262458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 424159
End bp: 425523
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 43%
IMG OID: 642610865
Product: DNA repair protein RadA
Protein accession: YP_001875292
Protein GI: 187250810
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1066] Predicted ATP-dependent serine protease
TIGRFAM ID: [TIGR00416] DNA repair protein RadA
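
The start, end, gene length and protein length values in this record are related by simple arithmetic. The short Python check below is only a sketch. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 424159
end_bp = 425523
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: inclusive, 1-based coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1365 0 454
```

Under those assumptions the coordinate span matches the listed gene length, and the gene length is consistent with the listed protein length.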


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0090486
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTAA AAACAGTATT TGTTTGCCAA AGCTGCGGTT TTAAAGCGCC TAAGTGGACC 
GGGCAATGCC CTGACTGCTC GGAATGGAAT ACTATGGTTG AGGAAGTTGA GGCCGCCCCT
TCGAAAACGG CGTCCAAATC AAAATCTTTT ACAAGTTTTT CCTCTGAGAT AATAAATCTT
TCCGACACTA AAACCCTGCG TGAAGAGCGT GAACTTACAG GCATCAGCGA GCTGGACAGG
CTTTTGGGCG GCGGCATTGT AAAAGGGCAG CTTATTCTTT TAGCAGGTGC GCCCGGTATA
GGCAAATCAA CATTAATGCT TCAAACTGCG GCAAGTTTAT CAAAAGGTAA AAAAGTTTTA
TATATTTCGG GTGAGGAAAG TTTAAACCAA ATATCGTCGC GCGCTTTAAG GCTTGGCGTG
GAAGGCAAAA ATATTTTCCT TTTGTCTGAA ACAAACATGC AAAATATTAT TGAAGCGTTA
GATAAAGTTA AGCCCGAAGT TCTTATAATA GACTCTATTC AAACGGTTTA CCACCCCGAG
TTTTCCTCAT CACCCGGAAC AATAGGACAG GTGCGCGAAT GCGCCGCCGA ACTTTTAAGA
CTTTGCAAAC CCAAAGGAAC TGTTTTATTT ATTTTAGGAC ACGTTACAAA AGACGGCGAA
CTCGCCGGCC CTAAAGTTTT AGAACATATG GTTGACACCG TTTTATATTT TGACACGGAA
AAAGATAATA TTTTAAGGCT GCTGCGGCCG CATAAAAACC GTTTTGGCTC AACGCATGAA
ATAGGTTTAT TTCAAATGAC GGGGCACGGG CTTACGCCTG TTGAGGACGC CAGCGTTTAT
TTCGCAGGAA ACTCAAGAAA CAAGCCTTTA ATAGGAAGGG CTTATTCCAT AGCTTTAGAA
GGCACCAGGC CTATTTTAAC GGAAGTTCAG GCTTTGGTTG TGCCTACAAG ATATCCTTTT
CCCAGGCGCG TTTCCACGGG TATAGATTTA AACAGATGCC AGGTTTTATT AGCTTCAATA
GAAAAAAACG CCGGCATAAG TTTGGAAAAT AAAGATATTT ATATAAGCCT TGCCGGCGGA
GTTAAAATAA AAGATCCTGC GCTTGATTTG GCACTGTCGG CCGCCGTAAT AAGCTCTGTT
AAAGATATCC CTATATCTAA TACGGACGTT TTTCTGGCTG AAGTAGGCAT CTTGGGGCCG
CTTGCTAAAG TCCCTTTGGC GGACAGGCGC ATAGCGGAAG CTGGCCGCCT TGGTTTTAAA
AGAGTGTTTA CCTCAATTAT TAGTAAAAAT GAGGAACCGT CCGACAATAA AACGCAGGTT
TTACAGTTGG AATCCATAGC CGATTTAGTA TTAAAACTAA AGTAA
 
Protein sequence
MKLKTVFVCQ SCGFKAPKWT GQCPDCSEWN TMVEEVEAAP SKTASKSKSF TSFSSEIINL 
SDTKTLREER ELTGISELDR LLGGGIVKGQ LILLAGAPGI GKSTLMLQTA ASLSKGKKVL
YISGEESLNQ ISSRALRLGV EGKNIFLLSE TNMQNIIEAL DKVKPEVLII DSIQTVYHPE
FSSSPGTIGQ VRECAAELLR LCKPKGTVLF ILGHVTKDGE LAGPKVLEHM VDTVLYFDTE
KDNILRLLRP HKNRFGSTHE IGLFQMTGHG LTPVEDASVY FAGNSRNKPL IGRAYSIALE
GTRPILTEVQ ALVVPTRYPF PRRVSTGIDL NRCQVLLASI EKNAGISLEN KDIYISLAGG
VKIKDPALDL ALSAAVISSV KDIPISNTDV FLAEVGILGP LAKVPLADRR IAEAGRLGFK
RVFTSIISKN EEPSDNKTQV LQLESIADLV LKLK
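
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following Python sketch shows one way to strip that formatting, compute a GC percentage, and translate a coding sequence. The function names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical helpers written for this illustration. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs mainly in which start codons are permitted. This sketch does not handle alternative start codons, which is acceptable here only because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a sequence."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = (
    "ATGAAATTAA AAACAGTATT TGTTTGCCAA AGCTGCGGTT TTAAAGCGCC TAAGTGGACC"
)
print(translate(first_line))  # MKLKTVFVCQSCGFKAPKWT
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field in the record.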