Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0398 |
Symbol | |
ID | 6262458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 424159 |
End bp | 425523 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642610865 |
Product | DNA repair protein RadA |
Protein accession | YP_001875292 |
Protein GI | 187250810 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0090486 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATTAA AAACAGTATT TGTTTGCCAA AGCTGCGGTT TTAAAGCGCC TAAGTGGACC GGGCAATGCC CTGACTGCTC GGAATGGAAT ACTATGGTTG AGGAAGTTGA GGCCGCCCCT TCGAAAACGG CGTCCAAATC AAAATCTTTT ACAAGTTTTT CCTCTGAGAT AATAAATCTT TCCGACACTA AAACCCTGCG TGAAGAGCGT GAACTTACAG GCATCAGCGA GCTGGACAGG CTTTTGGGCG GCGGCATTGT AAAAGGGCAG CTTATTCTTT TAGCAGGTGC GCCCGGTATA GGCAAATCAA CATTAATGCT TCAAACTGCG GCAAGTTTAT CAAAAGGTAA AAAAGTTTTA TATATTTCGG GTGAGGAAAG TTTAAACCAA ATATCGTCGC GCGCTTTAAG GCTTGGCGTG GAAGGCAAAA ATATTTTCCT TTTGTCTGAA ACAAACATGC AAAATATTAT TGAAGCGTTA GATAAAGTTA AGCCCGAAGT TCTTATAATA GACTCTATTC AAACGGTTTA CCACCCCGAG TTTTCCTCAT CACCCGGAAC AATAGGACAG GTGCGCGAAT GCGCCGCCGA ACTTTTAAGA CTTTGCAAAC CCAAAGGAAC TGTTTTATTT ATTTTAGGAC ACGTTACAAA AGACGGCGAA CTCGCCGGCC CTAAAGTTTT AGAACATATG GTTGACACCG TTTTATATTT TGACACGGAA AAAGATAATA TTTTAAGGCT GCTGCGGCCG CATAAAAACC GTTTTGGCTC AACGCATGAA ATAGGTTTAT TTCAAATGAC GGGGCACGGG CTTACGCCTG TTGAGGACGC CAGCGTTTAT TTCGCAGGAA ACTCAAGAAA CAAGCCTTTA ATAGGAAGGG CTTATTCCAT AGCTTTAGAA GGCACCAGGC CTATTTTAAC GGAAGTTCAG GCTTTGGTTG TGCCTACAAG ATATCCTTTT CCCAGGCGCG TTTCCACGGG TATAGATTTA AACAGATGCC AGGTTTTATT AGCTTCAATA GAAAAAAACG CCGGCATAAG TTTGGAAAAT AAAGATATTT ATATAAGCCT TGCCGGCGGA GTTAAAATAA AAGATCCTGC GCTTGATTTG GCACTGTCGG CCGCCGTAAT AAGCTCTGTT AAAGATATCC CTATATCTAA TACGGACGTT TTTCTGGCTG AAGTAGGCAT CTTGGGGCCG CTTGCTAAAG TCCCTTTGGC GGACAGGCGC ATAGCGGAAG CTGGCCGCCT TGGTTTTAAA AGAGTGTTTA CCTCAATTAT TAGTAAAAAT GAGGAACCGT CCGACAATAA AACGCAGGTT TTACAGTTGG AATCCATAGC CGATTTAGTA TTAAAACTAA AGTAA
|
Protein sequence | MKLKTVFVCQ SCGFKAPKWT GQCPDCSEWN TMVEEVEAAP SKTASKSKSF TSFSSEIINL SDTKTLREER ELTGISELDR LLGGGIVKGQ LILLAGAPGI GKSTLMLQTA ASLSKGKKVL YISGEESLNQ ISSRALRLGV EGKNIFLLSE TNMQNIIEAL DKVKPEVLII DSIQTVYHPE FSSSPGTIGQ VRECAAELLR LCKPKGTVLF ILGHVTKDGE LAGPKVLEHM VDTVLYFDTE KDNILRLLRP HKNRFGSTHE IGLFQMTGHG LTPVEDASVY FAGNSRNKPL IGRAYSIALE GTRPILTEVQ ALVVPTRYPF PRRVSTGIDL NRCQVLLASI EKNAGISLEN KDIYISLAGG VKIKDPALDL ALSAAVISSV KDIPISNTDV FLAEVGILGP LAKVPLADRR IAEAGRLGFK RVFTSIISKN EEPSDNKTQV LQLESIADLV LKLK
|
| |