Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0957 |
Symbol | |
ID | 6262913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1056533 |
End bp | 1057942 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642611437 |
Product | hypothetical protein |
Protein accession | YP_001875847 |
Protein GI | 187251365 |
COG category | [S] Function unknown |
COG ID | [COG5410] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000035245 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATCTGCAAAA GGAAGTGCAA AGTGATTTCT ATACATTCTA TCGGTATGTA GCTAAAGGGC ATACCGGATG GTGGCTTAGA GAGTTGTGCG ATTTTTTGCA GTATGAAGTC TATTTCAGAT TTTTAAAAAA AGAGTTGCCG ATTTCAACCA TAGAAGCACC AGTACAGCAT GGCAAGAGCA GGGTTTTAAG GCATTTTCTT TGTTGGTTGA TAGGGTTACA TCCTGAGCTG AGATTTAACT TCTACACCGC AGCAGAAGAT TTAAGGGATG AGACAAAGAT TGATGTCGAT ATTATTTTGG AGTCGCCAGA ATACATGGCG ATATTTGGAC AGAGAAAGTC CAGCACTTTG AAAGATACAT CTGAAACATT TCAGATATAC AACCCGGAAG GGCCAAACGG CAAGGTCAAT TTTAGACTTA TGGGGGCAGG CAATATAGGC CACCCTTCGC ATATCTCTCT TATTGACGAT CCTTACAGAA ATAAAGAGGA CGCACTTTCT AAGACCATGA GAGACAAGAT TGCCAGCAGG TTCAGGGCAG ATATTATTAC CAGAAGGCAG GAACGCTCAA TGGTAGTGGT ATTGCACAGC CGATGGCATG AGAGCGACCT TATAGGCTGG ATAACAAAGA ACATAAGCAA AGATGAGCTT ATTTCATTTT CTTATCCGGC AATTATGCCA AACGGAGAGG CCCTATTCCC TGAATTAAGG AGCCTTGCTT TCTTAAATAA GCAAAGGGGC ATATTAACAC CGGGGGAGTT CGCTTCCCTT TACCAGCAAA GTCCTATTGT TGAGGGCGGT AATAAGTTTA AGGCTGAAAT GTTTGAGTTT GTTGATGAGT TGCCGGAAAC CTTTGACTAT ACATTCTCCA CATCGGACAC CTCTTATAAA AAGGGGCAGG AGAACGATTA TACGGTTTGT GCTAACTGGG GCGTGTATAA GGATGACTTA TATTTAACCA GCATATTCCG TGAGCGTATA GAAGCTAAAG AGGCAGACGG CAGATTAAGG CCGATTATTA AACAGCACTC TGTCTGGGGA TATAGGAAGG CTTGGATTGA ACCTAAAGGG CATGGCATAT TTTTAAACCA GACCTTCAGC GATGACAAAG AATTAATGAT GCCGGACGAA GCTGAATTAA AAGAGTTCTT TAAAGACAGA AGCGTAGATA AGGTGGAAAG GGCAAATAAT GCAACCGCCT CCCTATCAAA TAGAAAGGTC AAGATATACT CAAGAATACA TTGTAAGGAC GAGATTTTAA TTGAGGCTTT ATCTTTTCCA AACGGAGACC ATGATGACTT TGTGGATACG CTTATAGACG CAATAAAAAT TTTAGTTAGT TCTTCTAGCG GTCGTGCAGT TGCAACAGCC ATACCAATTA GAAGGAATAG GGAAGAATAA
|
Protein sequence | MKKNLQKEVQ SDFYTFYRYV AKGHTGWWLR ELCDFLQYEV YFRFLKKELP ISTIEAPVQH GKSRVLRHFL CWLIGLHPEL RFNFYTAAED LRDETKIDVD IILESPEYMA IFGQRKSSTL KDTSETFQIY NPEGPNGKVN FRLMGAGNIG HPSHISLIDD PYRNKEDALS KTMRDKIASR FRADIITRRQ ERSMVVVLHS RWHESDLIGW ITKNISKDEL ISFSYPAIMP NGEALFPELR SLAFLNKQRG ILTPGEFASL YQQSPIVEGG NKFKAEMFEF VDELPETFDY TFSTSDTSYK KGQENDYTVC ANWGVYKDDL YLTSIFRERI EAKEADGRLR PIIKQHSVWG YRKAWIEPKG HGIFLNQTFS DDKELMMPDE AELKEFFKDR SVDKVERANN ATASLSNRKV KIYSRIHCKD EILIEALSFP NGDHDDFVDT LIDAIKILVS SSSGRAVATA IPIRRNREE
|
| |