Gene Emin_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0957 
Symbol 
ID6262913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1056533 
End bp1057942 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content41% 
IMG OID642611437 
Producthypothetical protein 
Protein accessionYP_001875847 
Protein GI187251365 
COG category[S] Function unknown 
COG ID[COG5410] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000035245 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA ATCTGCAAAA GGAAGTGCAA AGTGATTTCT ATACATTCTA TCGGTATGTA 
GCTAAAGGGC ATACCGGATG GTGGCTTAGA GAGTTGTGCG ATTTTTTGCA GTATGAAGTC
TATTTCAGAT TTTTAAAAAA AGAGTTGCCG ATTTCAACCA TAGAAGCACC AGTACAGCAT
GGCAAGAGCA GGGTTTTAAG GCATTTTCTT TGTTGGTTGA TAGGGTTACA TCCTGAGCTG
AGATTTAACT TCTACACCGC AGCAGAAGAT TTAAGGGATG AGACAAAGAT TGATGTCGAT
ATTATTTTGG AGTCGCCAGA ATACATGGCG ATATTTGGAC AGAGAAAGTC CAGCACTTTG
AAAGATACAT CTGAAACATT TCAGATATAC AACCCGGAAG GGCCAAACGG CAAGGTCAAT
TTTAGACTTA TGGGGGCAGG CAATATAGGC CACCCTTCGC ATATCTCTCT TATTGACGAT
CCTTACAGAA ATAAAGAGGA CGCACTTTCT AAGACCATGA GAGACAAGAT TGCCAGCAGG
TTCAGGGCAG ATATTATTAC CAGAAGGCAG GAACGCTCAA TGGTAGTGGT ATTGCACAGC
CGATGGCATG AGAGCGACCT TATAGGCTGG ATAACAAAGA ACATAAGCAA AGATGAGCTT
ATTTCATTTT CTTATCCGGC AATTATGCCA AACGGAGAGG CCCTATTCCC TGAATTAAGG
AGCCTTGCTT TCTTAAATAA GCAAAGGGGC ATATTAACAC CGGGGGAGTT CGCTTCCCTT
TACCAGCAAA GTCCTATTGT TGAGGGCGGT AATAAGTTTA AGGCTGAAAT GTTTGAGTTT
GTTGATGAGT TGCCGGAAAC CTTTGACTAT ACATTCTCCA CATCGGACAC CTCTTATAAA
AAGGGGCAGG AGAACGATTA TACGGTTTGT GCTAACTGGG GCGTGTATAA GGATGACTTA
TATTTAACCA GCATATTCCG TGAGCGTATA GAAGCTAAAG AGGCAGACGG CAGATTAAGG
CCGATTATTA AACAGCACTC TGTCTGGGGA TATAGGAAGG CTTGGATTGA ACCTAAAGGG
CATGGCATAT TTTTAAACCA GACCTTCAGC GATGACAAAG AATTAATGAT GCCGGACGAA
GCTGAATTAA AAGAGTTCTT TAAAGACAGA AGCGTAGATA AGGTGGAAAG GGCAAATAAT
GCAACCGCCT CCCTATCAAA TAGAAAGGTC AAGATATACT CAAGAATACA TTGTAAGGAC
GAGATTTTAA TTGAGGCTTT ATCTTTTCCA AACGGAGACC ATGATGACTT TGTGGATACG
CTTATAGACG CAATAAAAAT TTTAGTTAGT TCTTCTAGCG GTCGTGCAGT TGCAACAGCC
ATACCAATTA GAAGGAATAG GGAAGAATAA
 
Protein sequence
MKKNLQKEVQ SDFYTFYRYV AKGHTGWWLR ELCDFLQYEV YFRFLKKELP ISTIEAPVQH 
GKSRVLRHFL CWLIGLHPEL RFNFYTAAED LRDETKIDVD IILESPEYMA IFGQRKSSTL
KDTSETFQIY NPEGPNGKVN FRLMGAGNIG HPSHISLIDD PYRNKEDALS KTMRDKIASR
FRADIITRRQ ERSMVVVLHS RWHESDLIGW ITKNISKDEL ISFSYPAIMP NGEALFPELR
SLAFLNKQRG ILTPGEFASL YQQSPIVEGG NKFKAEMFEF VDELPETFDY TFSTSDTSYK
KGQENDYTVC ANWGVYKDDL YLTSIFRERI EAKEADGRLR PIIKQHSVWG YRKAWIEPKG
HGIFLNQTFS DDKELMMPDE AELKEFFKDR SVDKVERANN ATASLSNRKV KIYSRIHCKD
EILIEALSFP NGDHDDFVDT LIDAIKILVS SSSGRAVATA IPIRRNREE