Gene Emin_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0499 
Symbol 
ID6262709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp547015 
End bp548508 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content42% 
IMG OID642610969 
Producttranscription termination factor Rho 
Protein accessionYP_001875392 
Protein GI187250910 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000619502 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAA AAAGTCTTGA AGAAAATAAA GAAAAAGTTA CGACTAAAGA AGTCTCCAAA 
GAAGTTAAAG AAATAAAAGA TGTAAAGGAA AGCAAAGAGG CTAAAGAAGT CAAAGAACCT
AAGGAAACAA AAGAGGTTAA AGACGCTTCC GCGTCCAAAA CACAGCAGCG TCCTTTTACT
CGCAACGGCA ACGGTTACCA GCAACAACAG GTGCGCACGC CTAACGGCGC AAAAGTTATG
GACGCGCGTG AATTAGGCAA ATTAAGCGTG GCTGAACTTA CTAAACTTGC GGTCAACTTA
AATATTGAAG AAACTTCAGG CTTAAAAAAA CAGGCTTTAA TAGGCAAAAT TATAGCAGTG
CAAGCTAAAC AAAACGGTTC TATTTACGGC GGCGGCGTTT TGGAAATTTT GCCCGACGGT
TTCGGCTTTC TCCGCTCCGA GGATAATAAT TACTTAGCCG GGCCGGAAGA TATTTATGTT
TCGCCTTCGC AAATTAAACG TTTTGGTTTA AGAAAGGGCG ATACTATTGA AGGCCTTATC
CGCCCGCCAA AAGACGGTGA AAGATTTTTT GCCATGTTGC AGGTGCAAAA AGTTAATGAC
ATAGAGGTTG AAAAAATTTA TAACAGGCCT TTGTTTGACA ACTTAACGCC GCTGCATCCT
AATAAACGCT TTACTCTTGA ATTAGATAAA AACGATATAA CCCAGCGTAT TATTGACCTT
ATGGCCCCCA TAGGCCGCGG GCAGAGGGCT CTTATAGTGG CGCCTCCAAA AACCGGTAAA
ACTATGATGA TGCAAAGCAT AGCAAACTCA ATAACAAACA ATTATAAAGA AGTAAAACTT
ATAGTTTTGT TAATTGACGA AAGGCCCGAA GAAGTTACCG ACATGAGCCG CAGCGTTAAG
GGCGAGGTTA TTGCCAGCAC TTTTGACGAA GCCCCGGACA GACACGTACA AGTGGCCGAA
ATGGCTTTGG AAAGAGCAAA AAGACTTGTT GAGCAAGGCA CAGACGTTGT GATTTTGCTT
GACTCTATTA CACGTCTTGC TCGCGCTTAC AACACGGTTA CGCCTTCAAG CGGACGTGTT
CTTACCGGCG GTTTGGAAGC GACCTCCTTA CAGCGCCCTA AAAGATTTTT AGGCGCGGCA
AGAAATATGG AAGAGGGCGG TTCTCTTACA ATTATAGCTA CGGCTCTTGT TGAAACGGGC
AGCAGAATGG ACGAAGTTAT TTTCGAAGAA TTTAAAGGCA CGGGCAACAG TGAAATATGT
TTGGACAGAA AACTTTCCGA CAGACGTCTT TTCCCCGCTA TTGATTTAAA CAGAAGTTCA
ACCAGAAAGG AGGATTTGCT TCTTTCCGAA GATGAACTTA ACAAAGTGTG GATTATACGC
AAAGTTCTTG CTCCTTTAAC CTCAGTTGAC GCCATGACGC TTCTTAGAGA TAAAATTGTG
GCCAGTAAGT CTAATAAGGA CTTTTTAAAA CAAATGGAAG TTTCAAGCTT ATAA
 
Protein sequence
MTEKSLEENK EKVTTKEVSK EVKEIKDVKE SKEAKEVKEP KETKEVKDAS ASKTQQRPFT 
RNGNGYQQQQ VRTPNGAKVM DARELGKLSV AELTKLAVNL NIEETSGLKK QALIGKIIAV
QAKQNGSIYG GGVLEILPDG FGFLRSEDNN YLAGPEDIYV SPSQIKRFGL RKGDTIEGLI
RPPKDGERFF AMLQVQKVND IEVEKIYNRP LFDNLTPLHP NKRFTLELDK NDITQRIIDL
MAPIGRGQRA LIVAPPKTGK TMMMQSIANS ITNNYKEVKL IVLLIDERPE EVTDMSRSVK
GEVIASTFDE APDRHVQVAE MALERAKRLV EQGTDVVILL DSITRLARAY NTVTPSSGRV
LTGGLEATSL QRPKRFLGAA RNMEEGGSLT IIATALVETG SRMDEVIFEE FKGTGNSEIC
LDRKLSDRRL FPAIDLNRSS TRKEDLLLSE DELNKVWIIR KVLAPLTSVD AMTLLRDKIV
ASKSNKDFLK QMEVSSL