Gene Emin_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0666 
Symbol 
ID6263129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp740292 
End bp741683 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content42% 
IMG OID642611137 
ProductNusA antitermination factor 
Protein accessionYP_001875558 
Protein GI187251076 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0123778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGCA ATCCCAAAGA ATTAATGATG GCTCTAGAGA GCTTGGAAAG AGAAAAAAAT 
ATCAAAAGAG ACGATATTAT TAAAACAATA GAAGACGCCC TTGTGTCGGC GCTTCGCAAA
AACTTAGGTA AAACAGCGCA AATAAGCGCG AAAATAAACC CCGAAGAGGG TGACATTAAG
GCCTTTCAGG TTTTAAATAT TGTAGAAATT GTAGCAAACC CGGAAATGGA AATCTCACTT
GAGCAAGCCA AAGCTATGGA TGACCGCTCC GAAGTAGGCG GCACAATAAC AAACGTTTTG
GAAGTTGAGG ATTTTTCCCG TATAGCTGCG CAAATAGCCA AACAGGTTTT AATTCAAAAA
GTAAGAGGTA TTGAAAGGGA AAATACTTAT AAAGAATTTA AACCCAGAGA GGGGGAAGTT
ATTACAGGCT CCGTACGCAG ATTTTCCGAC AGGGATATTG TTGTTGATTT AGGCAAAGTT
GAAGCTATTT TACCTTATTC CGAACAGATT AAAAGGGAAA GGTATTCTAA CGGATCGCGC
ATTAAAGCTA TTATCACAAA AGTTTTATCC CAGCAGGACT TGCTTACAAT CGGCGAAGAT
CCTGTTTTGG GCAGATACAA AAGCGCCGCT TTTAAAATGG ACAAAGGACA AAGAGGGCCA
TACGTCATTT TATCGCGTAC AAGCCCAGCT TTTTTAGAAG ACTTATTTAA AGTTGAAGTT
CCCGAAATAG GCGAAGGCAT TGTTGAAATC AAAGCTATTC AAAGAGACCC GGGCTTCAGA
GCTAAAGTGG TTGTCAGAAG CTATGATAAT AAAGTTGACC CAATAGGCAC CTGCGTAGGC
ATGAGGGGCA TAAGAATACG CGCTATTATG AATGAACTCA GCGGTGAACG TATTGACCTT
ATTCCTTACA GCGAAGACGT TACAACAATG ATTATGAATT CAATAGCTCC GGCAAGAGCG
AACTCCGTAA AAATAATAAG CGCCGAAGAG AAAAAAGCTC TTATCATTGT ACCTGACGAC
CAGCTTGCCA TAGCTATAGG TAAAGACTGG CAGAATATTA AATTAGCCAG CAAACTTACA
GGCTGGGAAC TTGAAGTAAA GAGCGAATCC CAAAAGCTCC AGGAGGGACA GGCCACCGTT
GACAATCTTG AAAGCTTGTT AGCTTCCGTG GAAGGCATTG GGCCCAAAAC GGCCGAAACA
CTTGTTAAAG CAGGCTTTTC TTCTGTTGAA AAGATAGCCG CTCTTGAGCC TGAACATCTT
GCCACCGTGC AAGGTATCGG GGAAAAGAGC GCGGCCAAAA TTATTGAAGG GGCCAAAAAA
TATTTAGAAA CGCAAGGCGA AGAGGTTTTG CAAGAGGAGG CAGTAAATGA CGACAACCAA
GAAGGCAACT AA
 
Protein sequence
MEGNPKELMM ALESLEREKN IKRDDIIKTI EDALVSALRK NLGKTAQISA KINPEEGDIK 
AFQVLNIVEI VANPEMEISL EQAKAMDDRS EVGGTITNVL EVEDFSRIAA QIAKQVLIQK
VRGIERENTY KEFKPREGEV ITGSVRRFSD RDIVVDLGKV EAILPYSEQI KRERYSNGSR
IKAIITKVLS QQDLLTIGED PVLGRYKSAA FKMDKGQRGP YVILSRTSPA FLEDLFKVEV
PEIGEGIVEI KAIQRDPGFR AKVVVRSYDN KVDPIGTCVG MRGIRIRAIM NELSGERIDL
IPYSEDVTTM IMNSIAPARA NSVKIISAEE KKALIIVPDD QLAIAIGKDW QNIKLASKLT
GWELEVKSES QKLQEGQATV DNLESLLASV EGIGPKTAET LVKAGFSSVE KIAALEPEHL
ATVQGIGEKS AAKIIEGAKK YLETQGEEVL QEEAVNDDNQ EGN