Gene Emin_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1130 
Symbol 
ID6263805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1229925 
End bp1231133 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content46% 
IMG OID642611610 
ProductNADH-ubiquinone oxidoreductase chain 49kDa 
Protein accessionYP_001876019 
Protein GI187251537 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAA TAACAGTGCC TTTAGGGCCG CAGCATCCCG CCTTAAAAGA ACCGGGCAAT 
TTTATGCTTC TTTTGGAAGG TGAACAAATT GTAGACGCTA CGCTTCGCCT TGGTTATAAC
CACAGAGGCG TTGAGAAAGC CGCCGAAAGC AGAAACTATT TGCAAGCCAT GTATCTTATT
GAGCGTGTTT GCGGCATTTG CTCTCACTCG CACGCCACGT GTTATGTTAT GAATGTTGAG
GAAATAGCCG GCGCCGAAGT GCCTAAAAGA GCGCAGGCTA TACGCGTTAT CATAGGCGAA
CTTGAAAGGC TGCACAGCCA TTTGCTTTGG CTTGGCGTGG CGGCGCATGA AATAGGGTTT
GACACTCTTT TTATGTACTC CTGGCGCGAC AGGGAAAGCG TTATGGACGC TTTGGAAACT
ATTTCAGGCA ACCGCGTGCA CTATTCCATT AACACTATTG GCGGGGTAAG AAGGGATATT
ACGCCTGAAA TGGTTAAAAC TGTGTTAATG AAAATAGAAC ATCTTGAAAA AAGAGTTAAA
TATTATATCT CATTAGCTAC CGAAGAGCCC ACCGTTATCG CCAGAACAAA AGGCGTGGGG
CATCTTCCTA AAGAAGAGGC TTTAAGACGC TGCGCGGCCG GGCCTTTGGC AAGAGCTTCA
GGTATAGCCA GAGACGTAAG AAAAGACGAC CCTTATTTAA TTTACGGCGA ACTTGATTTT
AAAGTTATAA CCTCCGACGC GTGCGACGTT TTTGGCCGGT TATATGTGCG CGCTTTTGAA
ATGTTAGAAT CTTGCAGACT TTTAAAACAG GTGCTTAACT GGCTGCCCGA AGGCGCCATA
AAAGTTAATG TACCCGTTAA AATACCCGCC GGACATGCGG TTAACCGCTA TGAAGCGCCC
AGGGGCGAGG ATGTGCACTA TGTTAAATCA GACGGCGGTT TAAACCCGGC AAGAGTTAAA
GTGCGCGCGC CTACGCTGGC TAATTTTGAA AGCGTTGACT ATATGCTGCG CGGAGACGCA
CTTGCGGACG CGGCTTTAAT TATAGCCGCC ATTGACCCGT GTTTTTCCTG CACTGACAGG
GCAACAATTA TCAGTCCCTC TTTAAACTCA GTTAAAACTA TAGATTGGAA GGACATGCGG
GCGCACGGCA TAGAGTATTA TAAAAAGAAA GGAATAGATT TTTCAAAAGT AAAAGTTTTT
GATAAATAA
 
Protein sequence
MPKITVPLGP QHPALKEPGN FMLLLEGEQI VDATLRLGYN HRGVEKAAES RNYLQAMYLI 
ERVCGICSHS HATCYVMNVE EIAGAEVPKR AQAIRVIIGE LERLHSHLLW LGVAAHEIGF
DTLFMYSWRD RESVMDALET ISGNRVHYSI NTIGGVRRDI TPEMVKTVLM KIEHLEKRVK
YYISLATEEP TVIARTKGVG HLPKEEALRR CAAGPLARAS GIARDVRKDD PYLIYGELDF
KVITSDACDV FGRLYVRAFE MLESCRLLKQ VLNWLPEGAI KVNVPVKIPA GHAVNRYEAP
RGEDVHYVKS DGGLNPARVK VRAPTLANFE SVDYMLRGDA LADAALIIAA IDPCFSCTDR
ATIISPSLNS VKTIDWKDMR AHGIEYYKKK GIDFSKVKVF DK