Gene Mlg_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1965 
Symbol 
ID4268167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2235872 
End bp2237149 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content65% 
IMG OID638126720 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_742797 
Protein GI114321114 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.789602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0852998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAATC AGGTGTGTCT GACCACGCTG GACAAGGAAA CCCCCTGGAG CCTGGAGACC 
TACCGGGCGA TGGGCGGCTA CCAGGCCTGG GAGAAGATTC TCAAGGAGAA GACGCCCCAG
GAAGAGATTA TCGAGACGGT CAAAAAGGCC AACCTGCGCG GCCGCGGTGG CGCCGGCTTC
CCCGCCGGGG TTAAGTGGGG CTTCATGCCC CGAAATGCGC CGGGCCAGAA GTACATTGTC
TGTAACTCTG ACGAATCGGA GCCCGGCACC TGTAAGGATC GCGACATCCT GCGCTTCAAT
CCCCACGCCC TGGTGGAGGG CATGGCGATT GCCGGCTATG CCATGGGTGC CACCGTGGGC
TACAACTACC TGCGCGGTGA GTTTCACCAC GAGCCCTTCG AGCGGATCGA GCAGGCGGTG
CGCGAAGCCC GCGAGGCCGG CCTGCTGGGG CGCAACATCC AAGGCAGTGG CATCGATTTC
GAGCTCCACA ATCATATCGG GGCGGGCGCC TATATCTGTG GCGAGGAATC GGCGCTGATG
GAGTCGCTGG AGGGCAAAAA GGGCCAGCCC CGCTACAAGC CGCCTTTCCC GGCCCAGGTC
GGCGTATACG GGCGCCCCAC CACCATCAAC AACACCGAGA CCCTCGCCTC CGTGCCCTCG
ATTATGCGCA AGGGCAGCGA GTGGTTCCTC GAGCTGGGCA AGCCCAATAA CGGCGGTGAG
AAGATCTTCT GTGTCTCCGG GCACGTGGAA AGGCCGGGTA ACTTTGAGGT CCCGCTGGGG
ACGCCGTTCA AGGACCTTTT GGAGATGGCC GGGGGCGTGC GCGGCGGGCG TAAGCTCAAG
GCCGTGATCC CGGGCGGTTC CTCCATGCCC GTGGTCCCCG GCGAGACCAT GCTGCAGGCC
ACCATGGACT ACGACGGCCT GGCGGAGATC GGCTCGGCCC TCGGTTCCGG CGGGGTCATC
GTGATGGACG AGACCACCGA CATGGTCAAG GCGATCCTGC GCATCTCGCG GTTCTACTTC
GCCGAGTCCT GCGGTCAGTG CACCCCCTGC CGGGAGGGCA CTGGCTGGAT GCAACGGGTG
CTCCGGCGCA TCGTCGAAGG CAAAGGCCGG CACGAGGACA TCGAACTGCT GGAGGCGGCG
GCGGGGCAGA TCGCCGGCCA CACGATCTGC GCCTTCGGCG AGGCCGCGGC CTGGCCGGTG
CAGAGCTTCC TCAAGCACTT CCGTCACGAG TTTGAATACT ACGTGGAGCA TAAGCGTTCC
ATGGTGGAGG CCGCCTGA
 
Protein sequence
MANQVCLTTL DKETPWSLET YRAMGGYQAW EKILKEKTPQ EEIIETVKKA NLRGRGGAGF 
PAGVKWGFMP RNAPGQKYIV CNSDESEPGT CKDRDILRFN PHALVEGMAI AGYAMGATVG
YNYLRGEFHH EPFERIEQAV REAREAGLLG RNIQGSGIDF ELHNHIGAGA YICGEESALM
ESLEGKKGQP RYKPPFPAQV GVYGRPTTIN NTETLASVPS IMRKGSEWFL ELGKPNNGGE
KIFCVSGHVE RPGNFEVPLG TPFKDLLEMA GGVRGGRKLK AVIPGGSSMP VVPGETMLQA
TMDYDGLAEI GSALGSGGVI VMDETTDMVK AILRISRFYF AESCGQCTPC REGTGWMQRV
LRRIVEGKGR HEDIELLEAA AGQIAGHTIC AFGEAAAWPV QSFLKHFRHE FEYYVEHKRS
MVEAA