Gene Mlg_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1967 
Symbol 
ID4268169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2237650 
End bp2238903 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content64% 
IMG OID638126722 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_742799 
Protein GI114321116 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0788296 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGT TCAAGAGCTA TACCGTCAAC TTTGGCCCCC AGCACCCGGC CGCCCACGGG 
GTGCTGCGGA TGGTGCTGGA GATGGAGGGC GAGACGGTCC GGCGTGCGGA CCCGCATATC
GGGCTGCTGC ACCGGGCGAC CGAGAAGCTG GCCGAGTCCA AGCCCTATAA CCAGTCCATC
GGCTACATGG ACCGGTTGGA CTACGTCTCC ATGCTCTGCA ACGAGCACGC CTATGTGATG
GCGATCGAGA AGCTGCTCGG CATCGAGGCG CCGATCCGGG CCCAGTACAT CCGGGTGATG
TTCGACGAGA TCACCCGCAT CCTCAACCAC CTGATGTGGC TGGGGGCACA CGGCCTGGAC
GTGGGCGCAA TGACCGCCTT CCTCTACTGC TTCCGCGAGC GTGAGGACCT GATGGACTGC
TACGAGGCGG TCAGCGGCGC GCGTATGCAC GCGGCCTATT ACCGGCCGGG CGGTGTCTAC
CGCGACCTGC CTGAGAGCAT GCCCAAGTAT GAGCCCAGCA AGTTCCGGAG CAAGAAGGAG
CTGGAGCTCA TGAACGCGGC CCGGCAGGGG ACGATGCTCG ACTTCATCGA GGACTTCACC
GAGCGCTTCC CCGGCTGCGT GGACGAGTAC GAGACCCTGC TCACCGAGAA CCGGATCTGG
CGCCAGCGCC TGGTGGACGT CGGCGTGGTC TCGCCGGAGC GCGCGCTGCA ACTGGGCTTC
AGTGGCCCGA TGCTGCGGGG CTCCGGGATC GAGTGGGATC TGCGCAAGAA GCAGCCTTAC
GACGTCTACG ACCGGGTCGA GTTCGACATC CCGGTGGGCA CCAATGGGGA CTGCTACGAC
CGCTATCTGG TGCGTATCGA GGAGATGCGC CAGTCCAATC GCATCATCAA GCAGTGCGTG
GACTGGCTGC GTCACAACCC CGGCCCGGTG ATGCTCGAGG ACCACAAGGT CGCGCCACCC
AGTCGGGAGG AGATGAAGGA CGACATGGAG TCCCTCATCC ACCATTTCAA GCTGTTCACC
GAGGGTTACA CCACGCCCCC GGGCGAGGTC TACGCCGCGG TGGAGGCCCC CAAGGGTGAG
TTCGGCTGCT ACATGATTTC CGATGGCGCC AACAAGCCCT ATCGCGTCAA GCTGCGGGCA
CCGGGCTTCG CCCATCTGTC CGCCATGGAT GAGATGGCGC GCGGCCACAT GCTGGCTGAC
GTGGTGGCCA TTATCGGTAC ACAGGACATT GTTTTCGGGG AGATTGACCG TTGA
 
Protein sequence
MREFKSYTVN FGPQHPAAHG VLRMVLEMEG ETVRRADPHI GLLHRATEKL AESKPYNQSI 
GYMDRLDYVS MLCNEHAYVM AIEKLLGIEA PIRAQYIRVM FDEITRILNH LMWLGAHGLD
VGAMTAFLYC FREREDLMDC YEAVSGARMH AAYYRPGGVY RDLPESMPKY EPSKFRSKKE
LELMNAARQG TMLDFIEDFT ERFPGCVDEY ETLLTENRIW RQRLVDVGVV SPERALQLGF
SGPMLRGSGI EWDLRKKQPY DVYDRVEFDI PVGTNGDCYD RYLVRIEEMR QSNRIIKQCV
DWLRHNPGPV MLEDHKVAPP SREEMKDDME SLIHHFKLFT EGYTTPPGEV YAAVEAPKGE
FGCYMISDGA NKPYRVKLRA PGFAHLSAMD EMARGHMLAD VVAIIGTQDI VFGEIDR