Gene Mlg_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1963 
Symbol 
ID4268165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2232425 
End bp2233444 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID638126718 
Productrespiratory-chain NADH dehydrogenase, subunit 1 
Protein accessionYP_742795 
Protein GI114321112 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.787687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0372303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTTT TCACAGAACT TTTCTGGATA ACGCTCAAGA TCATGGCGCT GGTGGTGCCG 
CTGATGCTTG CAGTGGCTTA CCTGACCTAC GCCGAGCGCA GGGTCATCGG GGCGATGCAG
GACCGACGCG GCCCGAACCG CGTGGGCTAT CAGGGGTTGT TGCAGCCGAT CGCGGACGCG
CTGAAGCTGG TCATGAAGGA GATCAGCATC CCGTCCAACG CCAACCGGGT CCTGTTCGTC
ATCGCACCGT TGCTGGCCAT CATGCCCGCA CTGGCGGCCT GGGCGGTCAT TCCGGTGGCC
GAGGGCTGGG CCATCGCCGA TATCAACGCG GGTCTGCTCT ATATCCTGGC CATGACCTCC
CTGGGGGTCT ACGGCATCAT CATTGCCGGC TGGGCCTCCA ACTCCAAGTA CGCCCTGTTG
GGGACCCTGC GGGCGTCCGC GCAGGTCGTC TCCTACGAGA TTGCCATGGG CTTCGCCCTG
GTCGGCGTGC TGATGGCGGC CGGTTCCATG AACCTGGGCC AGATCATCCA GGCCCAGGCG
GGCGGTATCT TCCACTGGTT CTGGCTGCCG CTGTTGCCGC TCTTCCTGGT CTACTGGATC
TCCGGTGTGG CCGAGACCAA CCGCGCACCC TTCGACGTTG CCGAGGGCGA GTCCGAGATC
GTCGCCGGCT TCCACGTGGA GTACTCGGGG ACCTCCTTCG CGGTCTTTTT CCTGGCGGAA
TACGCCAACA TGATCCTCAT CTCCGCGGTG GCCGCGGTGA TGTTCCTGGG GGGCTGGTAT
TCGCCCTTCC ACGGTTGGCC GATTTTGGGC CCGATGCTCG ACTGGGTCCC CGGTGTCGTC
TGGTTCATGC TCAAGACCGC CTTCTTCATG TTCTGTTACC TGTGGTTCCG CGCCACCTTC
CCGCGATACC GCTATGACCA GATCATGCGG CTGGGGTGGA AGGTGCTGAT CCCGGTCACC
GTGGTCTGGC TCATCGTGCT GACCATCTTC ATCGTCACCG GCTTCGGGCC CTGGTTCTGA
 
Protein sequence
MAVFTELFWI TLKIMALVVP LMLAVAYLTY AERRVIGAMQ DRRGPNRVGY QGLLQPIADA 
LKLVMKEISI PSNANRVLFV IAPLLAIMPA LAAWAVIPVA EGWAIADINA GLLYILAMTS
LGVYGIIIAG WASNSKYALL GTLRASAQVV SYEIAMGFAL VGVLMAAGSM NLGQIIQAQA
GGIFHWFWLP LLPLFLVYWI SGVAETNRAP FDVAEGESEI VAGFHVEYSG TSFAVFFLAE
YANMILISAV AAVMFLGGWY SPFHGWPILG PMLDWVPGVV WFMLKTAFFM FCYLWFRATF
PRYRYDQIMR LGWKVLIPVT VVWLIVLTIF IVTGFGPWF