Gene Mlg_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1003 
Symbol 
ID4268371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1138351 
End bp1142109 
Gene Length3759 bp 
Protein Length1252 aa 
Translation table11 
GC content65% 
IMG OID638125754 
Productrespiratory nitrate reductase alpha subunit apoprotein 
Protein accessionYP_741846 
Protein GI114320163 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01580] respiratory nitrate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.251874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTACT TTCTCGACCG ACTGACCTTC TTCAAGCGGG TCAAGGACGG CTTCTCCGGC 
GGACACGGGG AGGTCAGGGA TGAGTCCCGC CGCTGGGAGG ATGCCTATCG CAGCCGCTGG
CAGCACGACA AGGTCGTGCG CTCCACCCAC GGGGTGAACT GCACCGGCTC CTGCAGCTGG
AAGATCTACG TGAAAAACGG CCTGGTCACC TGGGAGACCC AGCAGACCGA CTACCCGCGC
ACCCGGCCGG AGCTGCCCAA CCACGAGCCG CGCGGCTGCG GCCGTGGCGC CAGCTACTCC
TGGTACATCT ACAGCGCCAA CCGGCTGAAG TACCCGATGA TCCGCGGCAC GCTGCTGCGC
ATGTGGCGCG AGGCGCGCAA GAAGCACAAA GACCCGGTGG ACGCCTGGGC CTCCATTGTC
GGCGACCCCA AGAAGGCCGA GCTCTACAAG AGCCGCCGAG GCCTGGGCGA CATGGTGCGC
TCCGACTGGG ACGAGGTGAA CGAGCTGGTC GCCTCCGCCA ATATCCACAC CATCAAAGAG
CACGGCCCGG ACCGAGTGAT CGGCTTTTCT CCGATCCCGG CCATGTCCAT GGTCAGCTAC
GCCGCCGGCT CCCGCTACAT GTCACTGCTC GGCGGCGTGT GCATGAGCTT CTACGACTGG
TACTGCGACC TGCCCCCCAG CAGCCCGCAG ACCTGGGGTG AGCAGACCGA CGTGCCGGAA
TCAGCCGACT GGTACAACGC CGGCTTCATC ATGATGTGGG GTTCCAACGT CCCGCAGACC
CGCACGCCCG ATGCCCACTT CATGACCGAG GTACGGTACA AGGGCACCGA GATCGTCACC
GTCTGCCCGG ACTACTCCGA GGCCTCCAAG TTCGGCGACA TCTGGCTGAA CCCCCAGCAG
GGCACCGACT CCGCGTTGGG TATGGCCCTG GGCCACGTGA TCCTCAAGGA ATTCCACGTG
GACAATCCCA GCGACTACTT CCGCGACTAC GCCCGCCAGT ACACCGACAT GCCCTGCCTG
GTCCGTCTGG AGAAGACCGA CAAGGGCTAC CAGGCCGACC GCTTTCTGCG GGCGTCAGAC
CTCGACAAGG CGCTCGGTCA GAAGAACAAC CCCGAGTGGA AGACCATCGC CTGGGACGAG
AAGAGCGACT CACTGGTGGT CCCGAATGGC TCCATCGGCT TCCGCTGGGG CGAGGATGGC
CAGTGGAACC TGGAGGAGAA GGAAGCCAGC GGCAAGGAGA CGAAGCTGCA GCTCTCGCAC
CTGGACAGCC GCGACGAGGT GGCCCTGGTG GGCTTCCCCT ACTTCGGCGG CCAGGAGCAC
GAACTGGGCC GGTTCACCGC CAACCCCCAG GAGGAGGTGA TCTACCGCAA GGTGCCGGTG
CGCAAGATAA AGCTGGCCGA CGGCGAGGAG GTGCTGGCCG CCTCGGTGTA CGACCTGATC
ACCGCCCACT ACGGCGTGGA CCGCGGCCTG GACTGCGAGA ACACCCCCAA GAGCTACGAC
GAAAACAAGC CCTACACCCC CGCCTGGCAG GAGCAGATCA CCGGTGTACC GCGTGACAAG
GTGATCAAGG TGGCCCGGCG CTTCGCCGAC AACGCCAACA AGACCCGTGG CAAGTCCATG
ATCATCATCG GTGCCGCCAT GAACCACTGG TACCACATGG ACATGAACTA CCGCAGCGTG
ATCAACATGC TGGTCTTCTG CGGGTGTATC GGTCAGAGCG GCGGCGGCTG GGCCCACTAC
GTCGGCCAGG AGAAGCTGCG TCCGCAGACC GGCTGGCAGC CGCTGGCCTT CGCCCTCGAC
TGGGCGCGTC CGCCGCGGCA CATGAACTCC ACCTCCTTCT TCTACGCCCA CACCGACCAG
TGGCGCTACG AGAAGCTGAA TGTGGCGGAC ATGCTCTCGC CGCTGGCCAA CAAAGAGGAC
TTCCAGGGCA GTCTGATCGA CTTCAACGTC CGCTCGGAGC GCATGGGCTG GCTGCCCTCC
GCCCCGCAGC TCGGCGCCAA CCCGCTGGAG GTGGCCAAGG AGGCGGCCAA GGCCGGAAAG
GACCCGAAGG ACTATGTGGT CGAGCAGCTC AAGAGTGGCC GGCTGCGCAT GGCCTGCGAG
GACCCGGACA ACCCGGCCAA CTTCCCGCGC AACCTGTTCG TCTGGCGCTC CAACCTGCTC
GGTTCCAGCG GCAAGGGCCA CGAGTACTTC CTCAAGCACC TGCTGGGCAC CAAGCACGGC
GTACAGGGCA AAGACCTCGG CGAGGAGGGT GCGGAGAAGC CCCAAGAGGT GGTCTGGCGC
GAGGAGGAGA CTCGCGGCAA GCTCGATCTG CTGACCACGC TGGACTTCCG CATGTCCACC
ACCTGCCTCT ACTCCGACGT GGTGCTGCCC ACGGCCACCT GGTACGAGAA GGACGACCTC
AACACCTCGG ACATGCACCC CTTCATCCAC CCGCTGTGCG AGGCGGTCAA CCCCTGCTGG
GAGTCCCGCA CCGACTGGGA GATCTACAAG GGGCTGGCCA AGACCTTCTC CAAACTGGTT
GAGGGCCACC TGGGCAAGGA GACCGACATC GTCACCCTGC CCCTGTTGCA CGACAGCCCG
GCCGAGCTGG GCCAGGCCCT GGACGTCAAG GAATGGCACA AGGGCGAATG CGACCCCATC
CCGGGCAAGA CCATGCCGGC ACTGGTCCCG GTGGAGCGCG ACTACCCCAA CGTCTACGCC
CGCTTCACCG CCCTGGGCCC GCTGATGACC AAGCTGGGCA ACGGCGGCAA GGGCATCAGC
TGGAACACCG AGCACGAGGT GGACCTGCTG GGCCGGCTGA ACGGCCAGCA CCGCAAGGAA
GGGGCGGCCA ACCAGGGGCT GCCCCGGATC AACACCGCCA TCGAGGCGGC GGACACCATT
CTGTCGCTGG CCCCGGAGAC CAACGGTGAG GTGGCGGTCA AGGCCTGGGC GGCGCTCTCC
AAACAGACCG GGCGCGACCA CACCCACCTG GCCCTGCCCC GCGAGGAGGA GAAACTGCGC
TTCCGCGACC TGGCCGCGCA ACCGCGCAAG ATCATCTCAT CGCCCACCTG GTCGGGTTTG
GAGTCGGAGC ACGTCTCCTA CAACGCTGGC TACACCAACG TGCATGAGCT GATCCCCTGG
CGCACCCTCT CTGGCCGCCA GCAGCTCTAC CAGGACCACC CCTGGATGCG CGCCTTCGGC
GAGGGCTTCG TCTGCTACCG GCCGCCGGTA AACATGCGCA CCACCCAGAC GGTGCAGGGT
GTACGCGGTA ACGGCAACGA GGAGATCGCG CTGAACTGGA TCACCCCGCA CCAGAAGTGG
GGTATCCACA GCACCTACAC GGACAACCTG ATCATGCTCA CGTTGTCCCG TGGCGGCCCC
TGCGTGTGGA TCAGCGAGAT CGATGCCAAA AAGGTCGGCA TCGTCGACAA CGACTGGATC
GAGTGCTTCA ACAGCAACGG CTCCCTGGTG GCCCGCGCCG TGGTCAGCCA GCGGGTCCGT
GAGGGCATGG TGATGATGTA CCACGCCCAG GAGAAGCTGG TGAACACCCC CGGATCCGAG
ATCACCGGCG AGCGTGGCGG CATCCACAAC TCGGTCACCC GGGCGGTCAT CAAGCCGACC
CACATGATCG GGGGCTACGC CCAGCTCAGC TACGGCTTCA ACTATTACGG CACCGTCGGT
TCCAACCGCG ACGAATTCGT GGTCGTGCGC AAGATGGACC GCGTGGACTG GCTGGACGGT
GACGACGAGA GCATCTACGA CGCGGAGGAA CAGGCATGA
 
Protein sequence
MSYFLDRLTF FKRVKDGFSG GHGEVRDESR RWEDAYRSRW QHDKVVRSTH GVNCTGSCSW 
KIYVKNGLVT WETQQTDYPR TRPELPNHEP RGCGRGASYS WYIYSANRLK YPMIRGTLLR
MWREARKKHK DPVDAWASIV GDPKKAELYK SRRGLGDMVR SDWDEVNELV ASANIHTIKE
HGPDRVIGFS PIPAMSMVSY AAGSRYMSLL GGVCMSFYDW YCDLPPSSPQ TWGEQTDVPE
SADWYNAGFI MMWGSNVPQT RTPDAHFMTE VRYKGTEIVT VCPDYSEASK FGDIWLNPQQ
GTDSALGMAL GHVILKEFHV DNPSDYFRDY ARQYTDMPCL VRLEKTDKGY QADRFLRASD
LDKALGQKNN PEWKTIAWDE KSDSLVVPNG SIGFRWGEDG QWNLEEKEAS GKETKLQLSH
LDSRDEVALV GFPYFGGQEH ELGRFTANPQ EEVIYRKVPV RKIKLADGEE VLAASVYDLI
TAHYGVDRGL DCENTPKSYD ENKPYTPAWQ EQITGVPRDK VIKVARRFAD NANKTRGKSM
IIIGAAMNHW YHMDMNYRSV INMLVFCGCI GQSGGGWAHY VGQEKLRPQT GWQPLAFALD
WARPPRHMNS TSFFYAHTDQ WRYEKLNVAD MLSPLANKED FQGSLIDFNV RSERMGWLPS
APQLGANPLE VAKEAAKAGK DPKDYVVEQL KSGRLRMACE DPDNPANFPR NLFVWRSNLL
GSSGKGHEYF LKHLLGTKHG VQGKDLGEEG AEKPQEVVWR EEETRGKLDL LTTLDFRMST
TCLYSDVVLP TATWYEKDDL NTSDMHPFIH PLCEAVNPCW ESRTDWEIYK GLAKTFSKLV
EGHLGKETDI VTLPLLHDSP AELGQALDVK EWHKGECDPI PGKTMPALVP VERDYPNVYA
RFTALGPLMT KLGNGGKGIS WNTEHEVDLL GRLNGQHRKE GAANQGLPRI NTAIEAADTI
LSLAPETNGE VAVKAWAALS KQTGRDHTHL ALPREEEKLR FRDLAAQPRK IISSPTWSGL
ESEHVSYNAG YTNVHELIPW RTLSGRQQLY QDHPWMRAFG EGFVCYRPPV NMRTTQTVQG
VRGNGNEEIA LNWITPHQKW GIHSTYTDNL IMLTLSRGGP CVWISEIDAK KVGIVDNDWI
ECFNSNGSLV ARAVVSQRVR EGMVMMYHAQ EKLVNTPGSE ITGERGGIHN SVTRAVIKPT
HMIGGYAQLS YGFNYYGTVG SNRDEFVVVR KMDRVDWLDG DDESIYDAEE QA