Gene Mlg_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1002 
Symbol 
ID4268370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1136771 
End bp1138354 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content64% 
IMG OID638125753 
Productrespiratory nitrate reductase beta subunit 
Protein accessionYP_741845 
Protein GI114320162 
COG category[C] Energy production and conversion 
COG ID[COG1140] Nitrate reductase beta subunit 
TIGRFAM ID[TIGR01660] nitrate reductase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.33657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTCC GGGCGCAAAT CGGAAAAGTC CTGAATCTGG ACAAGTGCAT CGGGTGCCAC 
ACCTGCTCGA TCACCTGCAA GAACGTATGG ACCTCCCGCG AGGGCATGGA GTACGTGTGG
TTCAACAATG TGGAGTCCAA GCCCGGCGTC GGCTACCCCA AGGACTGGGA GAACCAGGAC
CGCTGGAACG GTGGCTGGGC GGTGCGTAAC GGCCGCCTGG AGCCCCGGGC CGGGGGCAAG
TGGCGGATTC TGTCCAACAT CTTCTACAAC CCGGACCTGC CGTCCATCGA CGATTACTAC
GAGCCGTTCA ACTTCGACTA CCAGCGGCTG CAGAACGCCC CGGCGTCGAA GTATCAGCCC
GTGGCCAAAC CCTTCTCGGC GCTCACCGGC TACCCGATGG ACAAGATCGA GTGGGGCCCC
AACTGGGAGG AGATCCTCGG CGGCGAGTTC GAGAAGCGGT CCAAGGACTA CAACTTCGCC
AAGGTGCAGA AGGAGATCTA CGGCCAGTTC GAGAACACCT TCCTGATGTA CCTGCCGCGG
TTGTGCGAGC ACTGCCTCAA CCCCTCCTGC GTGGCCTCCT GCCCCTCGGG CGCGATCTAC
AAGCGCGAGG AGGACGGCAT CGTCCTGATC GACCAGGACA AGTGCCGGGG CTGGCGGATG
TGCATCAGCG GCTGCCCGTA CAAGAAGATC TACTACAACT GGCAGAGCGG TAAATCGGAA
AAGTGCACCT TCTGCTTCCC CCGTATCGAG GTGGGCCAGC CCACGGTCTG TTCGGAGACC
TGCGTGGGGC GCATCCGTTA TCTGGGCGTC ATCCTCTACG ACGCCGACAA GATCTCGGAG
GCCGCCTCCA CCGCCTCGGA ACAGGACCTG TACGAGAAGC AGATGTCGGT CTTCCTCGAC
CCCCATGACC CGGAGGTCAT CGCCGAGGCC AAGAAGGCGG GCATCCCGCA CGCCTGGCTG
GAGGCCGCGA AACGCTCGCC GGTCTACAAG ATGGCCATGG ATTGGAAGGT CGCCTTCCCG
CTGCACCCGG AGTACCGCAC GCTGCCGATG GTCTGGTACA TCCCGCCGCT CTCCCCCATC
CAGTCCGCCG CCGATGCGGG CACCCTCGAG AACGACGGAC TGATTCCGGA TGTGAACTCG
CTGCGCATTC CGGTCAAGTA CCTGGCCAAC ATGCTGACCG CCGGCGATGA ACGTCCGGTG
GTGACGGCCC TGCAGCGAAT GCTGGCCATG CGCCACTTTA AACGCAGCCA GACCGTGGAG
GGCAAGCCCA ACCACCAGGT CCTCGAACAG GTCGGCCTGG ACGAGGCCAC GGTGGAGGAC
ATGTACCAGA TCATGGCCAT TGCCAACTAC GAGGATCGGT TCGACGTCCC GGCCGCCAAC
CGGGAACTGG CCGCCGACGA TGCCTACGGC GAGCGCTCCG GCTGCGGCTT CAGCTTCGGC
GACGGCTGCA ACGCCGCCAG CAACGGTGTC AGCCCGCAGG ACATCTTCGG TGGCCGCCGG
GCCTCCCGCC GGGCCTACCC CGTCCGCGAG GAGCCATCGG CGCCGGCCGA GGAAGCCGAG
AAGCAGAAGG AGGGCACGTC ATGA
 
Protein sequence
MRVRAQIGKV LNLDKCIGCH TCSITCKNVW TSREGMEYVW FNNVESKPGV GYPKDWENQD 
RWNGGWAVRN GRLEPRAGGK WRILSNIFYN PDLPSIDDYY EPFNFDYQRL QNAPASKYQP
VAKPFSALTG YPMDKIEWGP NWEEILGGEF EKRSKDYNFA KVQKEIYGQF ENTFLMYLPR
LCEHCLNPSC VASCPSGAIY KREEDGIVLI DQDKCRGWRM CISGCPYKKI YYNWQSGKSE
KCTFCFPRIE VGQPTVCSET CVGRIRYLGV ILYDADKISE AASTASEQDL YEKQMSVFLD
PHDPEVIAEA KKAGIPHAWL EAAKRSPVYK MAMDWKVAFP LHPEYRTLPM VWYIPPLSPI
QSAADAGTLE NDGLIPDVNS LRIPVKYLAN MLTAGDERPV VTALQRMLAM RHFKRSQTVE
GKPNHQVLEQ VGLDEATVED MYQIMAIANY EDRFDVPAAN RELAADDAYG ERSGCGFSFG
DGCNAASNGV SPQDIFGGRR ASRRAYPVRE EPSAPAEEAE KQKEGTS