Gene Mlg_0757 details

Gene Information

Locus tag: Mlg_0757
Symbol:
ID: 4268570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 840982
End bp: 842391
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 71%
IMG OID: 638125506
Product: rubredoxin-type Fe(Cys)4 protein
Protein accession: YP_741601
Protein GI: 114319918
COG category: [C] Energy production and conversion
COG ID: [COG1251] NAD(P)H-nitrite reductase
        [COG1773] Rubredoxin
TIGRFAM ID:
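
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its length
    includes exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(840982, 842391)
print(length, protein_length(length))  # prints "1410 469"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.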


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGAC CCTACCGACG CTACCTGTGC CGCGTCTGCG GTTATATCTA CGACGAGGCC 
AAGGGCGATC CTGACGGGAG TCTGCCTCCC GGCACCCGCT TCGAGGACAT CCCGGATGAC
TGGGAATGCC CCGACTGCGG GATGCGCAAG GTCGAGTTGG ATCTCCTTGA GGAGGGCGAC
GACGGCGGGG GCAGCCCCAA GATCCGCACG GGGTCGGCGC GCCCACAGGA TCCGGGGCCG
GGTCAGGACC CGCACCCGGT GGTTATTGTC GGGGCCGGTA TGGCTGGTTG GGCGGTGGCG
GAGGGGGTGC GGGCCCAGGA CGAGGGCCGC GCCATCACCC TCGTGACCCA ATGTAACGGG
GACGTCTATT ACAAGCCCCA GCTGTCGGCG GCGGCCGCGC GTGGGCGGGG CCCGGATGAG
CTCATTCAGG CGACCGGCGA GGACAAGGCC CGGGCCCTGG GGGTGAATCT GCTGGCCCGT
ACCCGGGCGT TGCGCATCGA CACCGACCGG CGCCGCCTGA TCACGCCGCG TGGCGGTATC
CCCTTCGGTG ACCTGGTCCT GGCCTGCGGC GCCCGCCAGC CCCGCCCGAG GCTGGCGGGG
GACGCGGCCG GCGACGTGCT GCAGGTGAAC GACCTGGCCG ATTACCGGCG GCTCCGTGAA
CGGGTGGATG TCCACGACTC GGCACGGGTC TTTATCCTCG GTGCCGGGCT GATTGGCTGC
GAGTTCGCAG AGGACCTCTC CGGCGCCGGC CATGCGGTGA CTCTGGTGGA CATCGCCGAG
CGGCCGCTGG CGCGGCTGCT GCCCGTGCCC CTGAGCGCGG ACCTGGCATT GGCGCTGGAT
GACAAGGGCG TGGCGCTGCA CATGGGCCGG ACCGTGGACG CGGTGGACCG GGCCGCGGAT
GGGGGCTATC GGGTGATGCT GGACGATGGT GGGGTCGTGG CCGCGGACGT GGTGGTCAGT
GCCCTGGGCC TGATCCCAAA CACCTATCTG GCGACCCGGG CCGGGCTGTC GGTGGGGCAG
GGGATCCAAG TGGACGGTCA ATTACGGACC AGCGACCCGG GGATACGGGC GCTGGGTGAC
TGCAGCGAGC ACACCGGCCA CCTGCTGCCC TATGTGCAGC CGCTGAAGGC CCAGGCGCAG
GTGATCGCCG CCTGCCTGGC AGGCGAGCGC GACCACTACA CCCCGGAGCC CGGCACGGTG
CGCATCAAGA CGCCCTCCTG CCGGTTGGCG GTCTGGACGC CGTGGCAGGA GGGCGTCTGG
CGCGAGGAGG CGCACGATGA GCAGGGCCGC ACCCTGGTGC ACTACAGCGG CGAGGCCGTC
ACCGGCTTTG CCTTGTCCGG CCGCCATGTG CGCCAGGCCC CGAAGCTCGA GCGACAGGTT
CAGGCCGGCC GCGACCGGGG TGTGGCCTGA
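
The gene sequence is printed in blocks of 10 bases, six blocks per line, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as displayed.
first_line = "ATGGGCAGAC CCTACCGACG CTACCTGTGC CGCGTCTGCG GTTATATCTA CGACGAGGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the whole listing to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the reported GC content.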
 
Protein sequence
MGRPYRRYLC RVCGYIYDEA KGDPDGSLPP GTRFEDIPDD WECPDCGMRK VELDLLEEGD 
DGGGSPKIRT GSARPQDPGP GQDPHPVVIV GAGMAGWAVA EGVRAQDEGR AITLVTQCNG
DVYYKPQLSA AAARGRGPDE LIQATGEDKA RALGVNLLAR TRALRIDTDR RRLITPRGGI
PFGDLVLACG ARQPRPRLAG DAAGDVLQVN DLADYRRLRE RVDVHDSARV FILGAGLIGC
EFAEDLSGAG HAVTLVDIAE RPLARLLPVP LSADLALALD DKGVALHMGR TVDAVDRAAD
GGYRVMLDDG GVVAADVVVS ALGLIPNTYL ATRAGLSVGQ GIQVDGQLRT SDPGIRALGD
CSEHTGHLLP YVQPLKAQAQ VIAACLAGER DHYTPEPGTV RIKTPSCRLA VWTPWQEGVW
REEAHDEQGR TLVHYSGEAV TGFALSGRHV RQAPKLERQV QAGRDRGVA
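
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own pipeline: it builds the codon-to-amino-acid map shared by the standard code and translation table 11, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). All names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence, copied as displayed.
print(translate("ATGGGCAGAC CCTACCGACG C"))  # prints "MGRPYRR"
```

The output matches the first seven residues of the protein sequence listed above.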