Gene Mlg_2839 details

Gene Information

Locus tag: Mlg_2839
Symbol:
ID: 4270883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3222817
End bp: 3225198
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 71%
IMG OID: 638127601
Product: von Willebrand factor, type A
Protein accession: YP_743669
Protein GI: 114321986
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4548] Nitric oxide reductase activation protein
TIGRFAM ID:
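
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields.
START_BP = 3222817
END_BP = 3225198
GENE_LENGTH_BP = 2382
PROTEIN_LENGTH_AA = 793


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2382
print(coding_codons(GENE_LENGTH_BP))   # 793
```

Both results agree with the listed gene length (2382 bp) and protein length (793 aa).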


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGCG GTCTCTGCCG GGCCCAACCC ACCCAGCCCC AGGAGGCCGT CATGAACGCC 
GAAATCCAAG CCTTGGCGGA TGAACTGCGC GGCACCCACC GCGAGGTGGC CGAGGTGCTC
GACGCCTGCC TCGCCGAAGC AACCCGGGTG ATGTCCGCCG ACACCCAGGC CCGTTATCTG
GAGGCCGCCC TGGCCCTGAA CCGCCTGGGC CGCGGCCACG AGATCGTCAT CACTTGGCTC
GAGGCCATGC CCCCGGTGGC GCGCGAGGCC GGCGAGGCCA TCGTCCCCGA CACCGCCAGC
GCCGCGCTCA AACTGGCCTC CATGGTCAGC GGCGAAGTCG TCGGCCTACT CTTCGACAGC
CTGCCCACCG CCGCCCGACG GCTGGGCGAC GACGACCTGC TGCGCCAATA CCTGGCCCTG
ATCCACCAAC TCTCCGGCCG CGCCCCGCGC GCCCTGCGCC CGCTGTTCAC CCACCTCGAC
CAGCTGCTGG CCGTGCTCAC CCTCAGCGGT CTTCGCCGCT GGGCACTCTG GGGCGTGCAG
GCCTACGCCC GGGACTACGA CCGCCTGGCC GCCTACTTCG CGCTGGAATC GGCCGACAGT
CAGCAGGTCC TGCAACAGGA GCGCCGCGGC GTGCTCTTCG TGGACGTCCA GCGCCGGCTG
GGCTTCTATT TGCGCGCCCT TTGGGGCCGC GACTTCTTCC TGCGCCCCAA CGCCGCCGAA
CCGGGCAGCC CCGAGGCCCG CCCCTTCATC GAGGCTGGCA CCCTGCACCT GCCCGACGCC
ATGGACGATG TGGGCAGTGT CCGCGGCCTG GAGGTCTACC GCGCCCAGTG CGCCCACGCC
GCCGCCCACA TCGGCTTCGG CGAGGGCGCG CCGATGCAGG CCGAGGCGCT GAGCCCCGCC
CAGCGTTACC TGATGGCCCT GATCGAAGAT GCCCGGGTAG AGGCCCTAAG CGTGGCCACC
TTTCCCGGCC TCTTCCCCCT CTGGCGCCGG CTGCTCAGCG AGGCCCCCCG CGCCGAGGAC
CCCACCCTGG CGCTGCTCCA GCGGCTCGCC CTCGCGCTGC TCGACCCCCA GTGGCAGGAC
GACCACCCGG TTGTAGCGCA ACTGGCCGGC CGCTTCCACC AGCGCATCCA GGCCGGCGAC
CACGGCTGGG AGCTGTCCGC GGAGCTCGGC CTCGACCTCA TTGGCCACCT TCAGGACGCC
GGCCCGCTGC CGCCCCTGAG CCGCCTGGAA ACCCTGCCGC TGGCCTACCG CGACGACAAC
CGCTACCTCT GGGCCGAGCC GGAGGAGGCC GAACTGGCGC GGCAGGCGCC CGCCAAGGAG
GCCCAGGTAC GGCGCCGGCC TAGCGTCATC GAGATGGTCA ATGAACTGGA CTGCGAGCTG
GCCGGGGACG ATGCCCAGGA GATCTGGATC CTCGACACCG AGTTCTACCG CGACGGCGAT
CCGGAGGGGG TCAGTATCAA TGAGCTGGAG GGCAAGCCCG CCACCAGTCC CCCCTTCCAT
TACCAGGAGT GGGACTACAA GGCGCAACTG CACCGGCCCG ACTGGGTCAC GCTTATGGAA
CGCCGCCAGC CGGCCGGTGA CCCGGACGAC CTCAAGGCAA TCATGGACGA ATACCGCCCC
GTGGCCCGAC GGCTGCAGCG GGTGATCGAC AGCCTGATCC CCCAGGGGCT GGTGCGCGAG
CGCCGGCAGG AGGATGGCGA CGAGATCGAC CTGGACGCCG CCATCCGCGC CCGCATCGAC
CAGAAAACCG GCCACACGCC CGACCACCGG GTGAGCATCC GCTACCACCG CCAGGAGCGG
GACCTGGCGG TCCTGCTGCT GCTCGACCTG TCGGAATCGG CCAATGACAC CCTGCCCGGC
TCCGACCGCC CGCTCATCCA GCTCACCCGC GAGGCCACCA CGCTGCTGGC CTGGGCGGCC
GACAGCATCG GCGACCCCTT CGCCGTGCAC GGATTCGCCT CGGAGACCCG GCACGATGTC
CACTACCACC GCTTCAAGGA CTTCGACCAG CCCTGGGACG ATGCCGCCCA AGCTCGGGTG
GCGGGTCTGG AGGCGGGGCT CTCCACCCGC ATGGGTGCGG CATTACGTCA CGCCGGGCAC
TATATGACCC GCCGCCCGGA GCGCCACCGA CTGATCCTGC TGCTCTCTGA CGGTGCCCCC
TCCGACATCG ACGCGCCGGA CCCGCAGTAC CTGCGCCAGG ACACCCGCAA GGCGGTGGAG
GCGCTCCAGG CCCGTGGCGT TCACGCCCAC TGCCTGACCC TGGACCCGGG CGCGGATCAG
TACGTCCAGC AGCTCTTCGG CCCCCGCGGC TATACCGTGC TGGACCACCC CCAGCGGCTG
CCTGAGAAGC TGCCCACCCT GTTCGCCAGC CTCACCCGCT GA
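
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first row of the sequence is embedded as sample input. The 71% figure in the record refers to the whole gene, so a single row will not necessarily match it.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGGCACGCG GTCTCTGCCG GGCCCAACCC ACCCAGCCCC AGGAGGCCGT CATGAACGCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `first_row` would give the value to compare against the listed GC content.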
 
Protein sequence
MARGLCRAQP TQPQEAVMNA EIQALADELR GTHREVAEVL DACLAEATRV MSADTQARYL 
EAALALNRLG RGHEIVITWL EAMPPVAREA GEAIVPDTAS AALKLASMVS GEVVGLLFDS
LPTAARRLGD DDLLRQYLAL IHQLSGRAPR ALRPLFTHLD QLLAVLTLSG LRRWALWGVQ
AYARDYDRLA AYFALESADS QQVLQQERRG VLFVDVQRRL GFYLRALWGR DFFLRPNAAE
PGSPEARPFI EAGTLHLPDA MDDVGSVRGL EVYRAQCAHA AAHIGFGEGA PMQAEALSPA
QRYLMALIED ARVEALSVAT FPGLFPLWRR LLSEAPRAED PTLALLQRLA LALLDPQWQD
DHPVVAQLAG RFHQRIQAGD HGWELSAELG LDLIGHLQDA GPLPPLSRLE TLPLAYRDDN
RYLWAEPEEA ELARQAPAKE AQVRRRPSVI EMVNELDCEL AGDDAQEIWI LDTEFYRDGD
PEGVSINELE GKPATSPPFH YQEWDYKAQL HRPDWVTLME RRQPAGDPDD LKAIMDEYRP
VARRLQRVID SLIPQGLVRE RRQEDGDEID LDAAIRARID QKTGHTPDHR VSIRYHRQER
DLAVLLLLDL SESANDTLPG SDRPLIQLTR EATTLLAWAA DSIGDPFAVH GFASETRHDV
HYHRFKDFDQ PWDDAAQARV AGLEAGLSTR MGAALRHAGH YMTRRPERHR LILLLSDGAP
SDIDAPDPQY LRQDTRKAVE ALQARGVHAH CLTLDPGADQ YVQQLFGPRG YTVLDHPQRL
PEKLPTLFAS LTR
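
The protein sequence can be checked against the gene sequence by translating codons with translation table 11. The sketch below builds the codon table from the usual compact 64-letter string (bases in T, C, A, G order), which for amino-acid assignments is the same as the standard code; it does not model the alternative start codons of table 11. The function name and the two short fragments are illustrative: the fragments are the first 30 and last 12 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGGCACGCG GTCTCTGCCG GGCCCAACCC"  # first 30 bases of the gene
gene_end = "CTCACCCGCT GA"                       # last 12 bases of the gene

print(translate(gene_start))  # MARGLCRAQP
print(translate(gene_end))    # LTR
```

These fragments reproduce the first ten residues (MARGLCRAQP) and the last three residues (LTR) of the protein sequence; the final TGA codon is the stop.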