Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2839 |
Symbol | |
ID | 4270883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 3222817 |
End bp | 3225198 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638127601 |
Product | von Willebrand factor, type A |
Protein accession | YP_743669 |
Protein GI | 114321986 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGCG GTCTCTGCCG GGCCCAACCC ACCCAGCCCC AGGAGGCCGT CATGAACGCC GAAATCCAAG CCTTGGCGGA TGAACTGCGC GGCACCCACC GCGAGGTGGC CGAGGTGCTC GACGCCTGCC TCGCCGAAGC AACCCGGGTG ATGTCCGCCG ACACCCAGGC CCGTTATCTG GAGGCCGCCC TGGCCCTGAA CCGCCTGGGC CGCGGCCACG AGATCGTCAT CACTTGGCTC GAGGCCATGC CCCCGGTGGC GCGCGAGGCC GGCGAGGCCA TCGTCCCCGA CACCGCCAGC GCCGCGCTCA AACTGGCCTC CATGGTCAGC GGCGAAGTCG TCGGCCTACT CTTCGACAGC CTGCCCACCG CCGCCCGACG GCTGGGCGAC GACGACCTGC TGCGCCAATA CCTGGCCCTG ATCCACCAAC TCTCCGGCCG CGCCCCGCGC GCCCTGCGCC CGCTGTTCAC CCACCTCGAC CAGCTGCTGG CCGTGCTCAC CCTCAGCGGT CTTCGCCGCT GGGCACTCTG GGGCGTGCAG GCCTACGCCC GGGACTACGA CCGCCTGGCC GCCTACTTCG CGCTGGAATC GGCCGACAGT CAGCAGGTCC TGCAACAGGA GCGCCGCGGC GTGCTCTTCG TGGACGTCCA GCGCCGGCTG GGCTTCTATT TGCGCGCCCT TTGGGGCCGC GACTTCTTCC TGCGCCCCAA CGCCGCCGAA CCGGGCAGCC CCGAGGCCCG CCCCTTCATC GAGGCTGGCA CCCTGCACCT GCCCGACGCC ATGGACGATG TGGGCAGTGT CCGCGGCCTG GAGGTCTACC GCGCCCAGTG CGCCCACGCC GCCGCCCACA TCGGCTTCGG CGAGGGCGCG CCGATGCAGG CCGAGGCGCT GAGCCCCGCC CAGCGTTACC TGATGGCCCT GATCGAAGAT GCCCGGGTAG AGGCCCTAAG CGTGGCCACC TTTCCCGGCC TCTTCCCCCT CTGGCGCCGG CTGCTCAGCG AGGCCCCCCG CGCCGAGGAC CCCACCCTGG CGCTGCTCCA GCGGCTCGCC CTCGCGCTGC TCGACCCCCA GTGGCAGGAC GACCACCCGG TTGTAGCGCA ACTGGCCGGC CGCTTCCACC AGCGCATCCA GGCCGGCGAC CACGGCTGGG AGCTGTCCGC GGAGCTCGGC CTCGACCTCA TTGGCCACCT TCAGGACGCC GGCCCGCTGC CGCCCCTGAG CCGCCTGGAA ACCCTGCCGC TGGCCTACCG CGACGACAAC CGCTACCTCT GGGCCGAGCC GGAGGAGGCC GAACTGGCGC GGCAGGCGCC CGCCAAGGAG GCCCAGGTAC GGCGCCGGCC TAGCGTCATC GAGATGGTCA ATGAACTGGA CTGCGAGCTG GCCGGGGACG ATGCCCAGGA GATCTGGATC CTCGACACCG AGTTCTACCG CGACGGCGAT CCGGAGGGGG TCAGTATCAA TGAGCTGGAG GGCAAGCCCG CCACCAGTCC CCCCTTCCAT TACCAGGAGT GGGACTACAA GGCGCAACTG CACCGGCCCG ACTGGGTCAC GCTTATGGAA CGCCGCCAGC CGGCCGGTGA CCCGGACGAC CTCAAGGCAA TCATGGACGA ATACCGCCCC GTGGCCCGAC GGCTGCAGCG GGTGATCGAC AGCCTGATCC CCCAGGGGCT GGTGCGCGAG CGCCGGCAGG AGGATGGCGA CGAGATCGAC CTGGACGCCG CCATCCGCGC CCGCATCGAC CAGAAAACCG GCCACACGCC CGACCACCGG GTGAGCATCC GCTACCACCG CCAGGAGCGG GACCTGGCGG TCCTGCTGCT GCTCGACCTG TCGGAATCGG CCAATGACAC CCTGCCCGGC TCCGACCGCC CGCTCATCCA GCTCACCCGC GAGGCCACCA CGCTGCTGGC CTGGGCGGCC GACAGCATCG GCGACCCCTT CGCCGTGCAC GGATTCGCCT CGGAGACCCG GCACGATGTC CACTACCACC GCTTCAAGGA CTTCGACCAG CCCTGGGACG ATGCCGCCCA AGCTCGGGTG GCGGGTCTGG AGGCGGGGCT CTCCACCCGC ATGGGTGCGG CATTACGTCA CGCCGGGCAC TATATGACCC GCCGCCCGGA GCGCCACCGA CTGATCCTGC TGCTCTCTGA CGGTGCCCCC TCCGACATCG ACGCGCCGGA CCCGCAGTAC CTGCGCCAGG ACACCCGCAA GGCGGTGGAG GCGCTCCAGG CCCGTGGCGT TCACGCCCAC TGCCTGACCC TGGACCCGGG CGCGGATCAG TACGTCCAGC AGCTCTTCGG CCCCCGCGGC TATACCGTGC TGGACCACCC CCAGCGGCTG CCTGAGAAGC TGCCCACCCT GTTCGCCAGC CTCACCCGCT GA
|
Protein sequence | MARGLCRAQP TQPQEAVMNA EIQALADELR GTHREVAEVL DACLAEATRV MSADTQARYL EAALALNRLG RGHEIVITWL EAMPPVAREA GEAIVPDTAS AALKLASMVS GEVVGLLFDS LPTAARRLGD DDLLRQYLAL IHQLSGRAPR ALRPLFTHLD QLLAVLTLSG LRRWALWGVQ AYARDYDRLA AYFALESADS QQVLQQERRG VLFVDVQRRL GFYLRALWGR DFFLRPNAAE PGSPEARPFI EAGTLHLPDA MDDVGSVRGL EVYRAQCAHA AAHIGFGEGA PMQAEALSPA QRYLMALIED ARVEALSVAT FPGLFPLWRR LLSEAPRAED PTLALLQRLA LALLDPQWQD DHPVVAQLAG RFHQRIQAGD HGWELSAELG LDLIGHLQDA GPLPPLSRLE TLPLAYRDDN RYLWAEPEEA ELARQAPAKE AQVRRRPSVI EMVNELDCEL AGDDAQEIWI LDTEFYRDGD PEGVSINELE GKPATSPPFH YQEWDYKAQL HRPDWVTLME RRQPAGDPDD LKAIMDEYRP VARRLQRVID SLIPQGLVRE RRQEDGDEID LDAAIRARID QKTGHTPDHR VSIRYHRQER DLAVLLLLDL SESANDTLPG SDRPLIQLTR EATTLLAWAA DSIGDPFAVH GFASETRHDV HYHRFKDFDQ PWDDAAQARV AGLEAGLSTR MGAALRHAGH YMTRRPERHR LILLLSDGAP SDIDAPDPQY LRQDTRKAVE ALQARGVHAH CLTLDPGADQ YVQQLFGPRG YTVLDHPQRL PEKLPTLFAS LTR
|
| |