Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2120 |
Symbol | |
ID | 4269370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2405703 |
End bp | 2407646 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126876 |
Product | von Willebrand factor, type A |
Protein accession | YP_742952 |
Protein GI | 114321269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.876346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAGT TCCTGGAGGT GGAAGAGTTC GTCGGTCGCC ACTGGCACCG TTGGGCATCG CAGGCACGGA GCTACCCGCG CCACCCGGAG GCCGCGGTCC AGCTCGCCAC CCTGCGCGGC CTCCTGGGCG TGTTCTTCCG CGCCAGCGGG GGGCCGGCGG GGGTGCCGGT GGCCTCCATC GTGGCCCGCA ACTCCCGTCA CCGCCTGACC TGGCGGCAGC GCCTCGGGTT CGACGAGGAG CCGGTGGATC GGGCCCGCCG CGATGAGGAG AACCTGCTGC TGCCACCGGT GCTGGACTAC TTCCCCACCG CGGCTCAGAA CCGCGACCAC TATCTTTGGC TGGCGGCCTT CCTGGCCCTG GCGCGGCCAC CGCGCACGGA CCGCCTGCAC GACCCATTGC AGCAGGATAT CGTCCGGCTG CGGGAGGTCC ACCGCGTGAT CCAAACCATC CGCGACCGGC TACCGGGGCT CCACCAGCGT TACCGCGCCC TGGGCAGCGC CATGCTGGCA CTGCGCCCCC GGCGCCGCCT GCCGCCCCAG GAGTCCGCCG TGGAGCTGGC CGTGCAGTAC CTGCTCGGGG CCGCCCTGCC GGGTCGGAGC GCCGCCACGG CCATCGTCCG CGCCGTCACC GATCCACAAG TGGCCCTGGA TGCGTTCCAG GCCGACCGGG ATTACCGACC ACCCCTGCCC ATGCCGCTCT GGGGCGAGGT CGTCCCGCTG GGCACGGGGA CCGGCGCCAA GCCGGGCGAG GGCCACGACG AGGGCGGCGC CACGGGCAGC CCCAAGCAGG CCAGCGAGGG CAAGCGCCAG GCCGAGCGCC GTCACCAGGA CCAATGCGAG CGCGACGACC CGCTGGTCCT CAACCGCTTC GAGAAGATGC TCTCCTGGAC GGAGATGGTG AACGTCAACC GGCTGGTTGA AGACGAGGAG GACGAAGAGG CCAAACGGGC TGCTGAACAG ATCGAGGAGA TCACCCTCAG CCCCCATAAG CAGCGCGCGG CCACCCGGCT GAAGGTGGAC CTGGACCTGC CGCCGGACGC CGTCACCGGC GATCGCCTGC GCGCAACCCA CACCTACCCG GAGTGGAACT TCCGCAAGCA GGCCTACCTG CCGGACCACT GCGCCGTGCA CACGGACCTG CAGCCCGAGG AGGGCGAGGC CTGGCGCCCC GATGCCGGCA CCCGCCGGCG CATCCGCCGG GTGCAACGCC AGTTCGAGGC CCTGCGCCCG CGCCGGGAAC TGCTGCGCGC CCAGATCGAT GGCGCGGAAC TGGACATGGA CGCCACCATC CGCGCCCATT GCGACCTCCG GGCCACCGGC GAGGGCTCCG ACAACATCTA CCAGGCGGCC CGCTGCCAGG CCCGCGACCT GGCGGTGGCG ATCCTGGTGG ACTGCTCGCT CTCCACCGAC GCCTGGCTGG AGGATCAGCG GATACTGGAT GTGGAGAAGG AGGCCCTGCT GGTGCTGGCC CATGGCCTCA AGGGCTGCGG AGACGATTAT GCCATCTACA CCTTCACCTC CCACCGGCGG CAGAAGGTCT GGGTAAATAC CGTCAAGGCC TTCGACGAAC CCCTCCAGGC GCGGGTGGAG CGCCGGATCG GGGCACTCAA GCCCGGCCAT TACACCCGCA TGGGACCGGC GCTACGCCAC GTCTCCGGCG AATTGGCCAA ACGGCCCAAT AGGCACAAAC TGCTACTGGT GCTCACCGAC GGCAAGCCCA ACGATACCGA CTACTATGAG GGCCGCTACG CCATCGAGGA CACGCGCAAG GCGGTACGGG AGGCCCGGCG CCAGGCCCAG ACGGTGTTCG GTGTCACCGT GGACAGCGAG GCCCAACAGT ACTTTCCTTA CCTGTTCGGA CGGGCCGGCT ACAGCATCGT CCAGCGGCCC GCCCACCTGG CCCAGAGCTT GCCGGCCATC TACCGTCAGA TCATCAGCGA ATAG
|
Protein sequence | MLQFLEVEEF VGRHWHRWAS QARSYPRHPE AAVQLATLRG LLGVFFRASG GPAGVPVASI VARNSRHRLT WRQRLGFDEE PVDRARRDEE NLLLPPVLDY FPTAAQNRDH YLWLAAFLAL ARPPRTDRLH DPLQQDIVRL REVHRVIQTI RDRLPGLHQR YRALGSAMLA LRPRRRLPPQ ESAVELAVQY LLGAALPGRS AATAIVRAVT DPQVALDAFQ ADRDYRPPLP MPLWGEVVPL GTGTGAKPGE GHDEGGATGS PKQASEGKRQ AERRHQDQCE RDDPLVLNRF EKMLSWTEMV NVNRLVEDEE DEEAKRAAEQ IEEITLSPHK QRAATRLKVD LDLPPDAVTG DRLRATHTYP EWNFRKQAYL PDHCAVHTDL QPEEGEAWRP DAGTRRRIRR VQRQFEALRP RRELLRAQID GAELDMDATI RAHCDLRATG EGSDNIYQAA RCQARDLAVA ILVDCSLSTD AWLEDQRILD VEKEALLVLA HGLKGCGDDY AIYTFTSHRR QKVWVNTVKA FDEPLQARVE RRIGALKPGH YTRMGPALRH VSGELAKRPN RHKLLLVLTD GKPNDTDYYE GRYAIEDTRK AVREARRQAQ TVFGVTVDSE AQQYFPYLFG RAGYSIVQRP AHLAQSLPAI YRQIISE
|
| |