Gene Information |
Locus tag | Mlg_2394 |
Symbol | |
ID | 4269391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2716823 |
End bp | 2719741 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638127152 |
Product | von Willebrand factor, type A |
Protein accession | YP_743224 |
Protein GI | 114321541 |
COG category | [R] General function prediction only |
COG ID | [COG4245] Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain |
TIGRFAM ID | |
|
|
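The coordinates and lengths in the table above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2716823, 2719741)
print(length_bp)                  # 2919
print(protein_length(length_bp))  # 972
```

Both values agree with the Gene Length (2919 bp) and Protein Length (972 aa) fields of the record.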
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.400523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00000432302 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACATCA GACACGTCGA CCGCCATGTC GAGAAACGGC GGTCCTGGTA CACCGGGCTG TTCTGCGCCG GCCTCAGCGC CGCCATGATC AGCGCACCGC TCTCCGCCGG GGATTCCTGG TTCTCCAAGG CGCGGATCGG TGCCGATACG GCCCCGGCGA GCGCCCAGGG CCTCACCGCC GCGGGCAGCG ACCGGGTGGT GGGGGCCTCC ACGGACGCCG GCACGACCGT CTTCGATATC ACCATTTCCC TGCACAACAA CCCGGAGACC GAAAACGAGC GGGATCCCTA CGAGGCCGTG ATCGAACACT TCGCCGACTC GGTCTGCGAG CAGAGCAACG GGGCCAGCCA GCTCGGTACC GTGCGGGTCT TCACCAACGG CGGCCACGGC TCCCGGGCCG ACATTATCTG GAACGAGGAG GAGTGGCCGC GGGCCAGTAT CGCGGGGTTC GGCAGTGCCG GGCGGCACAT CTGGATGGGG GATATCTTCC CCGACGGCTG CGGTAATGGC TGCGACTACG ACATGCTGGC CGATGCCGAG GGCGCCGGCT ACACCCTGGG GCACGAGTGG GGCCACTACG TGCTTGCCCT GTATGATGAG TACGAGGGGC GCGACCCGGC GGAGAATCGC GATACCTTCC CGCAGGTTGG CGATGTGCCC ACCAGCCCGG CCATCATGAA CAGCCAGTGG CAGGCCCGCG GCGGTAACTA CGAGTGGTTG AACCACTCCA CCAGTGACAA CATCGGCGAT CCGGAGGATA CCGCCCAGGG CCGGGTCTAC GGCAAGAGCG GCTGGGAGGT GTTGGTCCAA CCCACCACCG ACGATCCACA GGAGGGTAAC GAGACCGTTC AGCCTGACCG GACCCGGTAT ACGGCCCTGG AGGCCGTGGC GCCGACGGCG GCGGATAACT GGGTGGTCAC CCAGCTCGAT CAGATGGATC ACGGTTGCCG CGATGAGCTG GAGATCGTCT GGATGGACGA TGACCTGGAG ATCTCGCTCA TTGTCGACAC CTCCGGGAGC ATGAGCGGCG CTCCCATCAT CAACGCCCGC ACAGCCGGTC GGACCCTGGT GGATGTGGTC GAGCCTGGCC GTACCGCCAT GGGCGTCGTG CGCTTCTCGG CGAGTGCCTC GGTGGTCCAC CCCATGATCG CCATCCCGGA CCCGGGTACG GCGGAAAAGG ACCAGCTCAA GGACGCCATC GACAGCCTCC CGGCCTCCGG GCTGACCGCC ATGTTCGATG GCCTGATACT GGGCTTGGAC GAACTGCAGG ATTACAGCGC CGCCAACGAT ACCGATGCCG GGCAGGTGGC CTTCCTGCTC TCCGATGGTG GCGACAACAG CTCCGCTGCG ACAGAGCCGC AGACCGTCCA GGCCTACCAG GATGCCAACG TCCCCATCAT CGCCTTCGGC TATGGCAGCT TCGCACCCAC CGGGGTGTTG CGGCGGCTCG CCGATAACAC CGGCGGTGAG TTCTTCGCCT CACCCACGAC CCTGGCCGAG ATCCAGGAGG CCTTCCTGGC GGCCAACGCC GCCGTGTCCG ATGCGGTCAA CCTGAGTCAG GAGTCGCAGC CGGTCGCTGC GGGCGCCAAC GAGCGGCTCA CCTTCACCGT GGACCCCACC CTGGGCTCGA TGACCGTGCT CCTTAACTTC ACCGGCAGTG CCGACCAGTT GTCCCCCACG CTGTTGGACA GTGACGGCAA CGACACCGGA ATCCCGTTCA GCTGCGACGA GTCAGCCGAT GAGGTCTCCT GCCTCGCCAC CGTGGACCGC GACGCGGTAT CGGCGGGTGG CGTCGGCGAC 
TGGACCGTCG ACACCGGGGA GAATACCTCC GGCGGTGAAG TGGAGGTGCT CCTGAATGTG GTGGCCAACC CGGCGGACGG TCGGACCTTT GACGTGCGGG TCAGTACCCT GGGTGGCAGC ACGGTGGAAT ACCCGTCACC CGCGCTGATC TCGGCCGCGA TCTCCGCCGG GCGGATGGTC TCGGGGGTCA ATGTGGTCGC CGAGCTGACC GATCCCGACG GGAACGTCAC CACCGTGCCG CTCAACGACG AGGGCCAGAA CGGTGACGCG GAGGCCGGTG ATGGCATCTA CTCGGCCGTG GTGAACTACC GGCAGGGTGG TACCCACGAG CTGCGCGTCC GGGTGGACAA CCAGGCCGGG ACCGGCCAGT TCGTGTACGG GGGTGTGGCC CCGGCGCCGG ACATCAATGG CATGGAGGTG CAGGCCCCGG ATCCGGAGCC GATCCCGGAG AACTTCCAGC GCGTGGCGAC CACGCAGTTC ACCGTGGATG GTTTTCAGGA CGACGATCAC GCGGATGATC CCGCCCTGCC GGGCGCCTGC ACGCCGCTCG AGGCCGATAA CACGACTATC CCGGGGCGGA TCGATGCCGC CAGCGATAGG GATTGCTTCC GCCTGGTCGG TACCAGCGTC CCGGACGAGG GCGATGTCTC TCTGCGGGTG GCATCCTTTG GCCTGGGCAT GCAGCCCATC GTGACGATCT ACACCGGCGA TGGCAGCGAG GTGCTGCTGA CCTTCAGCCT GGACGATGAG GACTTCGTCG CCCGCAACGG CTACCTCTAC ACCGTCCTGG ATCGGGACTG GCTGGAGACC GCCAATGACG AAGGTATGGT TGGCGGTGCG GAGCTGCAGG ACCTGGTGGT CACGGTCGAG CATGAGGATG AGACCGCCGA CGAGGGCACC TACAAGGTCA GCGCCGGCTC GGTGATCAGC TCGGACCAGC CCGCGGAACC GGACCGGATC ACGGACGAGG ATGAGGAGGA GTTCGAGATG ACCCGCCGGG GTTCCGCCTG CAGCGTGGCG GGCAACAGTG CCGGTGGCCC CGCCGACCCG ACCCTGCCGC TGCTGGCCAT CCTGGCCCTG CTCGGCGTCA TCCTGGGCCG TCGTCGCCAC CGCGCCTGA
|
Protein sequence | MNIRHVDRHV EKRRSWYTGL FCAGLSAAMI SAPLSAGDSW FSKARIGADT APASAQGLTA AGSDRVVGAS TDAGTTVFDI TISLHNNPET ENERDPYEAV IEHFADSVCE QSNGASQLGT VRVFTNGGHG SRADIIWNEE EWPRASIAGF GSAGRHIWMG DIFPDGCGNG CDYDMLADAE GAGYTLGHEW GHYVLALYDE YEGRDPAENR DTFPQVGDVP TSPAIMNSQW QARGGNYEWL NHSTSDNIGD PEDTAQGRVY GKSGWEVLVQ PTTDDPQEGN ETVQPDRTRY TALEAVAPTA ADNWVVTQLD QMDHGCRDEL EIVWMDDDLE ISLIVDTSGS MSGAPIINAR TAGRTLVDVV EPGRTAMGVV RFSASASVVH PMIAIPDPGT AEKDQLKDAI DSLPASGLTA MFDGLILGLD ELQDYSAAND TDAGQVAFLL SDGGDNSSAA TEPQTVQAYQ DANVPIIAFG YGSFAPTGVL RRLADNTGGE FFASPTTLAE IQEAFLAANA AVSDAVNLSQ ESQPVAAGAN ERLTFTVDPT LGSMTVLLNF TGSADQLSPT LLDSDGNDTG IPFSCDESAD EVSCLATVDR DAVSAGGVGD WTVDTGENTS GGEVEVLLNV VANPADGRTF DVRVSTLGGS TVEYPSPALI SAAISAGRMV SGVNVVAELT DPDGNVTTVP LNDEGQNGDA EAGDGIYSAV VNYRQGGTHE LRVRVDNQAG TGQFVYGGVA PAPDINGMEV QAPDPEPIPE NFQRVATTQF TVDGFQDDDH ADDPALPGAC TPLEADNTTI PGRIDAASDR DCFRLVGTSV PDEGDVSLRV ASFGLGMQPI VTIYTGDGSE VLLTFSLDDE DFVARNGYLY TVLDRDWLET ANDEGMVGGA ELQDLVVTVE HEDETADEGT YKVSAGSVIS SDQPAEPDRI TDEDEEEFEM TRRGSACSVA GNSAGGPADP TLPLLAILAL LGVILGRRRH RA
|
| |
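The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline used to build this record. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it relies on translation table 11 sharing the standard codon-to-amino-acid assignments. The helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order.
# Table 11 differs from the standard code only in its permitted start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not remapped to M; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate("ATGAACATCA GACACGTCGA C"))  # MNIRHVD
```

The printed residues match the first seven residues of the protein sequence. Passing the full gene sequence should yield the full protein, provided the sequence text is copied without errors.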