Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4853 |
Symbol | |
ID | 4646419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5191799 |
End bp | 5193805 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639808324 |
Product | von Willebrand factor, type A |
Protein accession | YP_955632 |
Protein GI | 120405803 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.381001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAC ATGTTCGGCT GTCGCGGTAT TCGCGGTACA CGGGCGGGCC CGACCCGCTG GCGCCGCCCG TCGACCTGCG CGAGGCGCTC GAGGCGATCG GTCAGGACGT CATGGAGGGC ACGTCGCCGC GGCGCGCGCT GTCGGAGATG CTCCGGCGCG GCACTCAGAA CATGCCGGGC GCCGACAAGC TCGCCGCCGA AGCCAACCGT CGCCGTCGGG AACTGCTGCA GCGCAACAAT CTCGACGGCA CCCTGGCCGA CATCAAGAAG CTGCTCGACG AGGCGGTGCT CGCCGAACGC AAGGAACTGG CGCGTGCGCT CGACGACGAC GCACGGTTCG GTGAGCTGCA GCTGAACTCG CTGTCCCCGT CCCCGGCCAA AGCCGTGCAG GAGCTTTCCG ATTATGACTG GCGGTCGCCG GAGGCGCGCG AGAAGTACGA CCAGATCAAG GATCTGCTCG GACGCGAGAT GCTCGATCAG CGGTTCGCCG GCATGAAGGA GGCGCTGGAG AACGCCACCG ACGAGGATCG CCAGCGGGTC AACGACATGC TCGACGATCT CAACGACCTG CTCGACAAGC ACGCCAAAGG ACAAGATTCC CCCCAGGATT TCCAGGACTT CATGGCCAAG CACGGCCAGT TCTTCCCGGA GAACCCGAAG AACATCGACG AGCTGCTGGA CTCGCTGGCC AAGCGGGCGG CGGCCGCGCA GCGGTTCCGC AACAGCCTGT CCGAGCAGCA GCGCGCCGAG CTGGATGCGT TGGCGCAGCA GGCTTTCGGG TCACCGTCGT TGATGAACGC GCTCGACCGT CTCGACGCCC ACCTGCAGTC CGCCCGGCCC GGGGAGGATT GGACGGGCTC GCAACGGTTC TCCGGTGACG ACCCGCTCGG CATGGGGGAG GGGGCACAGG CGCTGGCCGA CATCGCCGAG CTCGAGCAGC TCGCCGAACA GCTGTCGCAG AGCTACTCCG GCGCGACCAT GGACGACGTC GACCTTGACA TGCTGGCGCG TCAACTCGGC GACGAGGCCG CCGTCAACGC CCGCACCCTG GCGGAGCTGG AGCGGGCGTT GATGAACCAG GGCTTCCTCG ACCGCGGATC CGACGGGCAG TGGCGGCTCT CACCCAAGGC GATGCGACAG CTCGGTCAGG CGGCGTTACG TGATGTGGCG CAACAGCTTT CCGGGCGGCA CGGCGAGCGC GACACCCGCC GCGCCGGGGC GGCCGGAGAG CTGACCGGAG CCACCCGGCC GTGGCAGTTC GGCGACACCG AACCGTGGAA CGTCACCCGC ACGGTAACGA ATGCAGTGCT CCGCACTGCG GGTACCAACG CGGTGCTGCG CGAGGCCGGG ACCGGGGAGA CCGCCGGGCC GATTCGGATC ACGGTGGACG ACGTCGAGAT CTCCGAGACC GAAACCCGCA CCCAGGCCGC GGTGGCGCTG CTGGTGGATA CGTCGTTCTC GATGGTGATG GAGAACCGTT GGCTGCCGAT GAAGCGGACC GCGCTGGCGC TCAACCACCT CGTCAGCACC CGGTTCCGTT CCGACGCGCT GCAGATCGTG GCGTTCGGCC GGTATGCCCG CACCGTGACC GCCGCGGAGC TGACCGGGCT GGAAGGCGTC TACGAGCAGG GCACGAATCT GCACCACGCA CTGGCGCTGG CCACCCGGCA CCTGCGGCGG CACCCCAACG CGCAGCCGGT CGTGCTGGTG GTCACCGACG GTGAGCCGAC TGCGCATCTG GAGGACTTCG ACGGCGACGG CCGTTCTGCG GTGTTTTTTG ACTACCCTCC GCACCCGCGG ACCATCGCCC ACACCGTGAA GGGATTCGAC GAGGTGGCGC GGATGGGCGC GCAGGTGACG ATCTTCCGGC TCGGCAGCGA CCCGGGGCTG GCACGGTTCA TCGATCAGGT GGCGCGCCGG GTGGGCGGTC GCGTGGTGGT GCCCGACCTC GACGGCCTGG GCGCGGCGGT GGTGGGTGAC TATCTCAGCG CGAAGCGGCG TCGTTAA
|
Protein sequence | MAKHVRLSRY SRYTGGPDPL APPVDLREAL EAIGQDVMEG TSPRRALSEM LRRGTQNMPG ADKLAAEANR RRRELLQRNN LDGTLADIKK LLDEAVLAER KELARALDDD ARFGELQLNS LSPSPAKAVQ ELSDYDWRSP EAREKYDQIK DLLGREMLDQ RFAGMKEALE NATDEDRQRV NDMLDDLNDL LDKHAKGQDS PQDFQDFMAK HGQFFPENPK NIDELLDSLA KRAAAAQRFR NSLSEQQRAE LDALAQQAFG SPSLMNALDR LDAHLQSARP GEDWTGSQRF SGDDPLGMGE GAQALADIAE LEQLAEQLSQ SYSGATMDDV DLDMLARQLG DEAAVNARTL AELERALMNQ GFLDRGSDGQ WRLSPKAMRQ LGQAALRDVA QQLSGRHGER DTRRAGAAGE LTGATRPWQF GDTEPWNVTR TVTNAVLRTA GTNAVLREAG TGETAGPIRI TVDDVEISET ETRTQAAVAL LVDTSFSMVM ENRWLPMKRT ALALNHLVST RFRSDALQIV AFGRYARTVT AAELTGLEGV YEQGTNLHHA LALATRHLRR HPNAQPVVLV VTDGEPTAHL EDFDGDGRSA VFFDYPPHPR TIAHTVKGFD EVARMGAQVT IFRLGSDPGL ARFIDQVARR VGGRVVVPDL DGLGAAVVGD YLSAKRRR
|
| |