Gene Information |
Locus tag | Mvan_3098 |
Symbol | |
ID | 4646854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3267148 |
End bp | 3269070 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639806575 |
Product | von Willebrand factor, type A |
Protein accession | YP_953906 |
Protein GI | 120404077 |
COG category | |
COG ID | |
TIGRFAM ID | |
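The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA, a stop codon under translation table 11). The function name `check_lengths` is hypothetical and not part of any database tool.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check a CDS record's coordinates against its reported lengths.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not database facts).
    """
    # Inclusive span covered by the coordinates.
    span = end_bp - start_bp + 1
    # A CDS should be a whole number of codons.
    codons, remainder = divmod(gene_len_bp, 3)
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "whole_codons": remainder == 0,
        # One codon is the stop codon and encodes no residue.
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values copied from the Mvan_3098 record.
result = check_lengths(3267148, 3269070, 1923, 640)
print(result)
```

For this record all three checks pass: the coordinates span 1923 bp, which is 641 codons, or 640 residues plus one stop codon.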
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.298521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.039864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGAAGGC ATAGCCTCCC CGACCCCGAC GAGTCGGACC AGTCCGGCTC GCCCGCAAGG GGTTTCGGCG ACTTCGGCGA ATCCGCTGAC TCCGGTGAGT TCGGCGGCTT CCGAGCCTCC GATACACCCG GCTCCCCGAC CGCACCCCGG TCGGGTCCGC AGCACAGCGG TGGCTGGGAG GGCGGCGAAT GGACCGGCAG CCACCGGGCG GTGACACCGG GCCGGCGCAA GGTGAGCCTC GGCGTGATCG TTGCCCTGGT CGCCGTCGTC GTGGTGGTGG CCACCGTCAT CGTCTGGCGT TTCGTCGGTG ACGCGTTGTC CGGGCGCTCC GATGTAGCGG CCGCGCGGTG TGTGGAGGGC GAGGTCGCCG TCGCCGTCGT CGCTGATCCC GCGATCGCCG AGCCGGTCGC TGCGCTCGCC GAGCGGTACA ACGAGACAGC CGCCCCTGTC GGCGACCGCT GCGTGAAGGT GGGCGTGAAG TCCGCCGATT CCGACCAGGT GCTCAACGGT TTCTCCGGAC AATGGCCCGG CGATCTCGGT GAACGTCCAG CGCTGTGGAT TCCGGCGAGT TCGGTGTCGG GCGCCCGGCT CGAGGCGGCG ACTGGAGCCG AGACGGTCAG CGACAGCCGC TCGCTGGTGA CCTCGCCCGT CGTGCTCGCC GTCGCGCCTG CGCTCAAAGA TGCTCTGGGT CAACAGAACT GGGGCACGCT TCCGAGGCTG CAAACCGATC CCGCCGCGCT GGACGGCCTC GGCCTGCAGG GGTGGGGTGG GCTGCGTCTG GCGCTGCCGC TCGGCGACGA CAGCGATGCC TCCTATCTGG CGGCCGAGGC GATCGCCGCC GCCGCGGCAC CCTCGGGGGC ACCGGCCAGT GCAGGTCTCG GCGCGGTCAG CACGGTGATG TCGGGTGCGC CGGAGCTGGC CGACCCCAAT GCGGGCACGG CCATCGATGC CCTGGTCGGC GCCGCCGACC AGGCCGCCGC ACCCGTGCAC GCGGTGGTGA CCACCGAGCA GCGGGTGTTC CAGCGCGCAT CCTCGCTGCC CGACTCGAAG GACAAACTGG CCGCCTGGAT TCCACCGGGA CCGACGGCGA CCGCCGACTT CCCCACCGTG TTGCTGGCCG GGGACTGGCT GTCCCAGGAA CAGGTCACCG CGGCCAGCGA GTTCGCCCGC TTCATGCGCA AGCCCGAACA GCTGGGCGAG TTGGCCAAGG CGGGCTTCCG GGTGGAGGGC ACGGCGCCTC CGGCCAGTGA CGTCGTCGAC TTCGCGCCGG TGTCCGCTCC TCTGGAGGTC GGGGACAACG CGCTGCGGTC CACGATCGCG GAGACGCTGG CCACTCCGGT GGGAAGCCCG ACGGTGACCG TCATGCTCGA CCAGTCGATG CCCGTCGAAG AGGGCGGGGT CTCGCGGTTG CAGAACGTCA TCGACGCCCT CAAGGCCCGC ATCGCGGTGC TCCCTGCCGA TTCCGGGGTC GGGCTGTGGA CGTTCGACGG TGTCCAGGGA CGCTCGGCGG TCAGCGTCGG ACCGCTGTCG GAGCCGGTGG ACGGCGCGCC GCGCAAGGAA GCGCTCACCG CGGCACTGGA CTCGCAGTCC CCGTCCGGCG GCGGCGCGGT GTCGTTCACC ACGCTGCGCC TGGTCTACAC CGACGCGTCG ACGAAATACC GTGAGGGCCA GAAGAATTCG GTCCTGGTGA TCACCACCGG GCCACACACC GACCAGTCGC TGGGAGCCGC GGGCCTGCAG GACTACATCC GCGGCGCCTT CAACCGGGAC CGCCCGGTGG CGGTCAACGT GATCGATTTC 
GGTGACGACT CCGATCGGGC CACCTGGGAG TCCGTCGCCC AGATCACCGG TGGCAACTAC CAGAACCTCG GCACCTCGGC GTCCCCGGAG CTGGCGGCGG CCATCTCGTC GATGTTGTCC TGA
Protein sequence | MGRHSLPDPD ESDQSGSPAR GFGDFGESAD SGEFGGFRAS DTPGSPTAPR SGPQHSGGWE GGEWTGSHRA VTPGRRKVSL GVIVALVAVV VVVATVIVWR FVGDALSGRS DVAAARCVEG EVAVAVVADP AIAEPVAALA ERYNETAAPV GDRCVKVGVK SADSDQVLNG FSGQWPGDLG ERPALWIPAS SVSGARLEAA TGAETVSDSR SLVTSPVVLA VAPALKDALG QQNWGTLPRL QTDPAALDGL GLQGWGGLRL ALPLGDDSDA SYLAAEAIAA AAAPSGAPAS AGLGAVSTVM SGAPELADPN AGTAIDALVG AADQAAAPVH AVVTTEQRVF QRASSLPDSK DKLAAWIPPG PTATADFPTV LLAGDWLSQE QVTAASEFAR FMRKPEQLGE LAKAGFRVEG TAPPASDVVD FAPVSAPLEV GDNALRSTIA ETLATPVGSP TVTVMLDQSM PVEEGGVSRL QNVIDALKAR IAVLPADSGV GLWTFDGVQG RSAVSVGPLS EPVDGAPRKE ALTAALDSQS PSGGGAVSFT TLRLVYTDAS TKYREGQKNS VLVITTGPHT DQSLGAAGLQ DYIRGAFNRD RPVAVNVIDF GDDSDRATWE SVAQITGGNY QNLGTSASPE LAAAISSMLS