Gene Mvan_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3098 
Symbol 
ID4646854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3267148 
End bp3269070 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content71% 
IMG OID639806575 
Productvon Willebrand factor, type A 
Protein accessionYP_953906 
Protein GI120404077 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.298521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.039864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGGC ATAGCCTCCC CGACCCCGAC GAGTCGGACC AGTCCGGCTC GCCCGCAAGG 
GGTTTCGGCG ACTTCGGCGA ATCCGCTGAC TCCGGTGAGT TCGGCGGCTT CCGAGCCTCC
GATACACCCG GCTCCCCGAC CGCACCCCGG TCGGGTCCGC AGCACAGCGG TGGCTGGGAG
GGCGGCGAAT GGACCGGCAG CCACCGGGCG GTGACACCGG GCCGGCGCAA GGTGAGCCTC
GGCGTGATCG TTGCCCTGGT CGCCGTCGTC GTGGTGGTGG CCACCGTCAT CGTCTGGCGT
TTCGTCGGTG ACGCGTTGTC CGGGCGCTCC GATGTAGCGG CCGCGCGGTG TGTGGAGGGC
GAGGTCGCCG TCGCCGTCGT CGCTGATCCC GCGATCGCCG AGCCGGTCGC TGCGCTCGCC
GAGCGGTACA ACGAGACAGC CGCCCCTGTC GGCGACCGCT GCGTGAAGGT GGGCGTGAAG
TCCGCCGATT CCGACCAGGT GCTCAACGGT TTCTCCGGAC AATGGCCCGG CGATCTCGGT
GAACGTCCAG CGCTGTGGAT TCCGGCGAGT TCGGTGTCGG GCGCCCGGCT CGAGGCGGCG
ACTGGAGCCG AGACGGTCAG CGACAGCCGC TCGCTGGTGA CCTCGCCCGT CGTGCTCGCC
GTCGCGCCTG CGCTCAAAGA TGCTCTGGGT CAACAGAACT GGGGCACGCT TCCGAGGCTG
CAAACCGATC CCGCCGCGCT GGACGGCCTC GGCCTGCAGG GGTGGGGTGG GCTGCGTCTG
GCGCTGCCGC TCGGCGACGA CAGCGATGCC TCCTATCTGG CGGCCGAGGC GATCGCCGCC
GCCGCGGCAC CCTCGGGGGC ACCGGCCAGT GCAGGTCTCG GCGCGGTCAG CACGGTGATG
TCGGGTGCGC CGGAGCTGGC CGACCCCAAT GCGGGCACGG CCATCGATGC CCTGGTCGGC
GCCGCCGACC AGGCCGCCGC ACCCGTGCAC GCGGTGGTGA CCACCGAGCA GCGGGTGTTC
CAGCGCGCAT CCTCGCTGCC CGACTCGAAG GACAAACTGG CCGCCTGGAT TCCACCGGGA
CCGACGGCGA CCGCCGACTT CCCCACCGTG TTGCTGGCCG GGGACTGGCT GTCCCAGGAA
CAGGTCACCG CGGCCAGCGA GTTCGCCCGC TTCATGCGCA AGCCCGAACA GCTGGGCGAG
TTGGCCAAGG CGGGCTTCCG GGTGGAGGGC ACGGCGCCTC CGGCCAGTGA CGTCGTCGAC
TTCGCGCCGG TGTCCGCTCC TCTGGAGGTC GGGGACAACG CGCTGCGGTC CACGATCGCG
GAGACGCTGG CCACTCCGGT GGGAAGCCCG ACGGTGACCG TCATGCTCGA CCAGTCGATG
CCCGTCGAAG AGGGCGGGGT CTCGCGGTTG CAGAACGTCA TCGACGCCCT CAAGGCCCGC
ATCGCGGTGC TCCCTGCCGA TTCCGGGGTC GGGCTGTGGA CGTTCGACGG TGTCCAGGGA
CGCTCGGCGG TCAGCGTCGG ACCGCTGTCG GAGCCGGTGG ACGGCGCGCC GCGCAAGGAA
GCGCTCACCG CGGCACTGGA CTCGCAGTCC CCGTCCGGCG GCGGCGCGGT GTCGTTCACC
ACGCTGCGCC TGGTCTACAC CGACGCGTCG ACGAAATACC GTGAGGGCCA GAAGAATTCG
GTCCTGGTGA TCACCACCGG GCCACACACC GACCAGTCGC TGGGAGCCGC GGGCCTGCAG
GACTACATCC GCGGCGCCTT CAACCGGGAC CGCCCGGTGG CGGTCAACGT GATCGATTTC
GGTGACGACT CCGATCGGGC CACCTGGGAG TCCGTCGCCC AGATCACCGG TGGCAACTAC
CAGAACCTCG GCACCTCGGC GTCCCCGGAG CTGGCGGCGG CCATCTCGTC GATGTTGTCC
TGA
 
Protein sequence
MGRHSLPDPD ESDQSGSPAR GFGDFGESAD SGEFGGFRAS DTPGSPTAPR SGPQHSGGWE 
GGEWTGSHRA VTPGRRKVSL GVIVALVAVV VVVATVIVWR FVGDALSGRS DVAAARCVEG
EVAVAVVADP AIAEPVAALA ERYNETAAPV GDRCVKVGVK SADSDQVLNG FSGQWPGDLG
ERPALWIPAS SVSGARLEAA TGAETVSDSR SLVTSPVVLA VAPALKDALG QQNWGTLPRL
QTDPAALDGL GLQGWGGLRL ALPLGDDSDA SYLAAEAIAA AAAPSGAPAS AGLGAVSTVM
SGAPELADPN AGTAIDALVG AADQAAAPVH AVVTTEQRVF QRASSLPDSK DKLAAWIPPG
PTATADFPTV LLAGDWLSQE QVTAASEFAR FMRKPEQLGE LAKAGFRVEG TAPPASDVVD
FAPVSAPLEV GDNALRSTIA ETLATPVGSP TVTVMLDQSM PVEEGGVSRL QNVIDALKAR
IAVLPADSGV GLWTFDGVQG RSAVSVGPLS EPVDGAPRKE ALTAALDSQS PSGGGAVSFT
TLRLVYTDAS TKYREGQKNS VLVITTGPHT DQSLGAAGLQ DYIRGAFNRD RPVAVNVIDF
GDDSDRATWE SVAQITGGNY QNLGTSASPE LAAAISSMLS