Gene Mvan_4853 details

Gene Information

Locus tag: Mvan_4853
Symbol:
ID: 4646419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5191799
End bp: 5193805
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 69%
IMG OID: 639808324
Product: von Willebrand factor, type A
Protein accession: YP_955632
Protein GI: 120405803
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
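
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5191799, 5193805)
print(length, protein_length_aa(length))  # prints: 2007 668
```

Under these assumptions the computed values agree with the Gene Length (2007 bp) and Protein Length (668 aa) fields of this record.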


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.381001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAC ATGTTCGGCT GTCGCGGTAT TCGCGGTACA CGGGCGGGCC CGACCCGCTG 
GCGCCGCCCG TCGACCTGCG CGAGGCGCTC GAGGCGATCG GTCAGGACGT CATGGAGGGC
ACGTCGCCGC GGCGCGCGCT GTCGGAGATG CTCCGGCGCG GCACTCAGAA CATGCCGGGC
GCCGACAAGC TCGCCGCCGA AGCCAACCGT CGCCGTCGGG AACTGCTGCA GCGCAACAAT
CTCGACGGCA CCCTGGCCGA CATCAAGAAG CTGCTCGACG AGGCGGTGCT CGCCGAACGC
AAGGAACTGG CGCGTGCGCT CGACGACGAC GCACGGTTCG GTGAGCTGCA GCTGAACTCG
CTGTCCCCGT CCCCGGCCAA AGCCGTGCAG GAGCTTTCCG ATTATGACTG GCGGTCGCCG
GAGGCGCGCG AGAAGTACGA CCAGATCAAG GATCTGCTCG GACGCGAGAT GCTCGATCAG
CGGTTCGCCG GCATGAAGGA GGCGCTGGAG AACGCCACCG ACGAGGATCG CCAGCGGGTC
AACGACATGC TCGACGATCT CAACGACCTG CTCGACAAGC ACGCCAAAGG ACAAGATTCC
CCCCAGGATT TCCAGGACTT CATGGCCAAG CACGGCCAGT TCTTCCCGGA GAACCCGAAG
AACATCGACG AGCTGCTGGA CTCGCTGGCC AAGCGGGCGG CGGCCGCGCA GCGGTTCCGC
AACAGCCTGT CCGAGCAGCA GCGCGCCGAG CTGGATGCGT TGGCGCAGCA GGCTTTCGGG
TCACCGTCGT TGATGAACGC GCTCGACCGT CTCGACGCCC ACCTGCAGTC CGCCCGGCCC
GGGGAGGATT GGACGGGCTC GCAACGGTTC TCCGGTGACG ACCCGCTCGG CATGGGGGAG
GGGGCACAGG CGCTGGCCGA CATCGCCGAG CTCGAGCAGC TCGCCGAACA GCTGTCGCAG
AGCTACTCCG GCGCGACCAT GGACGACGTC GACCTTGACA TGCTGGCGCG TCAACTCGGC
GACGAGGCCG CCGTCAACGC CCGCACCCTG GCGGAGCTGG AGCGGGCGTT GATGAACCAG
GGCTTCCTCG ACCGCGGATC CGACGGGCAG TGGCGGCTCT CACCCAAGGC GATGCGACAG
CTCGGTCAGG CGGCGTTACG TGATGTGGCG CAACAGCTTT CCGGGCGGCA CGGCGAGCGC
GACACCCGCC GCGCCGGGGC GGCCGGAGAG CTGACCGGAG CCACCCGGCC GTGGCAGTTC
GGCGACACCG AACCGTGGAA CGTCACCCGC ACGGTAACGA ATGCAGTGCT CCGCACTGCG
GGTACCAACG CGGTGCTGCG CGAGGCCGGG ACCGGGGAGA CCGCCGGGCC GATTCGGATC
ACGGTGGACG ACGTCGAGAT CTCCGAGACC GAAACCCGCA CCCAGGCCGC GGTGGCGCTG
CTGGTGGATA CGTCGTTCTC GATGGTGATG GAGAACCGTT GGCTGCCGAT GAAGCGGACC
GCGCTGGCGC TCAACCACCT CGTCAGCACC CGGTTCCGTT CCGACGCGCT GCAGATCGTG
GCGTTCGGCC GGTATGCCCG CACCGTGACC GCCGCGGAGC TGACCGGGCT GGAAGGCGTC
TACGAGCAGG GCACGAATCT GCACCACGCA CTGGCGCTGG CCACCCGGCA CCTGCGGCGG
CACCCCAACG CGCAGCCGGT CGTGCTGGTG GTCACCGACG GTGAGCCGAC TGCGCATCTG
GAGGACTTCG ACGGCGACGG CCGTTCTGCG GTGTTTTTTG ACTACCCTCC GCACCCGCGG
ACCATCGCCC ACACCGTGAA GGGATTCGAC GAGGTGGCGC GGATGGGCGC GCAGGTGACG
ATCTTCCGGC TCGGCAGCGA CCCGGGGCTG GCACGGTTCA TCGATCAGGT GGCGCGCCGG
GTGGGCGGTC GCGTGGTGGT GCCCGACCTC GACGGCCTGG GCGCGGCGGT GGTGGGTGAC
TATCTCAGCG CGAAGCGGCG TCGTTAA
 
Protein sequence
MAKHVRLSRY SRYTGGPDPL APPVDLREAL EAIGQDVMEG TSPRRALSEM LRRGTQNMPG 
ADKLAAEANR RRRELLQRNN LDGTLADIKK LLDEAVLAER KELARALDDD ARFGELQLNS
LSPSPAKAVQ ELSDYDWRSP EAREKYDQIK DLLGREMLDQ RFAGMKEALE NATDEDRQRV
NDMLDDLNDL LDKHAKGQDS PQDFQDFMAK HGQFFPENPK NIDELLDSLA KRAAAAQRFR
NSLSEQQRAE LDALAQQAFG SPSLMNALDR LDAHLQSARP GEDWTGSQRF SGDDPLGMGE
GAQALADIAE LEQLAEQLSQ SYSGATMDDV DLDMLARQLG DEAAVNARTL AELERALMNQ
GFLDRGSDGQ WRLSPKAMRQ LGQAALRDVA QQLSGRHGER DTRRAGAAGE LTGATRPWQF
GDTEPWNVTR TVTNAVLRTA GTNAVLREAG TGETAGPIRI TVDDVEISET ETRTQAAVAL
LVDTSFSMVM ENRWLPMKRT ALALNHLVST RFRSDALQIV AFGRYARTVT AAELTGLEGV
YEQGTNLHHA LALATRHLRR HPNAQPVVLV VTDGEPTAHL EDFDGDGRSA VFFDYPPHPR
TIAHTVKGFD EVARMGAQVT IFRLGSDPGL ARFIDQVARR VGGRVVVPDL DGLGAAVVGD
YLSAKRRR
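
The sequences above are printed in blocks of 10 characters separated by spaces. As a rough illustration of how such text might be prepared for analysis, the sketch below strips the whitespace, computes a GC fraction, and checks for a terminal stop codon. The helper names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The final line of the gene sequence from this record, used as a small sample.
tail = clean_sequence("TATCTCAGCG CGAAGCGGCG TCGTTAA")
print(len(tail), ends_with_stop(tail))  # prints: 27 True
```

Applying `clean_sequence` to the full gene sequence before calling `gc_fraction` would give a value to compare against the GC content field; the sample here only covers the last 27 bases.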