Gene Mkms_4399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4399 
Symbol 
ID4612342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4621902 
End bp4623899 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content69% 
IMG OID639794085 
Productvon Willebrand factor, type A 
Protein accessionYP_940380 
Protein GI119870428 
COG category[R] General function prediction only 
COG ID[COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.471327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATG CCCGCGCGCA CAAGGGACAC GGCCGGTCAT CCCGGTACTC CCGCTATACC 
GGCGGGCCGG ATCCGCTTGC CCCGCCGGTG GATCTGCGCG AGGCGCTCGA GCAGATCGGT
GAGGACGTGA TGGAGGGCAG CTCGCCGCGG CGGGCGCTGT CCGAACTGCT GCGGCGCGGC
ACCAAGAACA TGCGCGGGGC CGACCGGCTG GCCGCCGAGG CCAACCGGCG GCGCCGGGAA
CTGTTGAAGC GCAACAACCT CGACGGCACC CTGCAGGAGA TCAAGAAGCT GCTCGACGAG
GCGGTGCTCG CCGAACGCAA GGAACTCGCC CGCGCCCTCG ACGACGACGC GCGGTTCTCC
GAGATGCAGA TCGAGGCGCT GTCCCCGTCG CCGGCCAAGG CCGTCCAGGA ATTGTCGGAC
TACCAGTGGC GCAGCCCCGA GGCCCGCGAG AAGTACGACC AGATCAAGGA TCTGCTCGGC
CGCGAGATGC TCGACCAGCG GTTCGCCGGC ATGAAGGAGG CGCTGGAGAA CGCCACCGAC
GAGGACCGCC AGCGCGTCAA CGACATGCTC GACGACCTCA ACGAGCTGTT GGACAAGCAT
GCGAACGGCC AGGATTCGCA ACAGGATTAC GACGATTTCA TGGCCAAGCA CGGCGAGTTC
TTCCCGGAGA ATCCGCGCAA CGTCGACGAA CTCCTCGACT CGCTGGCCAA ACGCGCCGCG
GCCGCACAGC GCTTCCGCAA CAGCCTCTCC CCGGACCAGC GCGCCGAGCT GGATGCGCTG
GCGCAGCAGG CATTCGGCTC GCCGTCGCTG ATGAACGCGC TCAACAAACT CGACTCCCAT
CTACAGGCGG CGCGCCCAGG TGAGGACTGG TCGGGGTCCT CGGAGTTCTC CGGCGACAAC
CCACTGGGGA TGGGGGAGGG CGCGCAGGCG CTTGCCGACA TCGGTGAGCT CGAACAGCTC
GCCGAGCAGC TGTCGCAGAG CTACGCGGGC GCCACGATGG ACGACGTCGA CCTCGACGCG
CTGGCCCGCC AGCTCGGTGA CCAGGCCGCC GTCGACGCGC GGACGCTGGC CGAACTCGAA
CGCGCCCTGA TGAACCAGGG CTTCCTCGAC CGCGGGTCCG ACGGGAAATG GCGGCTGTCG
CCGAAGGCCA TGCGTCAGCT CGGGCAGGCC GCGCTACGCG ATGTGGCGCA ACAGCTTTCG
GGCCGCCACG GTGAACGCGA CACCCGCAGG GCGGGCGCCG CCGGCGAGCT GACGGGAGCC
ACCCGGCCCT GGCAGTTCGG CGACACCGAA CCGTGGAACG TCACCCGCAC GCTCACCAAC
GCCGTTCTGC GCCAAGCGGG TTCGAGCGTA CGCGAGATCC CGGTGAGCAT CACCGTCGAC
GACGTCGAGA TCTCCGAGAC CGAGACCAGG ACGCAGGCCG CGGTGGCGCT GCTCGTCGAC
ACCTCGTTCT CGATGGTGAT GGAGAACCGG TGGCTGCCCA TGAAGCGGAC CGCGCTGGCG
CTCAACCATC TGGTGAGCAC CCGGTTCCGT TCGGACGCAC TGCAGATCGT CGCGTTCGGC
CGGTACGCCA GGACGGTGAC CGCGGCCGAA CTGACCGGGC TCGAGGGCGT CTACGAACAG
GGCACCAACC TGCACCACGC GCTGGCGCTG GCCACCCGGC ATCTGCGCCG GCACCCCAAC
GCCCAGCCGG TCATCCTCGT GGTCACCGAC GGGGAGCCGA CCGCCCACCT CGAGGACTTC
GGCGGACGCG ACGGCGCACA GGTGTTTTTT GATTACCCGC CGCATCCGCG GACCATCGCC
CACACCGTGC GCGGCTTCGA CGAGGTCGCC CGCCTCGGCG CCCAGGTGAC GATCTTCCGG
TTGGGCACCG ACCCCGGCCT CGCGCGGTTC ATCGACCAGG TCGCCCGCCG CGTGGGCGGC
CGGGTGGTGG TGCCCGACCT CGACGGACTC GGCGCCGCTG TCGTCGGCGA CTACCTGACC
TCACGCCGCC GACGGTAA
 
Protein sequence
MADARAHKGH GRSSRYSRYT GGPDPLAPPV DLREALEQIG EDVMEGSSPR RALSELLRRG 
TKNMRGADRL AAEANRRRRE LLKRNNLDGT LQEIKKLLDE AVLAERKELA RALDDDARFS
EMQIEALSPS PAKAVQELSD YQWRSPEARE KYDQIKDLLG REMLDQRFAG MKEALENATD
EDRQRVNDML DDLNELLDKH ANGQDSQQDY DDFMAKHGEF FPENPRNVDE LLDSLAKRAA
AAQRFRNSLS PDQRAELDAL AQQAFGSPSL MNALNKLDSH LQAARPGEDW SGSSEFSGDN
PLGMGEGAQA LADIGELEQL AEQLSQSYAG ATMDDVDLDA LARQLGDQAA VDARTLAELE
RALMNQGFLD RGSDGKWRLS PKAMRQLGQA ALRDVAQQLS GRHGERDTRR AGAAGELTGA
TRPWQFGDTE PWNVTRTLTN AVLRQAGSSV REIPVSITVD DVEISETETR TQAAVALLVD
TSFSMVMENR WLPMKRTALA LNHLVSTRFR SDALQIVAFG RYARTVTAAE LTGLEGVYEQ
GTNLHHALAL ATRHLRRHPN AQPVILVVTD GEPTAHLEDF GGRDGAQVFF DYPPHPRTIA
HTVRGFDEVA RLGAQVTIFR LGTDPGLARF IDQVARRVGG RVVVPDLDGL GAAVVGDYLT
SRRRR