Gene Mjls_4693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4693 
Symbol 
ID4880392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4931072 
End bp4933069 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content70% 
IMG OID640141998 
Productvon Willebrand factor, type A 
Protein accessionYP_001072949 
Protein GI126437258 
COG category[R] General function prediction only 
COG ID[COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.93356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATG CCCGCGCGCA CAAGGGACAC GGCCGGTCAT CCCGGTACTC CCGCTATACC 
GGCGGGCCGG ATCCGCTTGC CCCGCCGGTG GATCTGCGCG AGGCGCTCGA GCAGATCGGT
GAGGACGTGA TGGAGGGCAG CTCGCCGCGG CGGGCGCTGT CCGAACTGCT GCGGCGCGGC
ACCAAGAACA TGCGCGGGGC CGACCGGCTG GCCGCCGAGG CCAACCGGCG GCGCCGGGAA
CTGTTGAAGC GCAACAACCT CGACGGCACC CTGCAGGAGA TCAAGAAGCT GCTCGACGAG
GCGGTGCTCG CCGAACGCAA GGAACTCGCC CGCGCCCTCG ACGACGACGC GCGGTTCTCC
GAGATGCAGA TCGAGGCGCT GTCCCCGTCG CCGGCCAAGG CCGTCCAGGA ATTGTCGGAC
TACCAGTGGC GCAGCCCCGA GGCCCGCGAG AAGTACGACC AGATCAAGGA TCTGCTCGGC
CGCGAGATGC TCGACCAGCG GTTCGCCGGC ATGAAGGAGG CGCTGGAGAA CGCCACCGAC
GAGGACCGCC AGCGCGTCAA CGACATGCTC GACGACCTCA ACGAGCTGTT GGACAAGCAT
GCGAACGGCC AGGATTCGCA ACAGGATTTC GACGATTTCA TGGCCAAGCA CGGCGAGTTC
TTCCCGGAGA ATCCGCGCAA CGTCGACGAA CTCCTCGACT CGCTGGCCAA ACGCGCCGCG
GCCGCGCAGC GCTTCCGCAA CAGCCTCTCC CCGGACCAGC GCGCCGAGCT GGATGCGCTG
GCGCAGCAGG CATTCGGCTC GCCGTCGCTG ATGAACGCGC TCAACAAACT CGACTCCCAT
CTACAGGCGG CGCGCCCGGG TGAGGACTGG TCGGGGTCCT CGGAGTTCTC CGGCGACAAC
CCGCTGGGGA TGGGGGAGGG CGCGCAGGCG CTGGCCGACA TCGGTGAGCT CGAACAGCTC
GCCGAGCAGC TGTCGCAGAG CTACGCGGGC GCCACGATGG ACGACGTCGA CCTCGACGCG
CTGGCCCGCC AGCTCGGTGA CCAGGCCGCC GTCGACGCGC GGACGCTGGC CGAACTCGAA
CGCGCCCTGA TGAACCAGGG CTTCCTCGAC CGCGGGTCCG ACGGGAAATG GCGGCTGTCG
CCGAAGGCCA TGCGTCAGCT CGGGCAGGCC GCGCTACGCG ATGTGGCGCA ACAACTTTCG
GGCCGCCACG GTGAACGTGA CACCCGCAGG GCGGGCGCCG CCGGCGAGCT GACGGGAGCC
ACCCGACCCT GGCAGTTCGG CGACACCGAA CCGTGGAACG TCACCCGCAC GCTCACCAAC
GCCGTTCTGC GCCAAGCGGG TTCGAGCGTA CGCGAGATCC CGGTGAGCAT CACCGTCGAC
GACGTCGAGA TCTCCGAGAC CGAGACCAGG ACGCAGGCCG CGGTGGCGCT GCTCGTCGAC
ACCTCGTTCT CGATGGTGAT GGAGAACCGG TGGCTGCCCA TGAAGCGGAC CGCGCTGGCG
CTCAACCATC TGGTGAGCAC CCGGTTCCGT TCGGACGCAC TGCAGATCGT CGCGTTCGGC
CGGTACGCCA GGACGGTGAC CGCGGCCGAA CTGACCGGGC TCGAGGGCGT CTACGAACAG
GGCACCAACC TGCACCACGC GCTGGCGCTG GCCACCCGGC ATCTGCGCCG GCATCCCAAC
GCCCAGCCGG TCATCCTCGT GGTCACCGAC GGGGAACCGA CCGCCCACCT CGAGGACTTC
GGCGGACGCG ACGGCGCACA GGTGTTTTTC GATTACCCAC CGCATCCGCG GACCATCGCC
CACACCGTGC GCGGCTTCGA CGAGGTCGCC CGCCTCGGCG CCCAGGTGAC GATCTTCCGG
TTGGGCACCG ACCCCGGCCT CGCGCGGTTC ATCGACCAGG TCGCCCGCCG CGTGGGCGGC
CGGGTGGTGG TGCCCGACCT CGACGGGCTC GGCGCCGCTG TCGTCGGCGA CTACCTGACG
TCGCGCCGCC GGCGGTAA
 
Protein sequence
MADARAHKGH GRSSRYSRYT GGPDPLAPPV DLREALEQIG EDVMEGSSPR RALSELLRRG 
TKNMRGADRL AAEANRRRRE LLKRNNLDGT LQEIKKLLDE AVLAERKELA RALDDDARFS
EMQIEALSPS PAKAVQELSD YQWRSPEARE KYDQIKDLLG REMLDQRFAG MKEALENATD
EDRQRVNDML DDLNELLDKH ANGQDSQQDF DDFMAKHGEF FPENPRNVDE LLDSLAKRAA
AAQRFRNSLS PDQRAELDAL AQQAFGSPSL MNALNKLDSH LQAARPGEDW SGSSEFSGDN
PLGMGEGAQA LADIGELEQL AEQLSQSYAG ATMDDVDLDA LARQLGDQAA VDARTLAELE
RALMNQGFLD RGSDGKWRLS PKAMRQLGQA ALRDVAQQLS GRHGERDTRR AGAAGELTGA
TRPWQFGDTE PWNVTRTLTN AVLRQAGSSV REIPVSITVD DVEISETETR TQAAVALLVD
TSFSMVMENR WLPMKRTALA LNHLVSTRFR SDALQIVAFG RYARTVTAAE LTGLEGVYEQ
GTNLHHALAL ATRHLRRHPN AQPVILVVTD GEPTAHLEDF GGRDGAQVFF DYPPHPRTIA
HTVRGFDEVA RLGAQVTIFR LGTDPGLARF IDQVARRVGG RVVVPDLDGL GAAVVGDYLT
SRRRR