Gene Mmcs_4313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4313 
Symbol 
ID4113143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4582697 
End bp4584694 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content69% 
IMG OID638033459 
Productvon Willebrand factor, type A 
Protein accessionYP_641474 
Protein GI108801277 
COG category[R] General function prediction only 
COG ID[COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.844585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATG CCCGCGCGCA CAAGGGACAC GGCCGGTCAT CCCGGTACTC CCGCTATACC 
GGCGGGCCGG ATCCGCTTGC CCCGCCGGTG GATCTGCGCG AGGCGCTCGA GCAGATCGGT
GAGGACGTGA TGGAGGGCAG CTCGCCGCGG CGGGCGCTGT CCGAACTGCT GCGGCGCGGC
ACCAAGAACA TGCGCGGGGC CGACCGGCTG GCCGCCGAGG CCAACCGGCG GCGCCGGGAA
CTGTTGAAGC GCAACAACCT CGACGGCACC CTGCAGGAGA TCAAGAAGCT GCTCGACGAG
GCGGTGCTCG CCGAACGCAA GGAACTCGCC CGCGCCCTCG ACGACGACGC GCGGTTCTCC
GAGATGCAGA TCGAGGCGCT GTCCCCGTCG CCGGCCAAGG CCGTCCAGGA ATTGTCGGAC
TACCAGTGGC GCAGCCCCGA GGCCCGCGAG AAGTACGACC AGATCAAGGA TCTGCTCGGC
CGCGAGATGC TCGACCAGCG GTTCGCCGGC ATGAAGGAGG CGCTGGAGAA CGCCACCGAC
GAGGACCGCC AGCGCGTCAA CGACATGCTC GACGACCTCA ACGAGCTGTT GGACAAGCAT
GCGAACGGCC AGGATTCGCA ACAGGATTAC GACGATTTCA TGGCCAAGCA CGGCGAGTTC
TTCCCGGAGA ATCCGCGCAA CGTCGACGAA CTCCTCGACT CGCTGGCCAA ACGCGCCGCG
GCCGCACAGC GCTTCCGCAA CAGCCTCTCC CCGGACCAGC GCGCCGAGCT GGATGCGCTG
GCGCAGCAGG CATTCGGCTC GCCGTCGCTG ATGAACGCGC TCAACAAACT CGACTCCCAT
CTACAGGCGG CGCGCCCAGG TGAGGACTGG TCGGGGTCCT CGGAGTTCTC CGGCGACAAC
CCACTGGGGA TGGGGGAGGG CGCGCAGGCG CTTGCCGACA TCGGTGAGCT CGAACAGCTC
GCCGAGCAGC TGTCGCAGAG CTACGCGGGC GCCACGATGG ACGACGTCGA CCTCGACGCG
CTGGCCCGCC AGCTCGGTGA CCAGGCCGCC GTCGACGCGC GGACGCTGGC CGAACTCGAA
CGCGCCCTGA TGAACCAGGG CTTCCTCGAC CGCGGGTCCG ACGGGAAATG GCGGCTGTCG
CCGAAGGCCA TGCGTCAGCT CGGGCAGGCC GCGCTACGCG ATGTGGCGCA ACAGCTTTCG
GGCCGCCACG GTGAACGCGA CACCCGCAGG GCGGGCGCCG CCGGCGAGCT GACGGGAGCC
ACCCGGCCCT GGCAGTTCGG CGACACCGAA CCGTGGAACG TCACCCGCAC GCTCACCAAC
GCCGTTCTGC GCCAAGCGGG TTCGAGCGTA CGCGAGATCC CGGTGAGCAT CACCGTCGAC
GACGTCGAGA TCTCCGAGAC CGAGACCAGG ACGCAGGCCG CGGTGGCGCT GCTCGTCGAC
ACCTCGTTCT CGATGGTGAT GGAGAACCGG TGGCTGCCCA TGAAGCGGAC CGCGCTGGCG
CTCAACCATC TGGTGAGCAC CCGGTTCCGT TCGGACGCAC TGCAGATCGT CGCGTTCGGC
CGGTACGCCA GGACGGTGAC CGCGGCCGAA CTGACCGGGC TCGAGGGCGT CTACGAACAG
GGCACCAACC TGCACCACGC GCTGGCGCTG GCCACCCGGC ATCTGCGCCG GCACCCCAAC
GCCCAGCCGG TCATCCTCGT GGTCACCGAC GGGGAGCCGA CCGCCCACCT CGAGGACTTC
GGCGGACGCG ACGGCGCACA GGTGTTTTTT GATTACCCGC CGCATCCGCG GACCATCGCC
CACACCGTGC GCGGCTTCGA CGAGGTCGCC CGCCTCGGCG CCCAGGTGAC GATCTTCCGG
TTGGGCACCG ACCCCGGCCT CGCGCGGTTC ATCGACCAGG TCGCCCGCCG CGTGGGCGGC
CGGGTGGTGG TGCCCGACCT CGACGGACTC GGCGCCGCTG TCGTCGGCGA CTACCTGACC
TCACGCCGCC GACGGTAA
 
Protein sequence
MADARAHKGH GRSSRYSRYT GGPDPLAPPV DLREALEQIG EDVMEGSSPR RALSELLRRG 
TKNMRGADRL AAEANRRRRE LLKRNNLDGT LQEIKKLLDE AVLAERKELA RALDDDARFS
EMQIEALSPS PAKAVQELSD YQWRSPEARE KYDQIKDLLG REMLDQRFAG MKEALENATD
EDRQRVNDML DDLNELLDKH ANGQDSQQDY DDFMAKHGEF FPENPRNVDE LLDSLAKRAA
AAQRFRNSLS PDQRAELDAL AQQAFGSPSL MNALNKLDSH LQAARPGEDW SGSSEFSGDN
PLGMGEGAQA LADIGELEQL AEQLSQSYAG ATMDDVDLDA LARQLGDQAA VDARTLAELE
RALMNQGFLD RGSDGKWRLS PKAMRQLGQA ALRDVAQQLS GRHGERDTRR AGAAGELTGA
TRPWQFGDTE PWNVTRTLTN AVLRQAGSSV REIPVSITVD DVEISETETR TQAAVALLVD
TSFSMVMENR WLPMKRTALA LNHLVSTRFR SDALQIVAFG RYARTVTAAE LTGLEGVYEQ
GTNLHHALAL ATRHLRRHPN AQPVILVVTD GEPTAHLEDF GGRDGAQVFF DYPPHPRTIA
HTVRGFDEVA RLGAQVTIFR LGTDPGLARF IDQVARRVGG RVVVPDLDGL GAAVVGDYLT
SRRRR