Gene Mesil_2868 details


Gene Information

Locus tag: Mesil_2868
Symbol:
ID: 9252390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 2931495
End bp: 2932898
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003686217
Protein GI: 297567245
COG category:
COG ID:
TIGRFAM ID:
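
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2931495, 2932898)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both values agree with the Gene Length and Protein Length fields of the record.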


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.566184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTGG TGGAAGCCCT TTTAGAAGCC AGCGTTCAAC CTCACCGGGA GTATCTGCAA 
GCTAACCAAC CCGGCCAGAA GCTGTTTTTG GCCCTCAAGA TTCGCCCCTC CGCCGAGGCT
ACCCGTTCCA GGCCCCAACT CGTGGTGGCC TTCGTCGTGG ACACTTCAGG CTCGATGCGC
GAGGTGGTGA CCGAGCCTAC CGAGCGCACC GGGCAAAGTG TGCGGGTAGA TGGCAAGGAC
TACGAGGTGG TGCGGGGGGC CAAAAGCAAG ATAGACTTGG TGATAGAGGC CCTGCAAAAC
CTGCTGAGCA GCCCGCAGTT ACAGCCCTCG GATCGGCTGG CTATCGTCAA GTTCGACGAC
GTTGCCGAGG TAGTCCAGCC CTTCACCCCC GCCAACGAAA AAGCCCGGTT GGTGGCTGCT
GCCGAGCGGC TCACCCAGTA CTCGGGGGGA ACCCAGATGG GGGCTGGGAT GCGTGAAGGG
ATGCGGCTGC TCGAGCGCGA AGCGGGAAGC CGCCGCTTGA TCCTGCTCAC GGATGGGCAG
ACCTTCGACG AGCCTTTAGT GGAAACGGTC GCTGCCCAAC TGGCCCAAGC TCGTATCCCG
GTGACGGCCA TCGGGGTGGG GGACGAGTGG AACGACGACC TCCTGGCCGA GATCACCGAC
CGCACCCAGG GCAAGCCCTT TCATGTGATT CCGGATAACC AGAACCCCCA GCCGCCCAGC
CTGCGAGCCA GCGAGCTGCC ACAGGCCATC CTGGGCGAAC TCGAGCACGC TGCTTTGGAG
GTTGTGACCA ACGTCACGCT CAGCGTCAAG ACGGTCAGGG ACGTGGCCCT TGAGCGCATC
ACCCGGGTCT ACCCTACCCA GACCGAGGTG GACCGCTCGG TACAGCCGCA CCCTTTGGGC
AACGCCGAAG CCGGGGACTG GACCGTCTAC ATCCTCGAGT TTACCCTGCC TACCCGTGGG
CCCTCGCGCA TCCGGCTCGC CCAGCTCGGG CTCACCTACG AGGTTCCGGG GCAGGGCTAC
CGCGGCGAGC TGCCACCTAT CGACGTGGTG GCCGAGTTCA CCACCAACGA AAGCCTCTCC
TCGCGGATCG ACCCGCAGGT GATGCAGTGG GTGCAGCAGC GCAACATCGA AATGCTGATC
AAGCAGGCTG CCCAGGAGGC CAAGAGCGAC CCGGCCAAGG CCGCCAAGAC CCTCGAGTTG
GCCCGCAACA TGACGGTAAA GCTGGGCAAT TCAGCTATGA CCCAAGCCCT CGACCGGGCG
GTGGGCGAAC TCCGAAGCAG CAAGACCCTC AGCCTGGGCA CCCAGAAGAC CCTCAAGATC
GGGGCCAAGA CCCAGACCCT CAAGGCTGAA CCCGGTAGTG CACTCCCCTC CGACGAGGAG
ATCCGCAAGA TTACCGGAGC CTGA
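
The GC content field of the record can be recomputed from the gene sequence. The sketch below is one straightforward way to do it, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; pass the full sequence text
# to compare the result with the GC content reported in the record.
first_line = "ATGTCTGTGG TGGAAGCCCT TTTAGAAGCC AGCGTTCAAC CTCACCGGGA GTATCTGCAA"
print(f"{gc_content(first_line):.0%}")
```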
 
Protein sequence
MSVVEALLEA SVQPHREYLQ ANQPGQKLFL ALKIRPSAEA TRSRPQLVVA FVVDTSGSMR 
EVVTEPTERT GQSVRVDGKD YEVVRGAKSK IDLVIEALQN LLSSPQLQPS DRLAIVKFDD
VAEVVQPFTP ANEKARLVAA AERLTQYSGG TQMGAGMREG MRLLEREAGS RRLILLTDGQ
TFDEPLVETV AAQLAQARIP VTAIGVGDEW NDDLLAEITD RTQGKPFHVI PDNQNPQPPS
LRASELPQAI LGELEHAALE VVTNVTLSVK TVRDVALERI TRVYPTQTEV DRSVQPHPLG
NAEAGDWTVY ILEFTLPTRG PSRIRLAQLG LTYEVPGQGY RGELPPIDVV AEFTTNESLS
SRIDPQVMQW VQQRNIEMLI KQAAQEAKSD PAKAAKTLEL ARNMTVKLGN SAMTQALDRA
VGELRSSKTL SLGTQKTLKI GAKTQTLKAE PGSALPSDEE IRKITGA
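
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on an external library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Since this gene starts with ATG, no special handling of the first codon is included. The function name `translate` is illustrative.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace is ignored, so space-separated blocks can be passed directly.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 24 bases of the gene sequence.
print(translate("ATGTCTGTGG TGGAAGCCCT TTTAGAAGCC"))  # MSVVEALLEA
print(translate("ATCCGCAAGA TTACCGGAGC CTGA"))        # IRKITGA
```

The two fragments match the first ten and last seven residues of the protein sequence listed in the record.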