Gene Mesil_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_1037 
Symbol 
ID9250530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp1026418 
End bp1027647 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content62% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003684450 
Protein GI297565478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0570191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAGA TCGGAGTATT CCTATGCGCC GGCGGGCTGC TGGCCTTGGC CGGGTGCAGC 
ACCTCGCCAC AAACACAAAG CACGGCCCTA GCCGTCTCGG GGGTGGTAGG GGGTAGCGCC
AGCATGCTCA CCCTGAACGG CCAGGTGCTC GACCTGAGCA GCGCTAAAGT GACCCTAAAC
GGCGAAGACG CCAGCGCGGC GGCGGTCAAG GCCGGGATGG AGATCTCCGG AAGCGGCACC
CTGGAGGGCG GCAAGGTCAA GGTGCGCGAC CTCGAGGTGC GCTACCGGGC CCAAGGGCAG
GCCGATCAGG TGGACCTAGC CGGGAAGTTC GTCGTGGTAG CTGGGCTCAA AGCTTTCGTC
ACCGACAAGA CCCTGATCTT CCAGGAAAAC GCCGATGGAA CCGAAACCGC CCTGACCCTG
GCCGACCTGG CCGCGGGCGA TTACCTGAAA GTAGCCGGAA TTCCCCGACC CCAAGACCCC
GACGATGCCA TCCTGGCGAC GCGGCTCGAG CGCGAAAGCA GCGATGATCC TAACCGCGTA
GAGCTGATGG TTCCGATCCG CAAGCTGAAC CCGACCGCCC AGACTTTCAC CTACGGGCTA
CAGACCTATA CTGTAGATTA CAGCAAGGCC ACGCTGCGGG GGACGCTGGT GGAGGGCGCG
GTCGTCCGCT TTAGGGGAAG CAAATCCGGC ACCACCATCA CCGCCCTTCG GGTGCGGGCT
ATCGAAGGGA AAGAGAAGCC GGATGTCCCC GACGGCACCC GGATCGAACT CCGGGGCCTG
GTCGGTAACC TCAACCCCAC TACCCAGACC TTCCAGGTCG AGGGGCTAAG CGTGGATTAC
TCCGCCGCCA CCGTGATCGG CACGCTCGCT AATGGGATCG AGGTAGAGGT AAAGGGGACG
CTTTCGGGCA CCACCGTCAA AGCAATTCGG GTCAAAGTAA CCGCGCAAGA TGATGAAGAA
CCCGGCAATG CCGAACTCGA GGGGACCATC GCCAACTTCA ACGCCACCGC CAAAACGTTC
ACGGTCAATG GGGTCACCGT ATCGGTGAAC GATCAGACCG TGTACCAGCA GGACAAGATG
GGCGGCCAAA GCCTGGCCGA GGAAAATAAA GGTAAACACC TCACCGCCCA GGAGTTCTGG
GGCAGCGACC GGACCGGCCA AAGGGTCGAG GTCAAGGGGG TTCCGAGCAG TAGCACCGCG
CTGCTGGCCC GCAAGATCGA GTTAAAATAA
 
Protein sequence
MRKIGVFLCA GGLLALAGCS TSPQTQSTAL AVSGVVGGSA SMLTLNGQVL DLSSAKVTLN 
GEDASAAAVK AGMEISGSGT LEGGKVKVRD LEVRYRAQGQ ADQVDLAGKF VVVAGLKAFV
TDKTLIFQEN ADGTETALTL ADLAAGDYLK VAGIPRPQDP DDAILATRLE RESSDDPNRV
ELMVPIRKLN PTAQTFTYGL QTYTVDYSKA TLRGTLVEGA VVRFRGSKSG TTITALRVRA
IEGKEKPDVP DGTRIELRGL VGNLNPTTQT FQVEGLSVDY SAATVIGTLA NGIEVEVKGT
LSGTTVKAIR VKVTAQDDEE PGNAELEGTI ANFNATAKTF TVNGVTVSVN DQTVYQQDKM
GGQSLAEENK GKHLTAQEFW GSDRTGQRVE VKGVPSSSTA LLARKIELK