Gene Mesil_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMesil_3074 
Symbol 
ID9252597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus silvanus DSM 9946 
KingdomBacteria 
Replicon accessionNC_014212 
Strand
Start bp3119476 
End bp3121341 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content62% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003686416 
Protein GI297567444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000149398 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.300897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGAA GGGTTATACA AGCTTTCGTG GGGGTAGGGT TGTTGGCCGG TTTGGCCCTG 
GCCGGGCCGC AGGACAACAG CTTGGTGGTG GGCACCTCGC AGGAGCCCCG GGTGCTGGTG
GGGGACTTTG TGAACGCCAT CAGCAACCAG AGCATCAAGT TCGAGATCGA GAACTTTCTT
CTTCCTCCCC TGATCCAGAC CAGCCGCGAC GTGGAGAATA TGCCGGTCCT GGTCACCGAG
GCCCCCACCG TGGCCAACAA GCGGGTGCGT TTCAGCAACC TGCCGGGGGG CAAACGGAGG
CTCGAGATTG ACCTCACCCT GCGCGAAGGC GCGGTCTGGT CCGACGGAAC CCCCATCACC
ACTGACGACG TAGCGCTGTA CTACGACTTC GGTAAGACCA AAGGGGTGCC TACCACCTCG
CCCGACTACT GGGACCGGGT GGGCTTGCAG GTCAAGGACA AGCGGAACTT CACCGTTACC
TTCGAGCCCG CTTATTTCTA TGACCTCGAC GGAAACCCCA TCGGGTATGC CCCGGCCCAC
ATCATGCGGG CCGAGTGGGA GAAGGCCAAG GCAGCGGCCC AAGGCCGCGA TGCTGCCGGG
CAGGCCGAGG TGTTTCGCAA CTTCTTCACC CAGTACGCCT CGCCACAAGC GCTCAACGCG
GGCAAGATGG TCTACTCCGG TCCCTTCATA CTCAAGCGCT GGGTCCCCGG CAATACCATC
GAGCTGGTGC GCAACCCGCG CTTTTTCATC ACCCCTCCGG GTGGGGCCGA CAAGTACGTA
CAGAAGGTGA CCTACCGCAT CATCCAGAAC ACCAACTCCC TGTTGGTGGC GATCCTGGGT
GGGGGGATTG ATGCTTCCTC GGGGGTCTCG CTGACCTTCG ACCAAGGGCG TGCCCCCCAG
CTCACCCGCC GGGCCGAGGG CCGTTTCGAG GTGTGGTTCG TGCCCACCCC CTTCTTCGAG
CACATCGAGG TTAACCAGTT CACCAACCTC GAGCAAGTCA AAAACTTGGG TCTGGCGGAT
AAGCGCACCC GCCAGGCGTT GATGTATGCG ATCAACCGTG AGGCCATCAA CAAGGCCTTC
TTTGAGGGTC TGCAACCAAT AGCCCATTCC TGGGTGTTCC CACAGAACCC CATGTATAAC
CCCAACGTGC GCCGCTATGA GTACAACCCC GACAAAGCCC GCGCGCTGTT GGCCGAGTTG
GGCTGGAAGC CGGGGCCGGA CGGCATCCTG CAGCGCACCG TGGAGGGCAA AACCGTGCGC
TTCGAACTCG AGTACCAGAC CACCGCTGGA AACGCCGTGC GCGAGCGCAT TCAGCAGTTC
ATTCAGGACA ACTTGCGTCA GGTGGGGATC GCGGTCAAGA TCAATAACGC CCCCTCGGCG
GTGGTGCTGG GCCCCAATCG CGCTCGGGCT CAGGACGGAG CCTGGACTGG CTTTTTGCAG
TTTGCCTTCA GCATGGGGTT GCAAGACGAT GGGGTGCGCT CGGCTTGCCG TGACGAGGAA
GGCAAGCAGA TTTTTGTGCC CACCAAGGAA AACGGCTACC GCGGCACCAA TTTTGGCGGC
TGGTGCAACG CCGACTTCGA TAAGTTGCGG GCTCAGGCAG TGGTGGAGTT CGATGTGGCC
AAGCGTAAAG CCCTCTTCGC CCAGATGCAA GCTATCTGGG CCGAGGAGGT GGCGATGATC
CCCCTGTACT TCCAGGCGGA TCCGCGGGTC TTCCGCAAGG GGCTTGTGAA CTGGGTCTCC
TCGACTTTCG CCAGCTCGGG CTCGCCTACC GTAGAGCCCT GGCTGATCGG CTGGGAGCAG
CGGGGGGCGC AGAAGGTCTA CGATCAGGCT AAATATGCCC TAACCATTCC CCCGGCCAGC
CGCTGA
 
Protein sequence
MRRRVIQAFV GVGLLAGLAL AGPQDNSLVV GTSQEPRVLV GDFVNAISNQ SIKFEIENFL 
LPPLIQTSRD VENMPVLVTE APTVANKRVR FSNLPGGKRR LEIDLTLREG AVWSDGTPIT
TDDVALYYDF GKTKGVPTTS PDYWDRVGLQ VKDKRNFTVT FEPAYFYDLD GNPIGYAPAH
IMRAEWEKAK AAAQGRDAAG QAEVFRNFFT QYASPQALNA GKMVYSGPFI LKRWVPGNTI
ELVRNPRFFI TPPGGADKYV QKVTYRIIQN TNSLLVAILG GGIDASSGVS LTFDQGRAPQ
LTRRAEGRFE VWFVPTPFFE HIEVNQFTNL EQVKNLGLAD KRTRQALMYA INREAINKAF
FEGLQPIAHS WVFPQNPMYN PNVRRYEYNP DKARALLAEL GWKPGPDGIL QRTVEGKTVR
FELEYQTTAG NAVRERIQQF IQDNLRQVGI AVKINNAPSA VVLGPNRARA QDGAWTGFLQ
FAFSMGLQDD GVRSACRDEE GKQIFVPTKE NGYRGTNFGG WCNADFDKLR AQAVVEFDVA
KRKALFAQMQ AIWAEEVAMI PLYFQADPRV FRKGLVNWVS STFASSGSPT VEPWLIGWEQ
RGAQKVYDQA KYALTIPPAS R