Gene Mesil_1914 details

Gene Information

Locus tag: Mesil_1914
Symbol:
ID: 9251426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 1901828
End bp: 1903027
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: transposase, IS605 OrfB family
Protein accession: YP_003685297
Protein GI: 297566325
COG category:
COG ID:
TIGRFAM ID:
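
The start, end, and length fields above can be checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1901828, 1903027)
print(length, protein_length_aa(length))  # 1200 399
```

Under these assumptions the computed values agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.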


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.70549
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTCC GCAAGGTCTA CCGCTTCCGC ATGGAACCCA CCCAAGCCCA AGCTGAGGCT 
TTGCTGCGTA TGGCCGGAGC TCGGCGGTTC GTGTGGAACT GGGGCCTTGC ACGGCGCAAG
GAGGCGTATG CCGCCACCGG GAAGGGGTTG ACCTACAACG GGCAGGCCAC CGAACTCACC
GCCCTGAAGA AGCGGCCTGA GATGGCCTGG CTCAGAGAAG CGGATAGCCA ACTCTTGCAA
CAAGCCCTCC AAGACCTCGA CCGGGCGTTC AAGGCGTTTT TCGAGCGACG GGCCGGGTTC
CCCCGGTTCA AGACCCGAAA GAAGGACCCG CCCCGCTTCC GCATTCCCCA GCGCGTCCGG
GTGGAGGAAG GCAAGGTTTA CCTCCCCAAG GTCGGTGGGG TGAAGATTCG CCAGAGCCAG
CCGATTGATT GCGTCATCAA AGGCGCAACA TTCAAACGCG ACACCGAAGG GCACTGGCAC
GTCACCCTGA CCGCCGAGTT CGAGATGCCC GACGTACCCC TGCCCCCCGT AAACCCTGAG
CATGTGGTGG GGATTGACCT CGGCTTGAAA GACTTCGCAG TGCTTTCCGA CGGTACAAGA
ATAGCTCCAC CCAAGTTCTA CCGCAAAGCA GAGCGCAAGC TTCGCAGGGC GCAAAGGGAG
CTTTCCCGCA AGCAGAAGGG TAGCAAGAAC CGGGAAAAGG CCAGGCATCG GCTGAACAGG
GTCCACGCCA AGGTTCGCAA CCAGCGGCAG GACTGGCTGC ACAAGCTGAC CACCGGACTC
GTCCAGAAGT ACGACGGGCT GTGCATCGAG GACCTGAACC TGAAAGGGAT GGCGAAAACC
AAGCTGTCCA AGTCGGTTCT AGATGCGGCC CCGGGTGAGT TTCGGCGGCA GTTGGAGTAC
AAGGCGGTCT GGTATCGCAA ACATCTCGTC GTGATTGACC GATACTTCCC CAGCAGCAAG
CTTTGTCGGG AGTGCGGGAC AATCCACACC GCCCTCACCC TCTCGGACAG GGTTTGGACC
TGTGAGTGCG GGGCGGTGCA CGACCGAGAC CTGAACGCGG CCCTGAACAT CCGGGCCGAA
GGGATACGAG CTATCCCCGT CGCCGTGGGG CACCCGGAGA CGCTAAACGC TTGGGGAGAG
GGTGTAAGAC CTACACAACG TAGGCAGTCC TCGTCGAACC AAGAATCCCA CGTGCTTTAG
 
Protein sequence
MLLRKVYRFR MEPTQAQAEA LLRMAGARRF VWNWGLARRK EAYAATGKGL TYNGQATELT 
ALKKRPEMAW LREADSQLLQ QALQDLDRAF KAFFERRAGF PRFKTRKKDP PRFRIPQRVR
VEEGKVYLPK VGGVKIRQSQ PIDCVIKGAT FKRDTEGHWH VTLTAEFEMP DVPLPPVNPE
HVVGIDLGLK DFAVLSDGTR IAPPKFYRKA ERKLRRAQRE LSRKQKGSKN REKARHRLNR
VHAKVRNQRQ DWLHKLTTGL VQKYDGLCIE DLNLKGMAKT KLSKSVLDAA PGEFRRQLEY
KAVWYRKHLV VIDRYFPSSK LCRECGTIHT ALTLSDRVWT CECGAVHDRD LNAALNIRAE
GIRAIPVAVG HPETLNAWGE GVRPTQRRQS SSNQESHVL
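
The relationship between the gene and protein sequences can be illustrated with a short Python sketch that translates codons using the amino acid assignments of translation table 11. Those assignments are the same as in the standard genetic code; table 11 differs in its permitted start codons, which this sketch does not model (this gene begins with ATG, so it makes no difference here). The helper names are hypothetical, and only the first row of the gene sequence is used as input. The GC helper is included to show how a figure like the GC content field could be derived, not to reproduce the record's exact method.

```python
from itertools import product

# Standard codon order (T, C, A, G at each position) and the matching
# amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the sequence layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGCTGCTCC GCAAGGTCTA CCGCTTCCGC ATGGAACCCA CCCAAGCCCA AGCTGAGGCT"
print(translate(first_row))  # MLLRKVYRFRMEPTQAQAEA
print(round(gc_fraction(first_row), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would give the complete translation and the gene-wide GC fraction.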