Gene Smal_1034 details


Gene Information

Locus tag: Smal_1034
Symbol:
ID: 6478461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 1182895
End bp: 1186275
Gene Length: 3381 bp
Protein Length: 1126 aa
Translation table: 11
GC content: 58%
IMG OID: 642730198
Product: type III restriction protein res subunit
Protein accession: YP_002027422
Protein GI: 194364812
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
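
The coordinates and lengths in the record above are mutually consistent, and the arithmetic can be checked with a short sketch. The function names here are illustrative and not part of any database API. The sketch assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon.

```python
# Values copied from the Gene Information record above.
START_BP = 1182895
END_BP = 1186275


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3381 1126
```

Both results match the Gene Length and Protein Length fields in the record.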


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0485696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCCG TCAATTTTGA ATTTCTTCGC CCTGTAAACG AACTGCTGGC CAATCTGGCC 
GGTCTTGCAG AGGGGGTTCT GCATGTCGAT CCCGGCAGCG CACTTACACG GTTGCGCAGT
TTCGCTGAGG AGTTGACCAA GACCATCTAC AGCGAGGAGC GTCTGCCACG CTTGCCGCAG
TCCACTTTCT ATGACTTGGT GAAGAGTCCG GTATTCACCG CGTGCACAAG CAGCTCGCTG
GTCCATCAGA TCAATTTCCT GCGTATCCAG GGCAATGAAA CAGCCCATGG CGGCGAAGGA
GATGTCCGCA CGGCCAGATC GGCATTGAAG ACGGCTCACG AGCTGGCCAA GTACATGGCC
GTGAAGTACT ACAGGCTGGC CCATTCGGAC TTGCCGGCTT TTGTCGAGGT CAAGGATCCC
ACCACTGCGC TCAACGCGTT GCAGAAGTCA GTTGTCAGCT ATGAGAAGGA GCTGGCCAAA
CAGCAGGAAG AGCTGCAGCG TGTTTTGGAA CAGTTGGAGC AGAAGCGGGT ACGTGATCTT
GGAAAAGTCG AAACGCCTGC TCCAGCAGAT CAGAGGCAGC GCCAGGAAAG GAGCGAGCAG
GTTGCCGGCA GCCTGCAATG GAGTGAGGCT AAGACCCGCA AGCTCCTGAT CGACGCCATG
CTGCTTCAGG CCGGCTGGGA TGTGGGCAGC CCCGCACAGG TCGGGTTGGA GGTGGAAGTC
GATTTTCCAG GCAACGCCAG CGGCAAGGGC TATGCAGACT ATGTGCTGTG GGGAGACAAC
GGCCAGCCTC TGGCGGTGGT CGAAGCCAAG AAATCAGGCA ACGTCAGCCT TCAGGCGGGG
CGCGAACAGG CCCGCATGTA TGCCGATGGC TTTGAACTCA TGGGCATGCA ACGCCCGGTG
ATCTTCTACA GCAACGGCTA TGAGACCTTC ATCTGGGATG ATAAGCAGTA CAACGGCTAT
CGGCAGGTCT ATGGCTTCTA TGGCAAGGAC AGCTTGGAAT ATCTGATCTA CCAGCGTCAG
TACCGCGTTG CCGAGCTGGA GAAACACAAC CCCGAGTTGA GCATTGCCGA CAGGCCGTAT
CAGATCGAGG CGATCAAGAC CGTTGCCGCG CACTTCCAGA AGCAGCGGCG CAAGGCGCTG
ATCATTCAGG CCACTGGCAC CGGCAAGACC CGCGTTGCAA TTGCTTTGGC CGAACTTCTG
CTGCGCACCG GCTGGGCCAA GCGAGTGCTG TTCCTCTGCG ACCGCAAGGA GCTGCGCGTC
CAGGCCGATG ATGCCTTCAA GCAGAACCTG CCCAGCGAAC CGCGCTGCGT AATCGGCGAA
GCCAATAAGG TTGACCAGAC CGCGCGCATC TACATCGCTA CTTACCCGGG GATGATGAAC
CGTTTCGCGC AGTTGGATGT CGGCTTCTTC GACCTGATCA TTGCTGACGA AAGCCATCGC
AGCATCTATA ACAAGTACCG CGACCTGTTC GATTACTTCG ACGCTCTGCA GGTCGGGCTG
ACCGCCACGC CGGTAAGGTT CATCAGCCGC AATACCTTCG ACATGTTCGA CTGCGAAACC
ACCGACCCGA CCTTCGAGTT CGGCCTGGAT GCAGCCATCA ACAACGATCC GCCGTATCTG
GTGCCGTTCC GCGTACGCGA CCTGACAACT GACTTCCTGC GCGATGGCAT CCACTACAAC
GACCTGAATG ACGAGCAGAA GCGTCAGCTG GAAGAGGATC TGGGTGAGGA AGAGGCCAAG
CGCACGACCA TCGCCGGCAA AGACATCGGT CGCAGGATCT TCAGCGAATC CACCGACCGC
ATCATCCTTG AAAACCTGAT CGACAACGGC ATCAAGGATG ACACCGGCTC CCTGGTGGGC
AAGACGATTA TCTTTGCCCA GCGTCAGGAT CACGCCGAGC ATCTGGAGAA AATCTTCACC
AAGCTTTACC CGCAGTACGG CACGCGTGTG TGCAAGGTCA TCCACAACGA CATTCCGCAT
GTGGAAACCC TGATCAAGGA GTTCAAGAAG CCGGACAACG AATTCCGCAT CGCCATTTCG
GTAGACATGC TCGATACAGG CATTGACGTG CCGGAGGTAG TGAACCTGGT CTTTGCCAAG
CCGGTGAAAT CCTGGGTAAA GTTCTGGCAG ATGATTGGCC GTGGCACGCG TCTTCGCCCG
CATCTGTTCG GTCCGGGCAA GCATAAGGCC GAATTTCTGA TCTTTGATCA CTACGGCAAT
TTCGAATTCT TCGAGCAGGA ATACCAGGAG CCGGAAGACA CCGGCGGGAA CTCGCTGCTG
CAGACTACGT TTGCAGCACG TGTCGAACTT GCCCAAGTGG CGCTGAAGAA GAGCCATGCC
GAGGCCTTTG ACCTGGCCGT GCGCCTGATG CGCGAAGACA TCAATGACCT GCCGGACAGC
AGTGTGGCCG TGCGGCGGCA GCTGCGGTTG GTGCACCAGT TGCAGCAGAC CGACCAGCTA
CGGAATTTCG ATAGCCGCAC CCAGCACCTG GTCAGTGAGG CCATATCGCC ACTGATGTCT
GCCCGAGTGC TGCGCGACAA GCATGCCACC GCGCTGGACA AACTCATGGC AAACATCCAG
CGTTGTCTGG TGGAGCAGGC TAGCTGTTTT GAAGATGGCA AGACCGAACT ATTGGTTGCC
TTGGACAAGC TGGCAGTGAA CATTCAGGCG GTCCGGCAGA AGGATGCAGT GATTGCCGAG
GTACGCAGTG CCGCATTCTG GCAGCAGGCC AGCATTGCGA GCCTAGAACA TGTCCGCAAG
GAATTGCGCG GCATCATGAA GTACCGGCGC GTGGATGTTG GGCCGGGCTA TGACATGCCC
ACGACCCGCA CCGGCGATGG CGGGGTGATC GAAGAGGAGC GCACGACATA CATGGTAGGA
GCCAGCGAGG CGCTGATCTA CCGGCGTCGG CTCAAGCGCA TTCTCGATGA CATGCTGGCC
GCGAATCCTA CCCTGCAGAA GATCCACCAA GGCCAGTCCA TTGCCGAGCA TGAGCTAAAA
ACACTCACTT CTACCATCCT CACCAGCCAT CCGGGCGTCA GCTTGGAGGT GCTCAACGAG
TTCTACGGCC GTACCGCCAA TGAATTGCAC CTCACCGTGC GTGAGATAAT CGGCCTGGAT
GCACACGGCA TCGAAGAGCA CTTCAAGGGC TTTCTGCACG CACATCCTGG GCTTACCGCA
CAACAGGTGC GCTTCATGAA CCTGTTGAAG AACTACATTG CCACCCACGG CAGCATCGTC
ATTGAGACGC TCTACGAGCC GCCGTTTGAC AGCATTTCCC ACGAAGGCAT CGATGGTGTC
TTTACGGCAG CGGATGTGGA TGCGCTCGTG GCTGTACTCA AACCCTTCAT GCGCGGAGAA
GCCGTGTCAG CCAGCCGCTA G
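
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC fraction such as the "GC content" field could be recomputed from it. The helper names are hypothetical, and only the first line of the sequence is included here for brevity; the full sequence would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGAAATCCG TCAATTTTGA ATTTCTTCGC "
    "CCTGTAAACG AACTGCTGGC CAATCTGGCC"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete 3381 bp sequence, the same two calls give the value to compare against the reported GC content.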
 
Protein sequence
MKSVNFEFLR PVNELLANLA GLAEGVLHVD PGSALTRLRS FAEELTKTIY SEERLPRLPQ 
STFYDLVKSP VFTACTSSSL VHQINFLRIQ GNETAHGGEG DVRTARSALK TAHELAKYMA
VKYYRLAHSD LPAFVEVKDP TTALNALQKS VVSYEKELAK QQEELQRVLE QLEQKRVRDL
GKVETPAPAD QRQRQERSEQ VAGSLQWSEA KTRKLLIDAM LLQAGWDVGS PAQVGLEVEV
DFPGNASGKG YADYVLWGDN GQPLAVVEAK KSGNVSLQAG REQARMYADG FELMGMQRPV
IFYSNGYETF IWDDKQYNGY RQVYGFYGKD SLEYLIYQRQ YRVAELEKHN PELSIADRPY
QIEAIKTVAA HFQKQRRKAL IIQATGTGKT RVAIALAELL LRTGWAKRVL FLCDRKELRV
QADDAFKQNL PSEPRCVIGE ANKVDQTARI YIATYPGMMN RFAQLDVGFF DLIIADESHR
SIYNKYRDLF DYFDALQVGL TATPVRFISR NTFDMFDCET TDPTFEFGLD AAINNDPPYL
VPFRVRDLTT DFLRDGIHYN DLNDEQKRQL EEDLGEEEAK RTTIAGKDIG RRIFSESTDR
IILENLIDNG IKDDTGSLVG KTIIFAQRQD HAEHLEKIFT KLYPQYGTRV CKVIHNDIPH
VETLIKEFKK PDNEFRIAIS VDMLDTGIDV PEVVNLVFAK PVKSWVKFWQ MIGRGTRLRP
HLFGPGKHKA EFLIFDHYGN FEFFEQEYQE PEDTGGNSLL QTTFAARVEL AQVALKKSHA
EAFDLAVRLM REDINDLPDS SVAVRRQLRL VHQLQQTDQL RNFDSRTQHL VSEAISPLMS
ARVLRDKHAT ALDKLMANIQ RCLVEQASCF EDGKTELLVA LDKLAVNIQA VRQKDAVIAE
VRSAAFWQQA SIASLEHVRK ELRGIMKYRR VDVGPGYDMP TTRTGDGGVI EEERTTYMVG
ASEALIYRRR LKRILDDMLA ANPTLQKIHQ GQSIAEHELK TLTSTILTSH PGVSLEVLNE
FYGRTANELH LTVREIIGLD AHGIEEHFKG FLHAHPGLTA QQVRFMNLLK NYIATHGSIV
IETLYEPPFD SISHEGIDGV FTAADVDALV AVLKPFMRGE AVSASR