Gene TM1040_0947 details


Gene Information

Locus tag: TM1040_0947
Symbol:
ID: 4077341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1009832
End bp: 1012027
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 61%
IMG OID: 638006250
Product: organic solvent tolerance protein
Protein accession: YP_612942
Protein GI: 99080788
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
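
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 1009832, End bp 1012027.
length = gene_length(1009832, 1012027)
print(length, protein_length(length))  # 2196 731
```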


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0025554
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.755835
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGA CGGTTGAATT GAAGAAACGG CGGCGAGGCG CGCTGCTGCG GACGGTGGTA 
CCCCTGTGGT TGGCGCTTGC GGCAGCGCCC GCCCTGGCGC AAAACGCGAC CTCTGCACAG
GAAACCCCAG CCCCGGCGAT GCTGGTGGCG GACCGGGTAT TTGTATCCCC CGACCGCAAG
CTGGTAGCCG AAGGAAATGT CGAGGCATTT CAGGGCGATA TCCGCCTTCA GGCGCGGCGC
ATCTCTTATG ATCGCCAAAC CGGCATCCTT CAGATGGAGG GGCCGATCCG CATAGACCAG
AGCGGCTCGA TCACGGTACT CGCCAATGCC GCCGAGCTGG ACAGTGAATT GCGCAATGGC
ATCCTGCGTG GGGCGCGGAT GGTCTTTGAT CAGCAGGTGC AGCTCGCAGC CCTGCAGATG
ACCCGCGTCG AGGGGCGCTA TTCGCAGCTC TACAAGACCG CCGTGACCTC TTGCCATGTT
TGCGAAGATG GCAGCCCCCC GCTCTGGCAG ATCCGCGCCG AGCGCATCAC CCATGATCAG
GCTGAGCGCC AGCTCTATCT GGAGGGTGCG CAGCTGATGG TGAAAGATGT TCCGGTCTTT
TACTTTCCCG CACTGCGCCT GCCAGACCCA ACACTCGAGC GGGCTGATGG CTTTCTGGTG
CCCTCTCTCT CCAGTTCCTC GACGCTTGGC GTAGGGGTGA AGATCCCCTA TTTCAAAACA
ATCGGCCCGC ACAGAGACCT CACCATCACG CCCTACCTGT CCGAGAAGAC CAGAACGCTC
GACCTGCGCT ACCGTCAGGC CTTTCGCAAC GGCCAGATCG AGATCACCGG CGCCTTCAGC
CGCGATGACA TTCAGCCTGA TGATGGCCGG GGCTACCTCA GCCTGCGCGG CGCCTTCGAC
ATCCCGCGAG ACTTCAAGCT GAGCTTTGAT CTGAACACTG TCTCGGATGA CGGATACTAC
GCTGACTATG ACATCTCTGA CACCGACCGC ATTCGCTCCG AAATCTCGCT GATCCGGGTG
CGGCGCGACC AGCTGATCGA AGGCAAGATC TCCAACTACA AAACCCTGCG CGACGCCGAG
AACCAGGACT TCATTCCCTC GACCATCGTG ACCGGCACCT TCGAGCAGCG CCTCTTTCCC
AAGGCCGTTG GCGGCGAGCT CCGTCTGCGG CTCAACGCCT CGCAATTCAG ACGCGAATCC
TCGCTTGATG CCACCCTCAC AGATGCCAAC GGGCGCGACA TGAGCCGGGT TTCCGCCGAT
GCCACCTGGC TGCGCAGCTG GATTTTGCCC TGGGGGATTG AATCGGTCTG GACCGCCGGG
ATCGGGATCG ACAGTTTTGC GCTGTCAGAT GATGCAGCCT TTGACAATGA TGCCACCCGC
GTCACGCCCA AGGCGGCGCT GACCCTGCGC CGCCCGATGA CACGCCAGAC CGCATCAGGG
GCGGTTCAGG TGCTCGAGCC CATCGTGCAG CTTGGCTGGA CCCATGTGAA TGGCGACGAC
ACCCCGAATG AGGCCAGCAA TATCTCCGAG TTCGATCAGG GCAACCTGAT GGCGCTCTCG
CGGTTCCCCG AATCCGACGT GCGCGAGGAT GGCGAGACGT TTGTTTACGG CGTGAACTTT
GCGCATTTCG ACACCTCCGG CTGGTTTGCA ACAGGTACCA TTGCGCAGAT CCACCGCGAT
CAAGCGCAAT CGGGCTTTAC CTCCTCCTCG GGGCTGGATG GCCGCAACTC CAACGTCTTG
GTGGCGGGGC AGCTGGGCCT GCGCAACGAC CTGACGCTGA CCGCCCGCAC CCTCTTTGAC
GAGGAGTGGT CGGTGACCAA GGCAGAGTTT CGCGGCGATC TGGAACGTGA CCGGGTCAGT
CTTGCAGGGA GCTATCTTTG GTTGCAGGCC GACGCGAGCG AGAACCGCAC GGAAGAAACC
TCGGAACTCT GGTTCGATGG CACCTATGAC TTGAACCAGA CATGGCGCGC GGGCGCCAAC
ATGCGCTACG ACATTACCGA TGGGCGCGCG ACCCGTGCAG GTCTTGGCCT CACCTACAGC
AATGAATGCG TGACACTCGA CCTATCGCTC AGCCGTCGCT ATACCTCGAC GACAAGTGTT
GAGCCATCAA CGGATTTCGG ATTTACACTG TCGCTCAACG GCTTTTCCGT CAAAAGCGGC
AACACAACAA GCAGGCGATC ATGCAGCAAA ACCTAA
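
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before measuring length or GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the listing, so its GC value is for that line and not for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCGAAGA CGGTTGAATT GAAGAAACGG CGGCGAGGCG CGCTGCTGCG GACGGTGGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))  # 60 0.617
```

Applying the same two functions to the full listing should give a length equal to the Gene Length field.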
 
Protein sequence
MPKTVELKKR RRGALLRTVV PLWLALAAAP ALAQNATSAQ ETPAPAMLVA DRVFVSPDRK 
LVAEGNVEAF QGDIRLQARR ISYDRQTGIL QMEGPIRIDQ SGSITVLANA AELDSELRNG
ILRGARMVFD QQVQLAALQM TRVEGRYSQL YKTAVTSCHV CEDGSPPLWQ IRAERITHDQ
AERQLYLEGA QLMVKDVPVF YFPALRLPDP TLERADGFLV PSLSSSSTLG VGVKIPYFKT
IGPHRDLTIT PYLSEKTRTL DLRYRQAFRN GQIEITGAFS RDDIQPDDGR GYLSLRGAFD
IPRDFKLSFD LNTVSDDGYY ADYDISDTDR IRSEISLIRV RRDQLIEGKI SNYKTLRDAE
NQDFIPSTIV TGTFEQRLFP KAVGGELRLR LNASQFRRES SLDATLTDAN GRDMSRVSAD
ATWLRSWILP WGIESVWTAG IGIDSFALSD DAAFDNDATR VTPKAALTLR RPMTRQTASG
AVQVLEPIVQ LGWTHVNGDD TPNEASNISE FDQGNLMALS RFPESDVRED GETFVYGVNF
AHFDTSGWFA TGTIAQIHRD QAQSGFTSSS GLDGRNSNVL VAGQLGLRND LTLTARTLFD
EEWSVTKAEF RGDLERDRVS LAGSYLWLQA DASENRTEET SELWFDGTYD LNQTWRAGAN
MRYDITDGRA TRAGLGLTYS NECVTLDLSL SRRYTSTTSV EPSTDFGFTL SLNGFSVKSG
NTTSRRSCSK T
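
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 and the last 36 bases of the gene. It is an assumption-laden illustration, not the database's own method: it builds the codon table from the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence in the record.
print(translate("ATGCCGAAGA CGGTTGAATT GAAGAAACGG"))          # MPKTVELKKR
print(translate("AACACAACAA GCAGGCGATC ATGCAGCAAA ACCTAA"))   # NTTSRRSCSKT
```

The last 36 bases are in frame because the preceding 2160 bases are a multiple of 3, and the final TAA codon is read as a stop.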