Gene TM1040_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2043 
Symbol 
ID4077970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2149095 
End bp2150258 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content61% 
IMG OID638007361 
Producthypothetical protein 
Protein accessionYP_614037 
Protein GI99081883 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAT TAAAAATAGA CAATCGAACC GCCCGCAGCC TGTGGCTTCA GTTGCATGGC 
CTGGCGCAGA CGCCCACCGG GCCTCTGGAT GTACTCGGGC TGATCGAACA GCTTGGCTTC
GTGCAACTTG ATACCATTCA GGTGGTGTCG CGCGCGCATC ATCACATCCT CTGGAGCCGC
AATCAGAATT ACCGCGAGCC GATGCTCGAC CCGCTGCTGC GCACGCACCG ACAGGTGTTT
GAGCATTTCA CCCATGATGC CTCGGTGTTG CCGATGGCGT TTCTGCCCAT GTGGCAGCGG
CAGTTTGCGC GCAAAAAGCA TCAGGTGAGC CGCTCCAACT GGTTTGGCAA GCATCTGGAC
CCCGAGCTGA TTTCGGACGT TTTGCGCCGG ATCACGGAGG ACGGCCCGCT CTCCACCAAG
GACTTTGAGA CCAGGCGTGC GGACAGGACG GCCATGTGGA CCCGCCCGCC GCACAAGATG
GTGCTCGACT ATCTCTGGTA TGCGGGCGAA CTGGCCACCT CGCATCGGGA GGGCTTTACC
AAATACTACG ATCTGGCCGA GCGCGTGTAT CCGCAAGACG TGCCTCAGCT GAGCGATCAG
GCTCAGGTGC AGGGGCTCTG TCATGCGGCG CTTGATCGGA TCGGCTTTGG CACTTTGGGG
CAGATCCGCA AATTCTGGGA GGCGTGCGCG GTCGAAGAAG TGGCGCGCTG GGCAGAAGAG
GCGGCGCCCG ATCTGATCGA GGCCGAGGTC GAGGGCGCGG ATGGCAGCTG GTCCAACGTG
CTGGCCTGCA GCGATATCGA AACCCGCATT GCGGCGCTCT CATCGCCCAC CTCGCGGCTG
CGCATTCTGA ACCCGTTCGA CCCGGCCATC CGCGATCGCA AGCGGCTGGC GCGGCTGTTT
GGCTTTGACT ACACAGTTGA AATGTTTGTG CCCGCCGCCA AACGACAGTG GGGGTATTAC
ATCTACCCGC TCTTGGAGGG CAGCCGCCTG GTGGGCCGCG CCGAGATCAA GGGCGATCGC
AGCAAGGGTA CGCTCACGCT CAGCAAACTT TGGATGGAAC ATCCGCATTT GAAGACGCCC
AAACGCCTGC AGAAACTTGA TGCGGAACTT GGCCGCCTTG CACGACTCGC GGGTCTGCAG
AGGGTGATCT GGGCCGTGGA ATAG
 
Protein sequence
MARLKIDNRT ARSLWLQLHG LAQTPTGPLD VLGLIEQLGF VQLDTIQVVS RAHHHILWSR 
NQNYREPMLD PLLRTHRQVF EHFTHDASVL PMAFLPMWQR QFARKKHQVS RSNWFGKHLD
PELISDVLRR ITEDGPLSTK DFETRRADRT AMWTRPPHKM VLDYLWYAGE LATSHREGFT
KYYDLAERVY PQDVPQLSDQ AQVQGLCHAA LDRIGFGTLG QIRKFWEACA VEEVARWAEE
AAPDLIEAEV EGADGSWSNV LACSDIETRI AALSSPTSRL RILNPFDPAI RDRKRLARLF
GFDYTVEMFV PAAKRQWGYY IYPLLEGSRL VGRAEIKGDR SKGTLTLSKL WMEHPHLKTP
KRLQKLDAEL GRLARLAGLQ RVIWAVE