Gene TM1040_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2189 
Symbol 
ID4078180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2298764 
End bp2299795 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID638007511 
Producttranscriptional regulator 
Protein accessionYP_614183 
Protein GI99082029 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.156364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCG CAAAAAACAT CCTCCAGCCG GAGAACCGCC CCCTCAAGAC CGCCGTTCTG 
GTGATGGATG AGTGTAACAC ATTGTCCTTT GCCGCAGCAG TCGACCCCAT GCGCGCCGCC
AATCGGCTGG CGGGGCGGGC GGCTTTTGAC TGGGACTATG TCAGCGCCTC AGCCGAGCCG
CCGATGCTCA CCTCAGGCCT CACCGTGCCG ACCACGCCGC TGGCGCGGCT GGATGACTGT
GACCTCTTGA TTGTCGTTGC CGGCTTTCAA CTGGCACGCC ATGCCACACC AAGCCTTCTG
GCCGGTCTGC GCCGCATTGC GGGCAGTGGC GCCACCATCG CCGGGATCGA CGGTGGCCCC
TGGCTCATGG CCGAAGCCGG GCTGCTGGAC GGCCATGCGG CAACGACCCA TTGGGAAGAT
TTGGAGAATT TCTCGGCCCG CTTCCCCGAT ATTCACTGTC GCACGGATCG CTTTACCGTC
TCTGAGGGGC GCATGACTTC AGGCGGGGCG ACGCCGGCAA TCGAGATGAT GTTGCATATC
ATCGGCGCGC GTCACGGGCA TGGCTTTGCC TCACGAGTGG CAGGTCTCTT CCTTTATGAC
GGCACCGCCT CGCCCCAGCG CAGCCAGAGC CGCCTTGGTC ATCACAAGCA CAACGCCCTC
ACCGCGAAGG CCAACGCCAT AATGGAAGCC GCGCTGGATG ACCCCAAACC GCTGGCGGAA
ATCGCCGAGG TGCTCGGCAC CAGCCCCCGC AGCCTGCAAC AGCAATTCCG CCTGCGCCTC
AACACAACGC CACAGGATCA CTATCTGCAA CTGCGTCTGG CAGAGGCTCG CCGACTGGTC
ACAGACACCA ATCTGCCCTT GATGGAAATC GCGGTGGCGA CGGGGTTCAC GTCGCAATCC
ACCTTTGCGC GCGCCTTTCG CACCGCGCAT GGGCTCTCGG CGCGAGAGCT ACGCCAGAAC
AGCGCCGAGA CCGCGTTTGC AAGCTACCCT GTGCTCAAGG CAACCAGCCA CGCCAGCCAC
AGCACCAACT AG
 
Protein sequence
MKSAKNILQP ENRPLKTAVL VMDECNTLSF AAAVDPMRAA NRLAGRAAFD WDYVSASAEP 
PMLTSGLTVP TTPLARLDDC DLLIVVAGFQ LARHATPSLL AGLRRIAGSG ATIAGIDGGP
WLMAEAGLLD GHAATTHWED LENFSARFPD IHCRTDRFTV SEGRMTSGGA TPAIEMMLHI
IGARHGHGFA SRVAGLFLYD GTASPQRSQS RLGHHKHNAL TAKANAIMEA ALDDPKPLAE
IAEVLGTSPR SLQQQFRLRL NTTPQDHYLQ LRLAEARRLV TDTNLPLMEI AVATGFTSQS
TFARAFRTAH GLSARELRQN SAETAFASYP VLKATSHASH STN