Gene TM1040_2695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2695 
Symbol 
ID4077002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2836696 
End bp2837706 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content62% 
IMG OID638008020 
ProductAraC family transcriptional regulator 
Protein accessionYP_614689 
Protein GI99082535 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.4326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.587426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGAA ACCCCGCCTC TCCCGCTGCA TCCGATCACA GCGATGACAC CCCGCGACTG 
GCGGTGGAAA TATTTGTGCA GCCGGGATTT TCCCAGCTCG AGCTTTCGTT GATTCTCGCT
GTGTTTGAGG CGGCAAACGC AATGGAGACC GGCATCTGGT TCTCCGTTCG CATCACCTCT
GACAGCCCGG GCGTGGTGAC AGGCGGCGCG GGCATGATGG TGCGGGCAGA ACCTGCGATT
GGTCTTCAGT ATCTTCAGGA TCTGATGTTT GTGGTGGGGG GGCGCAATTG CAGCGGCGGC
AGCTGGCTCG CACGGGCGCG CGCAATGCAG AAACTGCGTC GCCCGGTGTT CCTGCTGTCG
GATGCAGCAA CCGCCTATAT CCGCAGATGT GCGCCGCTCT CGGGGCCCGC CACCACCCAT
TGGCAAGACC TGCGCGCCCT GCGTGAGACC GGCGAATACC CCACGCTCAC CGATAGCCTC
GTGGCGGAAA ATGCAGGCAT TCTGACCTCG GCCGGGGGGG GATATACGGC GGAAATGGTG
GTGCGTCACC TCTCGCAGAT CCTTGCACCG CAACACTGCG CCGAATTGGC CAGCGTGTTG
ATGATCGAAA CCGCTCGGGG TTACAGCGGA GAACAACCCA AAGGGGCCGC GCGCAACACC
AATCTTCTGG AGGCGCGGCT GGTGCGCGCT ATGGCGATCA TGGAAGAATG CATCGAATAT
CCCCTGTCCA CCGCAGAGGT GGCCGAGCGG GCGGGGATTT CGGTGCGGCA TCTGGAACGC
CTGTTTCTGA CCCATCTCAA CACCACACCG GCCAAACACT ACATGCAGCT GCGCCTGAAG
CTGGCCAACA AGCTCATCAC CGACACCAAC CTGCCGATTG CAGAGATCGC CTTTGCCAGC
GGCTTTGCGT CCTCTACGTC GCTGTCGCGC GCGTATCGGC GTGAATATAA TATGACCCCC
TATCAGGTGC GCGCCCGTGA TCGGGCCGGT GCGGGTCTGC GCGCGGACTA G
 
Protein sequence
MDRNPASPAA SDHSDDTPRL AVEIFVQPGF SQLELSLILA VFEAANAMET GIWFSVRITS 
DSPGVVTGGA GMMVRAEPAI GLQYLQDLMF VVGGRNCSGG SWLARARAMQ KLRRPVFLLS
DAATAYIRRC APLSGPATTH WQDLRALRET GEYPTLTDSL VAENAGILTS AGGGYTAEMV
VRHLSQILAP QHCAELASVL MIETARGYSG EQPKGAARNT NLLEARLVRA MAIMEECIEY
PLSTAEVAER AGISVRHLER LFLTHLNTTP AKHYMQLRLK LANKLITDTN LPIAEIAFAS
GFASSTSLSR AYRREYNMTP YQVRARDRAG AGLRAD