Gene TM1040_3409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3409 
Symbol 
ID4075583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp430821 
End bp431840 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content56% 
IMG OID638004918 
ProductAraC family transcriptional regulator 
Protein accessionYP_611643 
Protein GI99078385 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.804556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.710727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATGGA TTTCGACGGT CTTCGTACAC AAAGCACTGG ACGCTGCAGT CTCTCTTGCA 
AGCGCCGATG AAGATGCACG CGGGAAACTT TTCAAGAGCG TTGGTCTTGA TCCTTCGGCT
CCGGTTGATC CCGGCGCAAT GATCTCAGAT GGCGACTTCT TTGGGTTATT GGAACGCATC
GCAAAACTTG ATGATCGCGG TCGGTTCGTT CCGGTTCAGA TGGGCGCCTC TATGTGTTGC
GACGATTACG GCGCTTTTGG CCTCGCGTTC AAATCCGCAC CCGATCTGCT CAGCTCCTAC
GCGCGGGTAG AACGCTTTGG AAAGGTTGTC ACCTCGATAG CTAATTTCCG TGTTAAACAG
GTGGGACCCT CCGTTTTTAT GGAAGTTGTT CAAGGAGGGG ACCCGCGTCT TGGTCTTAGG
ATGACCAATG AACTGGCTTT GGCCGCTACG ATGTCGCTCA GTCAGGAGGT CAGCAGCGAG
GATTTTTCTC CCGTCGCCGT TCACCTCATG ACGGAGCGCC CCGAAGTCGA CGACGTGTAT
CACGCGCATT TTCGTTGCCC TGTTCACTTT GGCGCAGACC ACGATGCGCT TGAGGTGGCT
ACCACGGCAG CTGTCCGGTC CAATCGTCTT TCCGACAATG GGATGTCCAG GTTTTTTGAG
ACACATCTCG ACAACCAGCT TAGCCAAATC AGTGACAGGT CCGAACTGGA GCAGGGCATT
CTGGATCAAA TCGGCGAAGC GTTGAGCGAA GGTGTGCCCA CGCTCGCCGA GATCGCCGGG
TGTATGGGGA TGAGCAGCAG AACCTTGCAA CGCCGCCTGT CCGCAGAAGG TCTGGCTTAC
CAAGACCTGG TTTCAAGCGC GCGGAAATCA CTCTCCGAAC AGCTTTTGAG ACGCACGGAC
TACGCTTTGG CAGAGATCGC CTTCCTGACT GGTTTCTCCG ACCAGAGCAC GTTCACACGC
GCCTTTAAGC GTTGGCACCA GCAGACACCC GCCAACTACC GACGCGGCAC GCCTGTTTAG
 
Protein sequence
MGWISTVFVH KALDAAVSLA SADEDARGKL FKSVGLDPSA PVDPGAMISD GDFFGLLERI 
AKLDDRGRFV PVQMGASMCC DDYGAFGLAF KSAPDLLSSY ARVERFGKVV TSIANFRVKQ
VGPSVFMEVV QGGDPRLGLR MTNELALAAT MSLSQEVSSE DFSPVAVHLM TERPEVDDVY
HAHFRCPVHF GADHDALEVA TTAAVRSNRL SDNGMSRFFE THLDNQLSQI SDRSELEQGI
LDQIGEALSE GVPTLAEIAG CMGMSSRTLQ RRLSAEGLAY QDLVSSARKS LSEQLLRRTD
YALAEIAFLT GFSDQSTFTR AFKRWHQQTP ANYRRGTPV