Gene TM1040_2412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2412 
Symbol 
ID4076738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2551834 
End bp2552994 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content58% 
IMG OID638007734 
ProductAraC family transcriptional regulator 
Protein accessionYP_614406 
Protein GI99082252 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.462091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTATG CCTATCTGCG AGGACTTCGG AATGCACGGC TACAGAACAA CAACGACGAA 
ACATCGACAG TTTTTCGTTC CTGTTCCGTG CCAACGTTCC AAAAGATGCG GAGCGCGTAC
GGGAGGCGAG TGGTGAAGGA TGTTTTTGCC CCCAATGGTG GGGTCTTTAT CAACCGGCTG
GTCGAATACG TAACCGGTCA GGGGCATGAC TTCGAGAACA TGCTAGCCCA AAAGCTCGCA
ATCTCTGAAC CTCTGAGCTC GAGCGCTCAG GTCCTCCCTG TTTATGAACA CTCCTTGGTC
TTCGGGCTGG CGGAGAGGGT GTGCAACGAC ACGTCGATTG GCTACGCTTT GGCCTATCAA
TGCACGCTGC GTGATGCAGG ACTTGTGGGC TATGCGGTAA GCGCCTCGGA GACTGCTGGT
GAGGCGCTGT ACACTCTAAG CCGCCTCAGC AACGTCTTTG AGGTCGTCTC TGCCTCCGCC
AGTGGCGGGC TTGTGGATCT GCGCTGGGAC TTTGGATCGC AGGGTAAACT GGATCTGCGT
CACTGGAGCG AGTTCATCGC TACCCTCTTG GTTCGTAGCT TAAAAACCCT CTGCACCGGT
GCGGTTGCGC CGGTAGGGGT TGAGTTCACC CATACTGCGC CATCCTCCTC AGAGCAGGCG
GTCCTGGCCT TTGGTGTCAA ACCAACCTAC CGCGGGCGCC TGAATCGTCT GACCTTTCGC
GAACAGGATC TGCGCCAGCC CTTGCGTAGT GCAGATGCAG GCTTGCTGAG GCTCCTGCTG
GAGCATGCGG AGCTGTTGCG CCGCCGCCCG GACAGGAACA GCAATGATCT GTCGATCACC
GTTGAGCGTC TGATTATGGA CGGCATGTCA GAAGGAGATG CCAGCCTGGC GCAAGTGGCA
GAGTCGCTGG ATATGAGCCA GCGCACGCTG TCCCGGAAGC TTGCCAGCGA GGGGACGAGC
TTCTTTGCGA TCCTGGAGGG GGTGCGAAAA TCGCTGGCCC TGCGCTACCT CCAGCAAAAC
GAGAAATCCC TCTCAGAGAT CTCCTTTGCT TTGGGCTACT CCAGTCTGAG CAGTTTCAAT
GACGCCTTCA GGCGTTGGTA CGACCAAAGT CCCGGAAGCT ATCGGAGCGA TGCGCTCAAA
GAGGCTGCTG TGAAGTCCTG A
 
Protein sequence
MGYAYLRGLR NARLQNNNDE TSTVFRSCSV PTFQKMRSAY GRRVVKDVFA PNGGVFINRL 
VEYVTGQGHD FENMLAQKLA ISEPLSSSAQ VLPVYEHSLV FGLAERVCND TSIGYALAYQ
CTLRDAGLVG YAVSASETAG EALYTLSRLS NVFEVVSASA SGGLVDLRWD FGSQGKLDLR
HWSEFIATLL VRSLKTLCTG AVAPVGVEFT HTAPSSSEQA VLAFGVKPTY RGRLNRLTFR
EQDLRQPLRS ADAGLLRLLL EHAELLRRRP DRNSNDLSIT VERLIMDGMS EGDASLAQVA
ESLDMSQRTL SRKLASEGTS FFAILEGVRK SLALRYLQQN EKSLSEISFA LGYSSLSSFN
DAFRRWYDQS PGSYRSDALK EAAVKS