Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3409 |
Symbol | |
ID | 4075583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 430821 |
End bp | 431840 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638004918 |
Product | AraC family transcriptional regulator |
Protein accession | YP_611643 |
Protein GI | 99078385 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.804556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.710727 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATGGA TTTCGACGGT CTTCGTACAC AAAGCACTGG ACGCTGCAGT CTCTCTTGCA AGCGCCGATG AAGATGCACG CGGGAAACTT TTCAAGAGCG TTGGTCTTGA TCCTTCGGCT CCGGTTGATC CCGGCGCAAT GATCTCAGAT GGCGACTTCT TTGGGTTATT GGAACGCATC GCAAAACTTG ATGATCGCGG TCGGTTCGTT CCGGTTCAGA TGGGCGCCTC TATGTGTTGC GACGATTACG GCGCTTTTGG CCTCGCGTTC AAATCCGCAC CCGATCTGCT CAGCTCCTAC GCGCGGGTAG AACGCTTTGG AAAGGTTGTC ACCTCGATAG CTAATTTCCG TGTTAAACAG GTGGGACCCT CCGTTTTTAT GGAAGTTGTT CAAGGAGGGG ACCCGCGTCT TGGTCTTAGG ATGACCAATG AACTGGCTTT GGCCGCTACG ATGTCGCTCA GTCAGGAGGT CAGCAGCGAG GATTTTTCTC CCGTCGCCGT TCACCTCATG ACGGAGCGCC CCGAAGTCGA CGACGTGTAT CACGCGCATT TTCGTTGCCC TGTTCACTTT GGCGCAGACC ACGATGCGCT TGAGGTGGCT ACCACGGCAG CTGTCCGGTC CAATCGTCTT TCCGACAATG GGATGTCCAG GTTTTTTGAG ACACATCTCG ACAACCAGCT TAGCCAAATC AGTGACAGGT CCGAACTGGA GCAGGGCATT CTGGATCAAA TCGGCGAAGC GTTGAGCGAA GGTGTGCCCA CGCTCGCCGA GATCGCCGGG TGTATGGGGA TGAGCAGCAG AACCTTGCAA CGCCGCCTGT CCGCAGAAGG TCTGGCTTAC CAAGACCTGG TTTCAAGCGC GCGGAAATCA CTCTCCGAAC AGCTTTTGAG ACGCACGGAC TACGCTTTGG CAGAGATCGC CTTCCTGACT GGTTTCTCCG ACCAGAGCAC GTTCACACGC GCCTTTAAGC GTTGGCACCA GCAGACACCC GCCAACTACC GACGCGGCAC GCCTGTTTAG
|
Protein sequence | MGWISTVFVH KALDAAVSLA SADEDARGKL FKSVGLDPSA PVDPGAMISD GDFFGLLERI AKLDDRGRFV PVQMGASMCC DDYGAFGLAF KSAPDLLSSY ARVERFGKVV TSIANFRVKQ VGPSVFMEVV QGGDPRLGLR MTNELALAAT MSLSQEVSSE DFSPVAVHLM TERPEVDDVY HAHFRCPVHF GADHDALEVA TTAAVRSNRL SDNGMSRFFE THLDNQLSQI SDRSELEQGI LDQIGEALSE GVPTLAEIAG CMGMSSRTLQ RRLSAEGLAY QDLVSSARKS LSEQLLRRTD YALAEIAFLT GFSDQSTFTR AFKRWHQQTP ANYRRGTPV
|
| |