Gene TM1040_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0156 
Symbol 
ID4078823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp171551 
End bp173017 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content61% 
IMG OID638005450 
ProductGntR family transcriptional regulator 
Protein accessionYP_612151 
Protein GI99079997 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATTT CCGTCGAAAC GTTTTTTCTC AATACCGATG CGCAGGGCAC GTTACAGGCC 
CAGATCCAGG AGATGATTGC CGCTGGCATT CTCTCGGGGC GGTTTCGCGC GGGTGAAAAA
CTGCCCTCAT CACGCAAATT GGCGCAGCAT CTCGGGGTGA GCCGCATCAC CGTGACGCTG
GCCTATACGG AACTTGTGGC AAATGACTAT CTGAGCGCTC GCGGACGGTC GGGATATTTC
GTCTCCCAGA CCGCGCCGGT GCCGCCCTCT TTCTCGCCTA TTCAAAAAGA AGCGGACAGC
GTCGATTGGA ACCGCGCCAT CACTCAGGAT TTCACCGGCG GCGACAGCCC CCCCAAACCG
CGTGACTGGC GCAACTATCG CTATCCCTTC ATCTACGGGC AGGCGGATGC CACCCTGTTT
GATCACGCCA ACTGGCGCCT CTGCGCGCTC CGGGCGCTCG GGCAAAAGGA CTTTGCTGCG
ATGACCGGCG ATTACTTTGA TCAGGATGAC CCGCTCTTGA TCGAATATAT CGCCCGCAAC
ACGCTGCCGC GCCGAGGGGT GATTGCCCGG CCCGAGGAAA TCCTGATCAC GCTGGGTGCA
CAAAACGCGC TCTGGACCGT GGTGCAGCTG TTGTTGCAGC CCGGCCGCAA GGCCGCCATT
GAAGACCCGA GCTATTACAC GCTGCGTGAC CAACTCAGTC ATACAGGCTG CGATCTGGAT
GTGATCGCGG TGGATGAGGA CGGGTTGCCG CCAGCACAGA TCGCAACCAA CACCGATGTG
ATTTTCACCA CTCCGAGCCA TCAGAGTCCG ACCACCGCGA CAATGCCAAT GGCGCGCCGC
AAGGCGCTGT TGTCGCGCGC CACTGAAATC GGTGCGGTGG TAGTGGAGGA CGACTATGAA
TTCGAGATGT CCTTTCGCAA TCAGCCCTCG CCTGCGCTCA AATCCATCGA CCGCGATGGG
CGGGTGATCT ATCTGGGCAG CTTCTCCAAA TCGCTCTTTC CGGGGTTGCG GTTGGGGTAT
CTGGTGGGGT CGGAGCCCTT CATCCGACAG GCGCGCGCAC TCAGGGCCAA TGTCTTGCGC
CATCCGCCGG GCCATGTGCA GCGCACCGTT GCCTATTTCC TGTCTCTTGG TCACTACGAC
GCGCAAATCC GGCGCACCGC CAAAGTCCTG CAAGAGCGCC GCGCCGTACT GGAGCGCGCG
GTCGAGGCCG AAGGATTGTG CCCCGCCAAT CGCAGCCTAT ACGGGGGATC CTCTCTCTGG
ATGCAGGCCC CTGATCAGGT CAACATGGGG CAGGTGGGCC TGAAGCTGCG CGAAAAAGGT
GTGTTGATCG AACCCGGCGC GCCCTTTTTT GCGCGAGACA CTCGGCGGCA CAACTTCTAC
CGGCTCGGAT ATTCGTCGAT CGCCTCAGAG CGCATCCCGC AAGGCATCGC ACATGTGGCC
GAGGCGATCC GGGATAGCCA GTCCTGA
 
Protein sequence
MAISVETFFL NTDAQGTLQA QIQEMIAAGI LSGRFRAGEK LPSSRKLAQH LGVSRITVTL 
AYTELVANDY LSARGRSGYF VSQTAPVPPS FSPIQKEADS VDWNRAITQD FTGGDSPPKP
RDWRNYRYPF IYGQADATLF DHANWRLCAL RALGQKDFAA MTGDYFDQDD PLLIEYIARN
TLPRRGVIAR PEEILITLGA QNALWTVVQL LLQPGRKAAI EDPSYYTLRD QLSHTGCDLD
VIAVDEDGLP PAQIATNTDV IFTTPSHQSP TTATMPMARR KALLSRATEI GAVVVEDDYE
FEMSFRNQPS PALKSIDRDG RVIYLGSFSK SLFPGLRLGY LVGSEPFIRQ ARALRANVLR
HPPGHVQRTV AYFLSLGHYD AQIRRTAKVL QERRAVLERA VEAEGLCPAN RSLYGGSSLW
MQAPDQVNMG QVGLKLREKG VLIEPGAPFF ARDTRRHNFY RLGYSSIASE RIPQGIAHVA
EAIRDSQS