Gene TM1040_1386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1386 
Symbol 
ID4075879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1480196 
End bp1481611 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content61% 
IMG OID638006696 
ProductGntR family transcriptional regulator 
Protein accessionYP_613381 
Protein GI99081227 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTACAA TTTGGCCCTC CACCCTGACC GGGTCCCTGA CCGAAAAGCC GGGGCCCAAG 
TACAAGCGCG TGGCTGATAC GATACGATTG GCGGTGGAGA GTGGTACATT GCAGGTGGGC
GCAAAGCTGC CGCCTGTCCG CGAGCTCGCC TATCAGCTCA GTATCACGCC TGGCACCGTC
GCGCGGGCCT ACACGATCCT GACAGACGAG GGCATCTTGC AGGCCGAGGT CGGACGCGGA
ACCTTTGTGG CCGAACGCGA AACGGCTGTG ATGGATGACG TCTGGTCGCG GCAGTTGCAT
CTTCTTGAAC GCAGCAACCC CAATCACGTC TCTCTGTTCA GCCCGCGTCT TGTCGACGTG
GGTCAAGTCA AGCTGATCCG CGAAGCGCTC CGAAAAGCCG CTGATTGCGA CCGCTTGGCG
TTGTTGAATT ATCCGACGCG CGACGCCTAT CAGCCGGTGC GCCAAGCGGT TGTCGACTGG
CTATCCGATA CCCCGCTCGG GCCACTGAGT GAGCGCGATG TGGTCTTGAC CCATGGGGGC
CAGAATGCCG TCATGACGGT GATGCAAGCG ATCCTGAAAG AGAGCAATCC CACCATCTTG
GTCGAGGACC TCTCTTATGC CGGGTTCAGG CGCGCTGCAG AGATCCTGCG TGCCAAAGTG
GTCGGTGTTG CGATCGACGA AAACGGCATT GTGCCAGAGG CGTTGGAGGC TGCGATCAAG
AAATCGGGTG CATCTGTACT TTGTACCAGT CCCGAAGTGC ACAATCCAAC CGGCGCTTTT
ACCCCGCTTG AACGGCGCAA AGAGATCCTC GCCATATTGA GACGACACAA TGTACAATTG
ATCGAGGACG ATTGTTATCG CATGGGGGAG GCCCGCGCGC CGACCTATCG CGCGCTCGCG
CCGGAGCTCA GCTGGCATGT GACATCGATT TCCAAATCGC TCACCCCGGC GCTGCGGGTG
GGGTTTGCAA TCGCCCCCGT CGGCAAATCC GCAGATCTGC GCCGCGTTGC GGAATACGGT
TTCTTTGGGC TTGCGCAGCC GCTCGCCGAG GTGACACGCC TGCTATTGTC GGACCCGCGC
AGCAAGAAGC TGGTCCAGAC CGTGCGCGAC GAGATGGCGG AATATGTCCG TGTCGCCGTA
AACGCCCTCG GAGCCTTTGA ACTCACCTGG GACTCGCAGG TGCCGTTCCT TTGGCTGCGG
CTGCCTTCGG GCTGGCGAGC GGGTGCGTTT ACACGTGCGG CCGAAGGGCA GGGGGTGCAA
CTGCGCTCTG CAGATGAGTT TGCCCTGAGG GACGGGCGTG CGCCCAATGC AGTGCGCATC
GCCGTCAATG GGCATGTCAC GCTGGCGCGA TTTGAGGATG CCATGCTGCG TTTGCGCATG
CTATTGGACA ACCCACCAGA GCAAATCAGC GTCTGA
 
Protein sequence
MGTIWPSTLT GSLTEKPGPK YKRVADTIRL AVESGTLQVG AKLPPVRELA YQLSITPGTV 
ARAYTILTDE GILQAEVGRG TFVAERETAV MDDVWSRQLH LLERSNPNHV SLFSPRLVDV
GQVKLIREAL RKAADCDRLA LLNYPTRDAY QPVRQAVVDW LSDTPLGPLS ERDVVLTHGG
QNAVMTVMQA ILKESNPTIL VEDLSYAGFR RAAEILRAKV VGVAIDENGI VPEALEAAIK
KSGASVLCTS PEVHNPTGAF TPLERRKEIL AILRRHNVQL IEDDCYRMGE ARAPTYRALA
PELSWHVTSI SKSLTPALRV GFAIAPVGKS ADLRRVAEYG FFGLAQPLAE VTRLLLSDPR
SKKLVQTVRD EMAEYVRVAV NALGAFELTW DSQVPFLWLR LPSGWRAGAF TRAAEGQGVQ
LRSADEFALR DGRAPNAVRI AVNGHVTLAR FEDAMLRLRM LLDNPPEQIS V