Gene TM1040_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1854 
Symbol 
ID4077879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1954506 
End bp1956245 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content62% 
IMG OID638007170 
ProductGAF sensor hybrid histidine kinase 
Protein accessionYP_613849 
Protein GI99081695 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.46631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAG CCGCCAAATT GACACCTGAT CCCTATGATG TGCAGCTGCG CGAACTTGGC 
CTCGTTCAAC GACAACCCGA CTCCGACATC GACAATCTGA CCCGGCTCGC TGCCCAAGTG
CTGGATGCGC CAACGGCGCT GGTCTCTGTG GTGCAGCGCA GCCTTGATCG TCAATTCTTC
AAAAGCTGCG TGGGCCTGCC CGAGCTGTCG AGCGTGCTGC GTCAGACACC GCTCTCTCAT
TCGTTTTGCC AATTTGTGCA GGACGCCAAC CAGCCTCTGA TCGTGCCCGA TAGCCGCAAT
GACGCCCAGC TGCGTGACAA TGCGGCGGTG ATCGACCTCA AGATGGTTGC CTATCTGGGG
GTGCCGATTC ATTTGCCGGA TGGAACGCCC ATTGGTGCGC TCTGCGTCAT TGATCGCGTG
CCGCGTCAGT GGAGCTCGGA CAATCTCAAG ACCCTGCAAC ACCTTGCGGT GGCCGTCGAC
AATGTCATCG CCCTGAAACA TGCGCGCGAA CTCGCCGCCG AGGCAGAGCG CACTGCCAAA
CGCGAAGCCG AGGCGCGTAA GACCTACCTT GCCCACATGA GCCACGAGAT CCGCACGCCG
CTCAATGGCA TCATCGGCTC GGTCGATCTG TTGCTGCGCG AAGCCTCCCG CGTCAATCTG
GACCCGCGCG AGCAGAACGA TCTGTTGCGC ACCATCAACC GCTCTGCGCG CAACCTTCAG
CATCTGCTGA ATGATGCGCT CGACATTGCC AAGATCGATG CCGGCAAGCT CGAACTGGCC
CCAGCGGCCT TTGACCTGCA TGAAACGGTT GATGACGTGA TGAAGCTGTT CTCGGCCCAG
GCCTGCGAGA AAGGCGTTGA GCTGAACCAC AGCTTTCGCG GCATCGAGGC GCAGGAACAG
CGCTATGGCG ACCGGTTCCG GCTCGCGCAG ATCCTTGGCA ACCTTTTGAG CAACGCCATC
AAATTCACCG ATGACGGAAG CGTATCGCTG CAGATCAATG GCACCCCGGA TGCGCTGCAT
CTCACGCTGC GCGACAGCGG CTGCGGGATT GCGCCGGACC GTCTCGACAA ACTGTTCCTG
CCCTACACCC AAGCCAGTGC CGATGTGGCG CAGACCAAGG GAGGCACGGG CCTTGGTCTG
ACGATCGTGC ACCAGATGGT CGAGCTGATG CAGGGGCGGA TCCGCGCCGA GAGCGTGCCC
GGTCGGGGCA CCGTCTTTCA CCTCTATCTG CCGTTGCCGT TGACCACATC TGCCCCCGAA
GCCGAAAAAG TGCATGACCC GGCAAGGGAT GGGATCGGCG ACACCGCGCC TGTTGCATCT
CCCCCGCCTG CCCCATCCAA TGCGCCCTTG TCCGGCAAAC GGGTGCTCAT CGCCGATGAC
AGCCCGGCCA ACCGGCTGGT GCTGCAAAAG ATGCTCGAGA ACCTCGGCGC GACGGTGGAC
AAGGCCTTTG ACGGTGGCGA TGCCTTCGGT CGGGCGGTGG CGCGGGCCTA TGATGTGTTG
CTGCTGGACA TCCACATGCC GGGCCAGACC GGCACCGAGG TGGTCAAGAA GCTGCGCGCG
GATCCGCGCC ATCAGGCGCA TGACGCGCTG TGTATCGCGG TGACAGGCAG CACCGAAAAG
CAAGAGGTCG AGCATTACCT GGACTCGGGT TTTGATGCCT GGATCGGCAA ACCGCTGCGC
CAATCCGACC TGATGCGGGT GCTGTCGCCG CTGCTTCTCG AACCCCCGTC CCGCGCGTGA
 
Protein sequence
MTRAAKLTPD PYDVQLRELG LVQRQPDSDI DNLTRLAAQV LDAPTALVSV VQRSLDRQFF 
KSCVGLPELS SVLRQTPLSH SFCQFVQDAN QPLIVPDSRN DAQLRDNAAV IDLKMVAYLG
VPIHLPDGTP IGALCVIDRV PRQWSSDNLK TLQHLAVAVD NVIALKHARE LAAEAERTAK
REAEARKTYL AHMSHEIRTP LNGIIGSVDL LLREASRVNL DPREQNDLLR TINRSARNLQ
HLLNDALDIA KIDAGKLELA PAAFDLHETV DDVMKLFSAQ ACEKGVELNH SFRGIEAQEQ
RYGDRFRLAQ ILGNLLSNAI KFTDDGSVSL QINGTPDALH LTLRDSGCGI APDRLDKLFL
PYTQASADVA QTKGGTGLGL TIVHQMVELM QGRIRAESVP GRGTVFHLYL PLPLTTSAPE
AEKVHDPARD GIGDTAPVAS PPPAPSNAPL SGKRVLIADD SPANRLVLQK MLENLGATVD
KAFDGGDAFG RAVARAYDVL LLDIHMPGQT GTEVVKKLRA DPRHQAHDAL CIAVTGSTEK
QEVEHYLDSG FDAWIGKPLR QSDLMRVLSP LLLEPPSRA