Gene TM1040_3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3336 
Symbol 
ID4075235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp346296 
End bp347858 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID638004844 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_611570 
Protein GI99078312 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.490957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCA AGTGGCGCCC GCGCCTCTGG ACTGTTGTCC TCCTGGTGCT GGCAATTGTG 
CTGTGCCTTC CGATTGCCGG GCTGATCCTG TTTCGGTTCT ATGACAATCA GTTGGTGCAA
CAGACCGAAG AAAGCGTGCT GGCGCAGGCT GCGGTGATGG CGGCCACCTA TGCAGACCTT
TACAGCGAGG CTGCGGGGCT CGCCCCGCCA AAGCCAAAGC CCGTGTCGGC GCAAGACGCC
ATCTTTCCGT CGCTGTCAAT CAACAGCGCC ACCGTCTTGC CACCTCGTCC CGACGCGGCC
AACCCGTCAA CATCAGTGAC TGCAACTTAC CAGCGCCTCG CACCGCAACT CTCTCGGATC
GCAAGTTCAG CGCAAGCGCA AACACTTGCG GGCTATCGCT TCCTTGACAC GCAAGGCAAT
GTGATTGCTG GGACCGCCGA AATTGGCCGC GATCTGGGCC ACGTCAGCGA AGTCCGTGCC
GCCCTTGATG GTCATGTCGT TTCTGTTGCG CGCACGCGGG TGCGAGACAG CAGCCCGCCT
GCGCTTTATA CGCTCAGCCG TGGGGCCAGA GTGCGCGTCT TTGTGGCAAT GCCTGTACAT
GTGAACGAGG CTTTGATCGG GGCGGTCTAT GTCAGCCGTA CACCCAGCCA TATTTTTCGT
TTCCTCTATG GGGAGCGCTT CAATTTGCTA AAGGCCGCTG CCTTTGTTGC ACTGTCTACC
ACGCTGATCG GATATGTGTT CTGGCGCTTC ATCACCCGCC CCATCCGACT GCTGAAAGAG
CGTAGCCAAC TGGCCACCCA AGGCAATCAC GCCTTTGAGG CACCAGATCA TCTTGGCACG
CGCGAGATCG AAGACCTAAG CCTCAGCTTC AAATCGCTGA CCGAGCGGCT GCAGAACAAT
CGCGATGCGC TCAAGACGTA TACGGCCCAT GTCACCCACG AATTGAAATC ACCCCTCACT
GCACTGAAGG GCGCAGCAGA GCTCTTGCGT GACGATGATC TGAGCCAGAG CCAGCGACAT
CGGTTGCTCG ACACGATCGA GAAAGGCGGC ACACGCATAG AAGATCTGCT GGCCCATATG
CGTGCCTTCA GCCTTGCGGA CCAACAAGCG ATGTCCGGGC GCTGTAGTCT TGAGCAGATC
CAAGATCAGA TCACGCAGGC GTTTCCCGCT CTTAGTATCA TGATCGAGAA TGGCTCTCTT
GGTCTACCAT CAGAGGCCAC CACACTCTCC ATCCTGTTGA CGCATCTTTT GCAAAACGCA
CAGCAACATG GCGCAAAAAC CGTCAAACTA CGCACAGCGC ACACAAACGG ATCAATCACC
CTGCGGATCT CGGATGACGG CGCGGGGATC AGTGCGGGCA ATGCCGACAA GATCTTGCAG
CCTTTCTTCA CCACCCGACG CGACAGTGGC GGAACAGGGA TGGGACTCAA TATCGTGAAA
TCGACAGTGG AGGCCCTTGG CGGGCACTTG TACATTCTAC CGCAAGACAC GGGCGCAGGG
TTCGAGTTGG AGTGGCCCAA CGCCACCCCG TCACTGGATC GAACTCAAGC TCCCGGCGCA
TAG
 
Protein sequence
MIRKWRPRLW TVVLLVLAIV LCLPIAGLIL FRFYDNQLVQ QTEESVLAQA AVMAATYADL 
YSEAAGLAPP KPKPVSAQDA IFPSLSINSA TVLPPRPDAA NPSTSVTATY QRLAPQLSRI
ASSAQAQTLA GYRFLDTQGN VIAGTAEIGR DLGHVSEVRA ALDGHVVSVA RTRVRDSSPP
ALYTLSRGAR VRVFVAMPVH VNEALIGAVY VSRTPSHIFR FLYGERFNLL KAAAFVALST
TLIGYVFWRF ITRPIRLLKE RSQLATQGNH AFEAPDHLGT REIEDLSLSF KSLTERLQNN
RDALKTYTAH VTHELKSPLT ALKGAAELLR DDDLSQSQRH RLLDTIEKGG TRIEDLLAHM
RAFSLADQQA MSGRCSLEQI QDQITQAFPA LSIMIENGSL GLPSEATTLS ILLTHLLQNA
QQHGAKTVKL RTAHTNGSIT LRISDDGAGI SAGNADKILQ PFFTTRRDSG GTGMGLNIVK
STVEALGGHL YILPQDTGAG FELEWPNATP SLDRTQAPGA