Gene TM1040_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0137 
Symbol 
ID4078742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp151103 
End bp152278 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID638005431 
Producthistidine kinase 
Protein accessionYP_612132 
Protein GI99079978 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00141334 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.983928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATGT CGATGCTCGA GAGTGAGGTC ACAGAGGCCT GGGCCGAACA ATACGAAACC 
GCTTTTGTGA CATATTGCAA CACCGGCTCC GAGGAGGCTT TGATCGAAGC ATATGCCTTT
GCCCGGGAAG GTCTTGCTGC CAGCATGACC CTCGCAGTTT TTGCGTGTCT CCATCATGCA
GCATTTTTCA AGCTTCTCGA GCAATCCGGT CCGGGGAACA ATCTTTATGA ACGCGCATTG
GACTTTTATC TCGAAGGGGT CTCAGTGTTC GATATGGCCA TCCATGGGTA CCAAAACAAT
GTTGCGCGCC TCAAAGAAGA AGTCACAGAG CGACGGCGCA TAGAGGAAGA TCTGCGCGCC
GCCACCTTTG AGCTGTCTCG ACAACGCAGC GACCTCGACA TTCAAGTTCG CCAACGCACC
GCAGAGTTGC GAGAGAGGGC GGAGGAGCTG GAGCAGTCCA ATCGCTTGCT CCTCCAGACC
AACAAGGAGA CATCAGAGTT TTCCTACGCG CTGTCTCATG ACCTCAAATC TCCGATCAAC
ACGATAGGCA TGCTTCTTGA TGCTATCCGA GAGGAACTGC CACCAGACAG CGAATCCGAA
TGCGCAGACC TCGTATCGGA TGCATCGCTG ACAGCAGAGC GCATGAAGCG TCTGATCGAT
GATGTATTGC AATACTCGCA AGTCGTTGGA AACACGCTTG AACGGGAATT GGTCGACATG
AGCCAACTGT GCCAAGACGC CCTGTCTGAC ATGCGCCACG CGATCGACGA AGCACAAGCC
GATATCTCAT GCTCCCATCT CCCGGTCGTG CGCGGAAGCG CGTTTCAACT GGCTATCATG
CTGCGAAATT TCCTATCAAA CGCACTGACC TACCGCGATG CCTCTCGCCC CTTAAGGGTC
GAGATCTCAG CGGGACCCAC CGCAGAGACT GGCCGGGTTT TGATTTCGAT TGCCGACAAC
GGCATCGGCA TGCCACCAGA CTGCCATGCC CGGATTTTTA ACCTGTTCAC AAGGCTTCAC
ACTTACAGCG ATTTTGAAGG ATCCGGCATT GGGTTAGCAT TATGTAAACG CGTGGCAAAT
AATCATAACA GTGACATCGA AGTCGAATCC GTCGAGGGGC AAGGGACTAT ATTCAGCTTC
TCTATCGAAA GCGAGGAGGT TGATACATGG CATTAA
 
Protein sequence
MTMSMLESEV TEAWAEQYET AFVTYCNTGS EEALIEAYAF AREGLAASMT LAVFACLHHA 
AFFKLLEQSG PGNNLYERAL DFYLEGVSVF DMAIHGYQNN VARLKEEVTE RRRIEEDLRA
ATFELSRQRS DLDIQVRQRT AELRERAEEL EQSNRLLLQT NKETSEFSYA LSHDLKSPIN
TIGMLLDAIR EELPPDSESE CADLVSDASL TAERMKRLID DVLQYSQVVG NTLERELVDM
SQLCQDALSD MRHAIDEAQA DISCSHLPVV RGSAFQLAIM LRNFLSNALT YRDASRPLRV
EISAGPTAET GRVLISIADN GIGMPPDCHA RIFNLFTRLH TYSDFEGSGI GLALCKRVAN
NHNSDIEVES VEGQGTIFSF SIESEEVDTW H