Gene TM1040_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1997 
Symbol 
ID4077454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2101188 
End bp2102966 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content59% 
IMG OID638007312 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_613991 
Protein GI99081837 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACATA CAGCCCCTCA CCATAAGAAG ATCTCCACCC TGCACATGCT CGTCATTGGC 
CTGTCTCTCG CCATGACAAT CGGCGCCTGG CTCTATTCGA AACACCAAGT GGACAAGCAG
ATCGAGGCCC GTTTCATCGC CTCGCGCGAT CGCACCGTCG ACCTCATCGT GGATCGCATG
TCACGCTACG AGGATGCCTT GTGGTCCGGC GTCGCGCACG CCCGTTCCCT GAGCGGTGAG
CTGAACGCGC GCGACTGGCA TACGTTTTCC GAAACGCTTC ACATCGAAGA GAAATACCCC
GGTGTGAACG GTATTGGCTT CATTCACAAT GTAGAGCGCA CTGACCTGCC CGCCTACCAC
GAGGCAAGGA GCAGCGAAGG GCGGCCGATC CAAATCTATC CGGAGCATGA TCTCGAGTAC
CTGCTGCCAA TCACCTACAT CGAGCCCGAA GCCGCCAACG CGGCTGCAAT CGGTCTGGAT
GTGGCGCATG AAACCAACCG GCGCACGGGT CTGCTTGCAA GTCGCGACAG CGGCAGCGCG
CGCATCACCG GGCCGATTGT GCTTGTGCAG GACAGCGGCA ATACACCCGG TTTTCTGTTC
TACACCCCGC TCTACGCAGG CCAAATGCCC ACCACCGTCG CCGAGCGCCG GGAGCGGTTC
CTTGGCGTTG TCTATGCGCC CTTTGTGGTG CGCAAGCTTG TGGAGGGCTT GCTTTCGAAG
GATCTGCGGG ATGTACGCTT CAGCCTCAGC GATGGCGAGG CGGTGATCTA TGACGAACAC
GGCCCCGACG AGCCGCTGTA TGACGTGGAC CCGATGTTCA CCGACACGGT CGTGGTCGAC
ATGTACGGGC GAAAGTGGAC GGTCAATCTG CGTACAAACC TGGCCTTTCG CGCACAGAAC
AGCTCTGCAC AGCCCAGCCT GATCCTTGCC GGCGGACTGG TCATCGAAAT CCTGGTGATC
TCGCTGCTGG TGATGCTCTC GCGCTCGACC AAGCAGGCGC ATAGTCTCGC GCGCGACCTC
ACCGCAGAAC TGCGTGCCAA AACCAAACAC CTCGAGAAGG CCAATGCCGA AATCGAACAA
TTTGTCTATG TGGCATCGCA TGACCTGAAG ACCCCCGCGC GCGGCATCGG GTTCCTTGTG
GATGTGATCG AAGAAGAGCT CGAGGACGTG CTCAAATCCG CCCAAAATCG CGGCGAACTG
CAGATGCAGC TCGATATGAT CCGCGACCGC GTGGCGCGTA TGAATGATCT CACCAAGGGC
ATCATGGAAT TTTCGCGCGT CGGCCACTAT GGCTCCGAGG GAGAGCCGCG CTTGCCTGTA
AGCAACCTGA TCGAGGACTG CGTGGCTGAT TTTGAGGTTG ATCCCAAGCA AGTCCATCTG
GCCTCGGATG TGAGTGAAAT CGCCTGCGAC AGCCATAACT TCCGGCGGGT TCTGGAGAAC
CTGATCGGCA ACGCTTTCAA ATACCACCCT CACCCGCGCG CCGCCAAAGT CGAGGTGGCA
ATCAAGGATC GCGGAGATCG CCTGACAGTG AGCGTCAAGG ACGACGGCAA CGGCATTGCG
CCGGAGTTTC ACGACAAGAT CTTTGACGTC TTTCAAACCC TACGCAAGGG AACCGAACCC
GAAAGCACCG GCATCGGTCT GGCGATTGTG AAAAAGGCGA TCCAGCGGCA CGGGTTTGAT
ATCACAGTAA CCTCATCCGA GGGCCATGGC GCAGAATTCT CATTCTGTTG GCCCAAAGAC
CGCAGCCAGA GCAGCATGGA CTTGGAGAAC GTAGCCTGA
 
Protein sequence
MAHTAPHHKK ISTLHMLVIG LSLAMTIGAW LYSKHQVDKQ IEARFIASRD RTVDLIVDRM 
SRYEDALWSG VAHARSLSGE LNARDWHTFS ETLHIEEKYP GVNGIGFIHN VERTDLPAYH
EARSSEGRPI QIYPEHDLEY LLPITYIEPE AANAAAIGLD VAHETNRRTG LLASRDSGSA
RITGPIVLVQ DSGNTPGFLF YTPLYAGQMP TTVAERRERF LGVVYAPFVV RKLVEGLLSK
DLRDVRFSLS DGEAVIYDEH GPDEPLYDVD PMFTDTVVVD MYGRKWTVNL RTNLAFRAQN
SSAQPSLILA GGLVIEILVI SLLVMLSRST KQAHSLARDL TAELRAKTKH LEKANAEIEQ
FVYVASHDLK TPARGIGFLV DVIEEELEDV LKSAQNRGEL QMQLDMIRDR VARMNDLTKG
IMEFSRVGHY GSEGEPRLPV SNLIEDCVAD FEVDPKQVHL ASDVSEIACD SHNFRRVLEN
LIGNAFKYHP HPRAAKVEVA IKDRGDRLTV SVKDDGNGIA PEFHDKIFDV FQTLRKGTEP
ESTGIGLAIV KKAIQRHGFD ITVTSSEGHG AEFSFCWPKD RSQSSMDLEN VA