Gene TM1040_3667 details

Gene Information

Locus tag: TM1040_3667
Symbol:
ID: 4075636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 719557
End bp: 721377
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 61%
IMG OID: 638005187
Product: cyclic nucleotide-binding protein
Protein accession: YP_611896
Protein GI: 99078638
COG category: [T] Signal transduction mechanisms
COG ID: [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains
TIGRFAM ID:
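
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions, though they agree with the values listed here. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 719557
end_bp = 721377

# Inclusive span length in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, minus one for the terminal stop codon (assumption).
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```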


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.773951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTAC CGGCCGCCCT CACCCAGTTT CTCGCATCCG TGCACCCCTA CGACAGCCTG 
CCCGATCCGG TGCTGGCGCA TGTGGCGCAG CAATGTGCGT TACACAATGT AGCTGTGGAT
GAGCGGCTCT TCTCTCTCGG CGATACGGTG TCTTCTCTTT ATATTATCGT CTCGGGCGAG
ATCGAGATCA CCGACGAAGC CGGGGTGCAA CTGTCGATCC TTGGCTCACG CAATTCCTTT
GGCGAACGCG CGCTCCTGCG CGGAGAGCGT GCAAGCCGCA GCGCGAGGGC GACCTCTGCG
AGCGAGGTGA TCGCCCTCCC CGCCGAGGTG TTTCACCAGC TTATCGACCA GCATGAATCG
GTCGCGCGCT TCTTCGACCG CCGCCGACCA CCTCCGCGCA CGGGCAACAG CCTTGCCAGT
CTCACGGTGG AGCAGTTGAT GACGCGCGCG CCTGTCACCT GTACGCCTGA GACGCCCATT
CGCGACGCTG CCGCGCTGAT GCACCGCCAT CACATCTCCT CGATCTGTAT CTGTGATCCG
GATGGGTTCC ACGGGATCGT GACCCTGCGC GATCTGAACA GCAAAGTGAT CGTGGGTGGC
ATAGATCCGC TCGAGCCGAT CTCCGGGATC ATGACGGAGG ATGTGCTGAC GTTGGCGCCA
CAGGCGTTGG TCACGGATGT CCTGCATCTG ATGGTGGAGC GTAACATCCA CCATGTTCCG
ATTGTGAACG AGAGAGGCCT GTTGGGCATC GTCACGCAAA CCGATCTCAC TCGAGCGCAG
GCGCTCTCTT CCGCCGATCT GGTCGGCCGT ATTGCGCGGG CCGAGGACGC CTCCGAGATG
GCGCGGGCCA CGGCGCAGAT CCCGCAACTC CTGGTGCAGC TCGTTGAAGC GGGCAATCGG
CATGAGGTGA TCACGCGTCT GATCACCGAC ATTGCTGACA TCGCTACTCG ACGGCTCCTC
TCTCTGGCAG AAGCACAGCT TGGTCCGCCG CCCGTACCTT ATCTTTGGCT CGCGTGTGGC
TCGCAGGGAC GTCAGGAACA GACCGGGGTT TCCGATCAGG ACAATTGCCT GATCCTGTCT
GATGACTTGA CTGATGCGCA AATGCCCTAC TTTGCGAAGC TGGCCCGATT TGTCAGTGAT
GGGCTTGATC GCTGTGGCTA TTTCTACTGT CCGGGCGACA TGATGGCGAC CAACCCACGC
TGGTGTCAGC CCTTGCGAGT CTGGCGCGGG TATTTCCAGA CTTGGATCGC CAAACCCGAT
CCCGAAGCGC AGATGCTTGC CTCGGTCATG TTCGACCTGC GCCCGATCGG CGGGGACAAG
AGCCTCTTTG ATCACCTTCA GAGCGACACG CTTGAGGCTG CCGCAAAGAA TTCGATCTTT
ACCGCGCATA TGATCTCCAA TTCGCTCAAG CATCAGCCGC CTCTCGGCCT GTTGCGCGGA
TTGGCAACCA TCCGCTCTGG CGATCACCGG GATGAGCTTG ATCTGAAACA CAACGGCGTC
GTGCCGGTGG TGGATCTGGG CCGGATATAT ACGCTGCAGG GGCGGCTGCG GCCGGTGAAC
ACCCGCGCGC GGCTTGAAGC GGCCCTTGCT GCGGGGCTTC TGTCTGCGTC CGGGGGGGCC
GATCTCTTGG ATGCCTATGA TCTCATTGCA TCCATGCGGC TTGAATTGCA GACCAAACAG
ATCAAAGCGG GCCAGGCTGC GGGCAACTAT CTCAACCCAT CTTCGCTGTC GGATTTTGAA
CGCAGCCATT TGCGCAATGC CTTTGTGGTC GTGAAGACCA TGCAATCTGC CGTTGGCTCG
GGCACAGGAA CCTTAGGATG A
 
Protein sequence
MPLPAALTQF LASVHPYDSL PDPVLAHVAQ QCALHNVAVD ERLFSLGDTV SSLYIIVSGE 
IEITDEAGVQ LSILGSRNSF GERALLRGER ASRSARATSA SEVIALPAEV FHQLIDQHES
VARFFDRRRP PPRTGNSLAS LTVEQLMTRA PVTCTPETPI RDAAALMHRH HISSICICDP
DGFHGIVTLR DLNSKVIVGG IDPLEPISGI MTEDVLTLAP QALVTDVLHL MVERNIHHVP
IVNERGLLGI VTQTDLTRAQ ALSSADLVGR IARAEDASEM ARATAQIPQL LVQLVEAGNR
HEVITRLITD IADIATRRLL SLAEAQLGPP PVPYLWLACG SQGRQEQTGV SDQDNCLILS
DDLTDAQMPY FAKLARFVSD GLDRCGYFYC PGDMMATNPR WCQPLRVWRG YFQTWIAKPD
PEAQMLASVM FDLRPIGGDK SLFDHLQSDT LEAAAKNSIF TAHMISNSLK HQPPLGLLRG
LATIRSGDHR DELDLKHNGV VPVVDLGRIY TLQGRLRPVN TRARLEAALA AGLLSASGGA
DLLDAYDLIA SMRLELQTKQ IKAGQAAGNY LNPSSLSDFE RSHLRNAFVV VKTMQSAVGS
GTGTLG
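
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as the standard code, and it ignores the table's alternative start codons because this gene begins with ATG. The function name `translate` and the variables are hypothetical; only the first and last sequence blocks from the record are used as input.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
# Amino acid assignments for table 11 match the standard code (assumption
# stated in the text above); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base spacing used in the record) is removed.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence, copied from the record.
first_block = translate("ATGCCCCTAC CGGCCGCCCT CACCCAGTTT")
last_block = translate("GGCACAGGAA CCTTAGGATG A")
print(first_block, last_block)
```

The final line of the gene sequence starts in frame because the 30 preceding lines hold 60 bases each, a multiple of three.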