Gene TM1040_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2229 
Symbol 
ID4077296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2342347 
End bp2344095 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content58% 
IMG OID638007551 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_614223 
Protein GI99082069 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAGG CGCGTCAGTT GAACCGGATG TCGCAGCGCG TGACGGCCGC GATGGAGGCC 
TCCAAAGATG GCTTTGCGAT CTTCGACGAA GAAAACCGTC TCGTGACCTG CAACGATCGT
TATCGAGAAT TGAGCCATGT TCACCCCGAC AAAGTTGTGC CCGGCATGAC CCTCACGGAG
ATCCTCGCTG ACGCGGTAGA ATCCGGGCAC TATTTGCTGC GTGGGGTTTC GGCCAAGGAT
TACATCCAGA ATTACAACCA ACGCATCGTC AACCATGGCT TTTGTACAGA AACCCAGCTG
GAACTGGCTG GCGATCGCCA TATCGTGTCA CGGGTGAATG AAACCGATTT TGGCGACAAG
GTGGTGACAC GCATCTGCAT TACAGATCTG GTGCGGAATG AACGGGAACA GCGCCGCGCC
GCGCAGGCGC TCAGAGAGAC GAAAGACCGG TTGGAGTTGC AATCCCTCAG CGATCCACTG
ACGCATTTGC CCAACCGGCG ACACCTGGAT CGCGAGCTGA GCCGAAGATT GCAGACAGAA
CCCGTTGCTC TGGTGCGGAT CGACCTGGAC CGGTTCAAGA AAATCAACGA CATCCTCGGC
CACGAGGCCG GGGACTATGT GCTTTGCCAC GTGGCGGATG TGCTTCGGGC GCACACGAAG
GCGGGGGATG TTCCCGCGCG GGTGGGAGGC GACGAATTTG TCGTTCTCTG TCGATCCGGG
ACAACGCTCG AGCAGGCGCA AACCCTGGCG ACTCGGGTGC TTCGGGCGGT GCTAGAGCCT
GTGGTCTGGG GCAACAAGCG CTGCAATTTC GGGGCAAGCT TTGGGGTCGC GCATGGCGTT
CCCGGTGAAA TCACGGCCAG CGAGTTGCTG AGCAATGCGG ATGCGGCGCT CTACCGGGCG
AAGGCTTCGG GGCGCGGCGC GGTAGAAGTT TTCACATCCG AGATGCGGGC AGAGGTACTG
GAAGAGCGCG CGCTCTCGGA CCGACTGCCC TATGCGATCG AGGCAGGCGA AATAACGCCC
TATTATCACA CGCAGCATGA TGCTCTGACC TGGCATCTAG CCGGGGTTGA GGTGCTGGCG
CGCTGGGAGC ATCCCGACCG CGGTGTACTG CCCCCCGACA GGTTTCTGGG CATTGCCAAG
CAGCTTGGGC TTGAGGCGGA GTTGGATGGC TGCATCTTTG ACAGGGCGGT GGCGGATATG
CGCAGCCTGC GCGCCGAGGG CATTCTGGTG CGGCGGGTGG CATTCAACGT AAGCGCGGCG
CGCATCATGC AGCCGAGCTT TATCGAAACG GTCCGCGCAC GCATCCCCGA TCAGCGCGAG
AGTTACGCGT TTGAAATTCT GGAATCGATC TCCTGCGAGG ATGAGGGCGA GGCGCTGATC
TTCTGCATCG ATGCGCTGAA GGATCTTGGT TTTCAGATTG ATGTCGACGA TTTCGGATCC
GGCCATGCGT CGATCAATGG CGTCCTCAAC ATCGAGCCGG ACGCCCTGAA AATCGACAGA
AATATCATCT TCCCGCTTGG AAAAAGCGAG CGCGCGGAAC GAATGGTGGC GTCAGTTGTG
GACCTTGCAC ACACTTTGGA TGTGAAGATC ATCGCCGAGG GGGTGGATAC CATTGAAAAA
GCAAAAACAC TCGGCGCCAT CGGTTGCGAT ATATTACAGG GGTTTTATTT CTCCAAACCA
CGTAGCTTTG CAGAATTGAA AAAGAACCTC GAAAGGTTGG ATCTCGGCGA TCAGGTGTCC
AACCTGTGA
 
Protein sequence
MEQARQLNRM SQRVTAAMEA SKDGFAIFDE ENRLVTCNDR YRELSHVHPD KVVPGMTLTE 
ILADAVESGH YLLRGVSAKD YIQNYNQRIV NHGFCTETQL ELAGDRHIVS RVNETDFGDK
VVTRICITDL VRNEREQRRA AQALRETKDR LELQSLSDPL THLPNRRHLD RELSRRLQTE
PVALVRIDLD RFKKINDILG HEAGDYVLCH VADVLRAHTK AGDVPARVGG DEFVVLCRSG
TTLEQAQTLA TRVLRAVLEP VVWGNKRCNF GASFGVAHGV PGEITASELL SNADAALYRA
KASGRGAVEV FTSEMRAEVL EERALSDRLP YAIEAGEITP YYHTQHDALT WHLAGVEVLA
RWEHPDRGVL PPDRFLGIAK QLGLEAELDG CIFDRAVADM RSLRAEGILV RRVAFNVSAA
RIMQPSFIET VRARIPDQRE SYAFEILESI SCEDEGEALI FCIDALKDLG FQIDVDDFGS
GHASINGVLN IEPDALKIDR NIIFPLGKSE RAERMVASVV DLAHTLDVKI IAEGVDTIEK
AKTLGAIGCD ILQGFYFSKP RSFAELKKNL ERLDLGDQVS NL