Gene TM1040_1169 details

Gene Information

Locus tag: TM1040_1169
Symbol:
ID: 4075955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1257343
End bp: 1259133
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 60%
IMG OID: 638006475
Product: diguanylate cyclase with GAF sensor
Protein accession: YP_613164
Protein GI: 99081010
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
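
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1257343, 1259133)
print(gene_len, protein_len)  # 1791 596
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of this record.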


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0126253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA ATACGATCAC GAATTGGGCC TACGGATTGA CTGTTCTTCT GACCGCGTTG 
TCGGGCGCCG CGTTCATCGC GTCGATCAAC AGCTCGGCCG AAGAACGCAT GGCGGTTGAA
ACACATCTAA TCCTGAACGA GTTGGGCGAA GAGCTTGCGA TCGGAGCGGA GCTGCGCACC
GACGAGGCAC GGCTTTATGT GATGCGCGGC GACCCGGATC ATCTTGAGGC TTTTGAGCGG
ACCGATCAGG CCGAAATGGC GCTGGAAGAG TCCGCCCGCG ACGGCACGCG TCTTGGCACC
ACCCCGGAAG AGACCGCATT TCTGAATCGG ATCATCGCGG ATATAGATGC GCTGCAAGAG
ATGGAGCGTG CGGCAATCAA AGCTTATCAA GCGGGCAACA TTGACCGCGC GCGGGGGCTC
TTGTTCGGGG GCGCGCATTA CCAGGCCCAC CTCAAACTCA TTGACGATGT AGTCCAGTTC
CGCCGGATGG TGCAAAGCCG GACAGACCGT GAGCTCGACA CCGCACATCA ACGCAGCGAC
TGGTTCGAAT TCGTTGCGCA GATCTCCCTC GCGCTCACCG CCACCGTTTT CCTCGGCGTA
CTCTATTTTG TCCTGCGTCG CCGGGTGGCG GTGCCCCTGG CACGCATGGC CGGCATCGTA
AAGCGCCTCG CCAGCCAGGA CTACGAGGTC GAGGTGCCAT TGGATCGCAG ACGGGACGAG
ATTGGTGAGC TCAATCGCGC CGTGCATGTG TTCCGCGAAA ACGGGCTCGA GCGGGAACGT
CTCGACGCCG AGCGCCGCCG TGACATCAAG ATCAAGGATC TGATCCTGAA GCTCATGCAC
CGGGTTCAGG CCTGCCAGTC TCTCGACGAA CTGGGGGATG TCCTTGCGCG GTATGCGCCG
CAGATCTTTC CCGACCTCTC CGGCACGCTC TACCTGCGCG CTGACGGTCA CGAGACCTTG
AACTGTGTGT CCCGATGGCA GACGTCATTC GACGACCCGC TCGAGATCAT CTGCGAGGCC
TGCTGGGCCC TGCGCCGCGG ACGCGCTCAT TATAGCGCCT TGGAATCAAA AGAAGATGTG
ATTTGCAACC ATCTCGCTGA CCCGAACGTT TCGACCCTGT GCATTCCGCT CGCTGCTCAG
GGCGAGACCA TCGGCCTTCT GAGTTTCTCT GGCGCCGAGA CCTCGACAGA GGTTGCGCGC
GAAGATCGGG TCTATCTGGA GCTTATTGCT GAAAACGTCG CGCTTGCGGC CGTGAACCTC
AATCTGCGCA CCCGGCTCTC GCAGCTGGTC GAACATGATC CTCTGACCGG GCTTTTGAAC
CGGCGCTCGC TGGATGTGGC CTTTGGCGAT TTTGTGCAAA ACCCGCCAAC CCTTCCGACC
GCCTGCCTGA TGATCGACAT TGATCACTTC AAGCGGATCA ATGATGAGTT TGGCCATGAG
GCCGGGGATA CGGTGATGCA AAGCTTTGCC GGGTTACTCC GCGAGGTCGT GGGCGACAGC
GGCTCCTGTT ACCGCTTTGG CGGCGAGGAA TTTGTGGTGA TCCTGCCGGA TCACGACGTC
GATGCCGCGC ATGAGATCGC CGAAGGCATT CGCACCCGCA CCGCAAAGAT GGCCATTGCC
CATCGCGGGC AATCTCTTGG GAAGGTCACG ATTTCCATCG GTCTTGCGGT TGCAGCATCC
GCAGGCACCC TCGACACCTT GTTGAGCCGC GCGGATTCAG CGCTGCTGCA GGCCAAGGTG
ACGGGTCGCA ACATCACGTT GCACGAGACA CTTGAGCTCG CCCCTTCCTG A
 
Protein sequence
MKINTITNWA YGLTVLLTAL SGAAFIASIN SSAEERMAVE THLILNELGE ELAIGAELRT 
DEARLYVMRG DPDHLEAFER TDQAEMALEE SARDGTRLGT TPEETAFLNR IIADIDALQE
MERAAIKAYQ AGNIDRARGL LFGGAHYQAH LKLIDDVVQF RRMVQSRTDR ELDTAHQRSD
WFEFVAQISL ALTATVFLGV LYFVLRRRVA VPLARMAGIV KRLASQDYEV EVPLDRRRDE
IGELNRAVHV FRENGLERER LDAERRRDIK IKDLILKLMH RVQACQSLDE LGDVLARYAP
QIFPDLSGTL YLRADGHETL NCVSRWQTSF DDPLEIICEA CWALRRGRAH YSALESKEDV
ICNHLADPNV STLCIPLAAQ GETIGLLSFS GAETSTEVAR EDRVYLELIA ENVALAAVNL
NLRTRLSQLV EHDPLTGLLN RRSLDVAFGD FVQNPPTLPT ACLMIDIDHF KRINDEFGHE
AGDTVMQSFA GLLREVVGDS GSCYRFGGEE FVVILPDHDV DAAHEIAEGI RTRTAKMAIA
HRGQSLGKVT ISIGLAVAAS AGTLDTLLSR ADSALLQAKV TGRNITLHET LELAPS