Gene TM1040_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1036 
SymbolcpdB 
ID4078548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1115617 
End bp1117575 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content61% 
IMG OID638006340 
Productbifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein 
Protein accessionYP_613031 
Protein GI99080877 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.222972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTGGC AGATTAATCG TAGATCCTTT CTCGCAGGCA GCGCGGGCCT TATTGCGCTA 
CATCCGTTTT CGGTGGCTGC GGCCTCGGGG CAGGTTCACC TGCGCCTGAT GGAAACAACG
GATCTGCACG TCCATGTGTT TCCCTATGAC TATTATGGCG ACAAACCCGT GGACACCCTG
GGCCTCGCCC GCACCGCATC GCTGATCAAC GACATCCGCG CCGAGGCCAC AAACTCGCTC
TTGGTGGACA ATGGGGATTT CCTGCAGGGC AACCCGATGG GCGACTACAT CGCCTATGAG
CGTGGCATGA AAGAAGGCGA CCAGCACCCG GTGATCACCG CGATGAACAC CGTCGGCTTT
GATGCCTCGA CCTTGGGCAA CCACGAATTC AACTACGGGA TTTCCTTTTT GATGAAATCC
CTTGCGGGCG CCGGGTTCCC GGTGGTTTGT GCCAATGTCG CCAAGAAAAC CGGTGCCTCA
CCGCGCGAGG ATGAGACCCT GTTGCCGCCC TATGTGATTC TGGAGCGCGA ACTCACCGAT
GGCGCGGGCA AGGCGCACCC GATCAAGATC GGCCTCATCG GATTTGTGCC GCCGCAGGTG
ATGAACTGGG ACCGCAAACA TCTTGAAGGC AACGTGCAGG CGCGCGACAT CGTCGAATGC
GCCCGTGCCT ATGTGCCGGA GATGAAGGAG AAAGGCGCAG ATATCATCAT CGCGCTCTCG
CATTCCGGCA TCGGCTCGGC CGATCACAGT GACGGCATGG AAAATGCCTC GGTGCCGCTG
GCGGCAGTAG AGGGCATTGA CGCCATCATG ACCGGTCACA GCCATCTGGT GTTCCCCTCC
TCGACCTATG CGGATTTTGC AGGCGTTGAC GCCGACAAAG GCACGATCCA CGGCACGCCC
GCCGTCATGG GCGGCTATTG GGGCAGCCAT ATGGGGCTCA TTGACCTGAT GCTGGAGCGC
GATGGCAATG GCTGGCGCGT TGTGGGCCAT GCCTCCGAGG CGCGCCCGAT TTCCAAGCGC
AACGAGGATC GCAGCGTGAC CGCACTGGTC GAGAGCGATC AGAGCGTGCT GGAGGCGGTG
CAGGCCGATC ACGACGCCAC GCTTGCTTAT GTGCGCCGCG CGGTGGGCAA GACTGATGCA
CCGCTGCACA GCTATTTTGC GCTGGTGGCA GATGACCCTT CGGTGCAGAT CGTGTCGATT
GCACAGACCT GGTATATCTC TCAGATGCTG AAGGGCACCG AGTATGAGGC TCTGCCGATC
CTCTCTGCCG CCGCCCCTTT CAAGGCAGGT GGCCGAGGTG GTCCGGAGTA TTACACCGAT
GTTCCGGTGG GGGATGTGGC AATCAAGAAT GTCGCCGATC TCTACCTCTA TCCCAACACC
GTACGCGCCG TGAAAGTCAC CGGCCAGCAG GTGAAGGACT GGCTTGAGCG CTCGGCGGGC
ATGTTCAACC AGATCGAGCC CGGCAAGGCC GACCAGGTGC TGCTCAACCC GTCGTTCCCC
AGCTACAACT TCGATGTGAT CGACGGCGTG ACCTATCAGA TTGACCTGAG CCAGCCGCCG
ATGTTCGCGC CCAAGGGCGA GCTGATCAAC CCGGATAGCA ACCGGATCGT GAACCTTGAG
TTCAACGGAG CCCCCATTGA TCCCGCGCAG GAGTTCATCA TTGCCACCAA TAACTACCGT
GCCAGCGGCG GTGGCAGCTT CCCCGGTGCG ATGGGTGATA CCATCGTCTT TGAAGGGCCA
GACACCAACC GCGATGTGAT CGTGCGCTAT ATCGTCGAAC AGGGCACCAT CAGCCCTAAG
GCGGATGGCA ACTGGAGCTT TGCCCCGCTG CCGGACACCT CGGTTCTGTT TGACACCGGC
CCCAAAGCCG CCGCCTATGC CGAGAGCGTC CCCGGTGTTA CCATCGCACC TGCAGGGGAT
GGGCCGGACG GGTTTGCCCG CTTCAAGATC ACGCTCTGA
 
Protein sequence
MSWQINRRSF LAGSAGLIAL HPFSVAAASG QVHLRLMETT DLHVHVFPYD YYGDKPVDTL 
GLARTASLIN DIRAEATNSL LVDNGDFLQG NPMGDYIAYE RGMKEGDQHP VITAMNTVGF
DASTLGNHEF NYGISFLMKS LAGAGFPVVC ANVAKKTGAS PREDETLLPP YVILERELTD
GAGKAHPIKI GLIGFVPPQV MNWDRKHLEG NVQARDIVEC ARAYVPEMKE KGADIIIALS
HSGIGSADHS DGMENASVPL AAVEGIDAIM TGHSHLVFPS STYADFAGVD ADKGTIHGTP
AVMGGYWGSH MGLIDLMLER DGNGWRVVGH ASEARPISKR NEDRSVTALV ESDQSVLEAV
QADHDATLAY VRRAVGKTDA PLHSYFALVA DDPSVQIVSI AQTWYISQML KGTEYEALPI
LSAAAPFKAG GRGGPEYYTD VPVGDVAIKN VADLYLYPNT VRAVKVTGQQ VKDWLERSAG
MFNQIEPGKA DQVLLNPSFP SYNFDVIDGV TYQIDLSQPP MFAPKGELIN PDSNRIVNLE
FNGAPIDPAQ EFIIATNNYR ASGGGSFPGA MGDTIVFEGP DTNRDVIVRY IVEQGTISPK
ADGNWSFAPL PDTSVLFDTG PKAAAYAESV PGVTIAPAGD GPDGFARFKI TL