Gene Ndas_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2953 
Symbol 
ID9246806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3527945 
End bp3529126 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content74% 
IMG OID 
Productdiguanylate phosphodiesterase with CBS domains 
Protein accessionYP_003680869 
Protein GI297561895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACTG TGACTGCGCT GCATCCCCAA CACACTCCCA CGGCGCATCC GGACGGGCCG 
GGTGCGGTCG CCCGCCGCGC CCCGGGCGCT CCCGGCTACC AGCCCGTCGT CGACCTGGAC
TCGGGGACCG TCGTGGCCGT CGAGGTCACC GCCTCGCCGC CGCCCTCCGT GCGGGGCTCC
TACGCGGAGG TGACCCCCCA GGAGGAGGCG GCCGTGGCCG AGTGGCTCCT GGCCCTGGTC
CGCGAGACCG CCGTGTCCGA GAGCCTGCTG CCGATGGTCC TGCCGCTGCC CGCGCGGATC
CTGGCCGGGG AGGGGTTCGC CCCCCTGGTG GAGAGCCTGT TGCGCCGTGC GGGCCGCCGA
CCGCGCGACA TCACCTTCAT GCTCAGCCCG GACATGGCGG AGCTGGCGCG CAGGACCGTG
GTGTCCGGCG TCTCCCGGCT GCGGGCGGCG GGTTTCCGGT GCGGGTTCGG GACGGCGATG
GTGCGCCCGG ACCTGGTGGT GGAGGCAGCG CCGTTCCTGA TGCGGATCGA CCCGGCGATC
GTCTCCGGTG TGGCCGGGGA CCAGCGGCAC GCCACCGTGG TGGAGGGCCT GGCTCGGATC
GGCCGGGGCA GCGGCGTGTA CGCGCTGGCC TCGGGCGTGG GCAGCGTCGA GGACGTGGTG
CGGCTGCGCC GCTGCGGGAT ACGTGTGGGC ACGGGGCCGT TCTTCGCCGA CAGCGCGTGG
CGGCCCGGCG AGCGGGTGAC CCCGGTGCCC GAGCCGAGCG CCGGACAGGG CGGGGAGGAG
GACTCCGGCC CCCGGGTCAC CGAGTTCATG GTGCCGCCCG TGGGACTGGA CTCCGACGCC
ACCGCGGAGC AGGTGCTGGA GTCCTTCACC GGGGACCCGG CGCTGAACAG CGTCATCCTC
ATCGACCACC GGGACCGCCC GGTCGGGGTG GTGGACCGGA CGCGGTTCCT GCTGTCGGTG
ACGGGCCGCT ACGGGCACGC CCTGCACGCC AAGCGTCCGG CGCTGCGGCT GGCCGAGTCC
CCGCGCACGG TTCCGGCGTG GATGTCGGCG CTGGCGGCGC TGCGGGTGGC GGGCCAGGAC
ACCGAGCGGG TCTACGACGA CCTGATCGCC ACCAACTCCT ACGGCCAGTG CCTGGGCGTA
GTGCACATCA GCGACCTCAT CCAGTCCCTG TCGCGCAGCT GA
 
Protein sequence
MRTVTALHPQ HTPTAHPDGP GAVARRAPGA PGYQPVVDLD SGTVVAVEVT ASPPPSVRGS 
YAEVTPQEEA AVAEWLLALV RETAVSESLL PMVLPLPARI LAGEGFAPLV ESLLRRAGRR
PRDITFMLSP DMAELARRTV VSGVSRLRAA GFRCGFGTAM VRPDLVVEAA PFLMRIDPAI
VSGVAGDQRH ATVVEGLARI GRGSGVYALA SGVGSVEDVV RLRRCGIRVG TGPFFADSAW
RPGERVTPVP EPSAGQGGEE DSGPRVTEFM VPPVGLDSDA TAEQVLESFT GDPALNSVIL
IDHRDRPVGV VDRTRFLLSV TGRYGHALHA KRPALRLAES PRTVPAWMSA LAALRVAGQD
TERVYDDLIA TNSYGQCLGV VHISDLIQSL SRS