Gene Ndas_2240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2240 
Symbol 
ID9246090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2680860 
End bp2682455 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content80% 
IMG OID 
Producttranscriptional regulator, PucR family 
Protein accessionYP_003680168 
Protein GI297561194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0227847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.453495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGGAC AGAGCGGGGC GCAGGGCCAC ACGGTCCCGT TGAGCACCGT CGTGGGCCGC 
CGCGACCTGG GGCTGACCAC CCTCGTGGCC GCCGGGAACC CGGGGATCGG CTGGGCGGTG
GCCAGCGAGC TGGCCGACCC GGCCGCCTAC CTGCGCGGCG GTGAACTGCT GCTCACCGCG
GGCGTCAACC TGCCCGCCGC CCCCGCCGGG CTGCGCGGCT ACGTGGACTC CCTGGTCGGC
GCGGGGGTCA GCGCCCTCGG TTTCGGTGTG ACCCCGGTGC ACGACACCGT CCCCGCCGGG
CTGGTCGAGC AGTGCCGCGC ACGGGGGCTG CCCCTGGTGG AGGTGCCCCG CCCCACACCC
TTCGCCGCCG TCAGCCAGGC CGTGGGCGCC GAACTCCAGG AGCTGCACCT GCGCGACCTG
CGCCGCCTGG GCGAGGCGCA CCAGGCCCTG GCCCTGGCCG TCACCGCCGA CGCCCCCGTG
GACCGGGTCC TGCGGGTCCT GGCCGACGCC CTGGACGGCT GGGCGGTCCT GGCCCGCCCG
TCGCCCGCCG TGCCGGGCGG CGCCCACCGC ACGCCGGGCG CCCCGGCGGA GCTGGACCCC
GAACTGCGCG GGCTCGCGGA CCGGCTCACC CGGCCGCGCG GCCCGCGCGG CGCCAAGGCC
CGTGTGCGGG GGGACGAGGT CTTCCTGCAC ACCGTCGGCA CCCCGCCGCA GGAGCACGGA
GTGGTCCTGG TCGGCCGCCC CGAGCCGCTG GACGTCACCG ACCGGGCCGT CCTGCGCACC
GCCACCGCCC TGCTGGACCT GCTCGCCCGC GCCTCCCGGG GCGCCCCGCC CGCCCCGGGC
CGCCTGATCA CCGGTCTGCT CCTGGACGGC GGGCTCACCG GCGCGGCCGT GCCGCTGCTG
GCCGAACTCA CCGCCCCGAC GGACGCCTTC GTGGGGTCGG CGGCGACCGG GGCCTCCGTG
AGGGCCGGCG CACCGGGGGA CCCCGCCGCC GCGGGCGCGC CGGGGGACCC CGCCGCCTAC
CGGGTCCTGC GCGCCCGGCC CGCCGGGCGG GGCCGCCACA CCGCCCCCGC CGCGCTGCCC
CTGGGCACCC GGCTGCTGGA CGCGGGCCCC GGCGAGGACC TGCGCGCCGT CCTCGCCGAC
CGGGGCGAGG CCGCGCACCT GGCCCACCTG GACCGGCTGC TCGACCACGG CTGGATCGGC
GCGCTGAGCG GGCCGGTGGA CCCGGCGGAG CTGGCCGCCG CGGACCGCCG GGCCGCCGCC
CTGCTCACCC GTGCCCGCGC GGTCGGCGGG CCGCTGCTGG AGGAGCCCGC CGACCCCTTC
GACGCCCTCC TGGGGCCGGG GGGAGGAGAG GACCTGGCCC GGCGTGTCCT GGGACCGCTG
GCCGAGGACA CCGACTCCGC GCGCCTGCTG CGCCGCACCC TGCGGGTCTG GCTCACCCGG
CACGGCAACT GGGACCGGGC CGCCGCCGAC CTGGGCGCCC ACCGCAACAG CGTCCGCTAC
CGGATCGGGC GGATCGAGCG CGATCTGGGC GTGGACCTGG CCGACGCCGA GCAGCGCATG
CGGCTGTGGT TCGCGCTCAC CCGCTGGCGG CACTGA
 
Protein sequence
MSGQSGAQGH TVPLSTVVGR RDLGLTTLVA AGNPGIGWAV ASELADPAAY LRGGELLLTA 
GVNLPAAPAG LRGYVDSLVG AGVSALGFGV TPVHDTVPAG LVEQCRARGL PLVEVPRPTP
FAAVSQAVGA ELQELHLRDL RRLGEAHQAL ALAVTADAPV DRVLRVLADA LDGWAVLARP
SPAVPGGAHR TPGAPAELDP ELRGLADRLT RPRGPRGAKA RVRGDEVFLH TVGTPPQEHG
VVLVGRPEPL DVTDRAVLRT ATALLDLLAR ASRGAPPAPG RLITGLLLDG GLTGAAVPLL
AELTAPTDAF VGSAATGASV RAGAPGDPAA AGAPGDPAAY RVLRARPAGR GRHTAPAALP
LGTRLLDAGP GEDLRAVLAD RGEAAHLAHL DRLLDHGWIG ALSGPVDPAE LAAADRRAAA
LLTRARAVGG PLLEEPADPF DALLGPGGGE DLARRVLGPL AEDTDSARLL RRTLRVWLTR
HGNWDRAAAD LGAHRNSVRY RIGRIERDLG VDLADAEQRM RLWFALTRWR H