Gene Ndas_2239 details

Gene Information

Locus tag: Ndas_2239
Symbol:
ID: 9246089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2679132
End bp: 2680451
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: putative phytochrome sensor protein
Protein accession: YP_003680167
Protein GI: 297561193
COG category:
COG ID:
TIGRFAM ID:
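The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 2679132
end_bp = 2680451

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the terminal stop codon (assumed present).
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1320 439
```

Under these assumptions the result agrees with the listed gene length of 1320 bp and protein length of 439 aa.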


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0494571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.300779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTC CGCCGCTGGA CACGGCCCTG CCGGCCGGGG CCGACGCCCG CGAGCACGCC 
CGCCTGCTCC GGCGCGTGCA CGAGGCCTCG CTCTCCGGGC GCCCGGCGCC GGCCGCCCTG
CGGCCGGTCA TCGAGGACTC CTGGTCCCGC AGCCGACGCT TCGGGATCGA CCCCGACAGC
GCCCCGCCGC CCCGCATGGC CCGGCTGGAC GAACTCCAAC GGCACCGGGA CGCCTCCCCG
ATCGCCGAGG TCCTGCCGCT GATCCGCCGC TCCCTCGTCT CGGTCGCCGA CGAGGCCGAC
CACATCATGC TGGTCACCGA CGCCTCCGGT CAGGTCCTGT GGCGCGACGG CTCCCACCGC
GTCCGCGCCC TCGGCGACCG CGTCGGGCTC GTCGAGGGGG CCTTCTGGAA CGAGGGCAGC
ACCGGCACCA ACGCCATCGG CACCGCCCTG GTCGTGGGGC GGCCCGTGCA GGTCTACTCC
GCCGAGCACT TCATGCGCAG CCTGCACGCC CTCACCTGCG CCTGCGCACC CATCCACGAC
CCCCGCGACG GCCGCCTGCT CGGCGCCGTC GACGTCACCG GCCCCGTCTC CACCATCCAC
CCCTCCACCC TCGCCCTGGT CAGCGCGGTG GCCCAACTGG CCGAAGCCCA TCTGCAGAGC
CTCCACCACA CCCACCTGGA GCGGCTGCGC TCGGTGGCCG CGCCCCTGCT GGCCGGGATG
AGCGAACGCG CGCTGGTGGT GGACGAGGCC GGGTGGACCG CCGCCGCCGT CCACATGGAG
CCGGTCCGCA GGGTGCTGCT GCCCAAACAG CGCGGGAGCG GCACCGCCTG GCTGCCCGCC
CTGGGGGAGT GCGCCCTGGA GCCCCTGCCC GGCGGATGGC TGCTGCGCCC GCGTCCGGCG
GCGGAGAGCG CGCCCTCCAC GGTCACCCTG GACCTGACCC GGCCCTCGCC CAGGATGGTG
GTCGCCGGGC CCAGCGGGGA GTGGGCGCAC CGGCTCACCC CGCGCCACGC GGAGCTGCTG
CTGCTGTTGG CCGTGCACCG GGCCGGGCGC ACCGGAGCGC AGCTGTCCCA GGACGTGTTC
GGCGCGGGCG GGCACGTGGT GACGGTGCGC GCCGAGCTCT CGCGCGTGCG CCGCCACCTG
GGCGGCATCA TCCAGAGCCG CCCCTACCGG TTCAGCGGGG AGGTGCGGGT GCGGGTGGTG
CGTCCCCCCT CACCGGTGGA CCTGCTGCCC GGGTCGGTGG CCCCCGGGGT GTGCGCGCTG
CGGGACGCGC TCCGCGACGG CGACTGTCCC ATGGCTCTGC GCGACGGAAC CGCAACCTAG
 
Protein sequence
MSFPPLDTAL PAGADAREHA RLLRRVHEAS LSGRPAPAAL RPVIEDSWSR SRRFGIDPDS 
APPPRMARLD ELQRHRDASP IAEVLPLIRR SLVSVADEAD HIMLVTDASG QVLWRDGSHR
VRALGDRVGL VEGAFWNEGS TGTNAIGTAL VVGRPVQVYS AEHFMRSLHA LTCACAPIHD
PRDGRLLGAV DVTGPVSTIH PSTLALVSAV AQLAEAHLQS LHHTHLERLR SVAAPLLAGM
SERALVVDEA GWTAAAVHME PVRRVLLPKQ RGSGTAWLPA LGECALEPLP GGWLLRPRPA
AESAPSTVTL DLTRPSPRMV VAGPSGEWAH RLTPRHAELL LLLAVHRAGR TGAQLSQDVF
GAGGHVVTVR AELSRVRRHL GGIIQSRPYR FSGEVRVRVV RPPSPVDLLP GSVAPGVCAL
RDALRDGDCP MALRDGTAT
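
The gene and protein sequences can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch, not the method the database used: the helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon table is the standard one. Translation table 11 assigns the same amino acids and differs only in the start codons it permits.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence as listed in the record.
first_blocks = "ATGTCTTTTC CGCCGCTGGA CACGGCCCTG"
print(translate(first_blocks))  # MSFPPLDTAL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.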