Gene Ndas_2643 details


Gene Information

Locus tag: Ndas_2643
Symbol:
ID: 9246494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3152512
End bp: 3153882
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: putative PAS/PAC sensor protein
Protein accession: YP_003680566
Protein GI: 297561592
COG category:
COG ID:
TIGRFAM ID:
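
The coordinate and length fields in this record agree with each other, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are inclusive and that the gene length includes one terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3152512, 3153882)
print(length_bp, protein_length(length_bp))  # 1371 456
```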


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.351947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGG AGACGGGGCC TCCCGGGGCC GGCAACGGCT CCGCCACCGA CGCGGCCCTC 
AGCGCGCTGC TGGAGGACAG CGCCGAGGAA CTCTACGAGT CCGCGCCGTG CGGCTACCTG
TCCACGCTCA TGGACGGCAC GGTCGCCAGG ATCAACGCGA CGCTGCTGAG GTGGCTCGGC
CTGGAGCGCG CCGCCGTGGT GGGCCGCATG CGCTTCACCG ACCTGCTCAC CGTGGGCGGC
AGGCTCTACC ACGAGACGCA CTTCGCGCCC CTGCTGCACC TGCGGGGCGA GGTCAACGGC
ATCGCCCTGG AGATGCGGGC CTCCGACGGC GGCCGCCTTC CCGTGCTGGT CTCCTCCACC
GTCAAGCGCG ACGGCGGGGG CCAGCCGCTG CTGGTCCGCA CCACCGTCTT CGACGCCACC
GACCGCCGCT CCTACGAGGA GGAGCTGCTG CGCCGCCGCA GGGAGGCGGA GCAGGCCCGC
GCCGAGGCAG AACGGGCCCG CGAGGAGGCC GAACGGGCCC GCGCCGAGGC CGAGGAGGCG
CACCGGCGGG CCGAGGCGGA CCGGGCGCGC CTGGGAGAGG CGCTCGTCAT CCTCCAGAGG
GCCCTGCTGC CCGACACCCT GCCCGACGTT CCGGGCATGG AGGCCGCCGC CTACTACCAC
ACCGCCTCCC CCTACCGGCT GGGCGGCGAC TTCTACGACC TCTTCCCGCT CGGTGACGGG
TGCTGGGCGT TCTTCCTCGG CGACGTGAGC GGCAAGGGGC CCGAGGCCGC GACCCTGACC
TCCCAGGCCC GTTACGTCCT GCGCACCACC GCCCTGCACT CGTCCGAACC CGCGGACGCC
CTGGGCACGC TGAACACCGC CCTCCTGGAG CGCTACGCCG ACAACGGCGA CCCCCGCTAC
TGCACCGCCG TCTTCGGTGT CCTCGAACCC GACGGCGACG CCGGGCACGT CCGTGCCCGC
CTGGCCTTCG GCGGCCACCC CCCGGCGCTG GCCCTGCGCG GGGACGGCCG GGCCGAGTTC
CTGTTCGCCC CCGGCGGGAT GCTCGTCGGC GTGCTGCCCG ACGCCCACTT CGCCACCGTC
GAGACCGCCC TCGCCCCCGG CGACACCTTC GTGCTCTACA CCGACGGCCT GACCGAGGCC
CGCACCGGCG CCGGACCCGA CACGATGTAC GGCGAGGAGG CCCTGCTCTC CTTCGTCGCG
CGGCACGCCC CCTCCACGGC CCACGGCGTC GTCGACGCGC TGGCCGAACT GCTGGAGGGT
TTCGGCGAGG GCTTGGAGGA CGACACCGCC CTCCTCGCGC TCGGCGTCCC CGCCCCGCCC
TCCACCGCCC CGTCCGGAAC CGGACACCAC ACGATGAGCG GTCCACGATG A
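
The gene length and GC content reported in the record can be recomputed from the sequence block. The following is a minimal sketch, assuming the sequence is pasted in the page's layout of 10-base groups separated by spaces and line breaks. The helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First row of the gene sequence, in the page's 10-base grouping.
first_row = "ATGAGCGGGG AGACGGGGCC TCCCGGGGCC GGCAACGGCT CCGCCACCGA CGCGGCCCTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block to `clean_sequence` instead of the first row gives the values to compare against the Gene Length and GC content fields.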
 
Protein sequence
MSGETGPPGA GNGSATDAAL SALLEDSAEE LYESAPCGYL STLMDGTVAR INATLLRWLG 
LERAAVVGRM RFTDLLTVGG RLYHETHFAP LLHLRGEVNG IALEMRASDG GRLPVLVSST
VKRDGGGQPL LVRTTVFDAT DRRSYEEELL RRRREAEQAR AEAERAREEA ERARAEAEEA
HRRAEADRAR LGEALVILQR ALLPDTLPDV PGMEAAAYYH TASPYRLGGD FYDLFPLGDG
CWAFFLGDVS GKGPEAATLT SQARYVLRTT ALHSSEPADA LGTLNTALLE RYADNGDPRY
CTAVFGVLEP DGDAGHVRAR LAFGGHPPAL ALRGDGRAEF LFAPGGMLVG VLPDAHFATV
ETALAPGDTF VLYTDGLTEA RTGAGPDTMY GEEALLSFVA RHAPSTAHGV VDALAELLEG
FGEGLEDDTA LLALGVPAPP STAPSGTGHH TMSGPR
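
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one way to reproduce it, assuming the amino-acid assignments of translation table 11, which match those of the standard code. Table 11 differs in its permitted start codons, but this gene starts with ATG, so no special start handling is included. The `translate` helper is illustrative rather than the database's own method.

```python
from itertools import product

# NCBI ordering of bases and amino acids; the amino-acid assignments of
# translation table 11 are the same as those of the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the gene sequence.
print(translate("ATGAGCGGGG AGACGGGGCC TCCCGGGGCC"))  # MSGETGPPGA
print(translate("GGTCCACGAT GA"))                     # GPR
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record.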