Gene Ndas_2200 details


Gene Information

Locus tag: Ndas_2200
Symbol:
ID: 9246050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2630410
End bp: 2632719
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: putative sensor with HAMP domain
Protein accession: YP_003680128
Protein GI: 297561154
COG category:
COG ID:
TIGRFAM ID:
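
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2630410
end_bp = 2632719
gene_length_bp = 2310
protein_length_aa = 769


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold: 2632719 - 2630410 + 1 = 2310 bp, and 2310 / 3 - 1 = 769 aa.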


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.930898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000377518
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCCACA ACAAGCGCCG TTCGCTTCGC GCCCGCATGA TCACGCTGGT CCTGTTCCCC 
AGCACCGTGC TGCTGGCCCT GTGGGCCGCC TTCACCGCCC TGCTCGTCGG CGACATCGGT
GAACTGCGCA CCACCGCCAC CTTCACCGAA CAGGTCGGCG TCCCCGTCGA GGAGACCATC
AGCGCGCTCC AGCGCGAACG CCGCGCCACC ATGGACGCGG CCTGGGGCAC CGAGCGCACC
GCCCTGTACC TGACCTACTC GCGCCAGCGC ACCGACGAGG CCGTGGACGC CCTCAGCCGG
AGCCTGGACG CCTTCGACGT GCAGGACCTG CCCGAACAGG CCCTGGCCTT CCGCGCCGCC
GTCAACCGCG TCGACACCCT GCGCGCCCGG GTGGACGCCT CCTCCCCCAC CGAGATCGAC
CTGGAGGAGA CCTCCACCGT CTACGACGAG GTCATCGAAC AGGGCCTGCG CGTGTGGGAC
GGCCAGGTCG AGCGCGCCGA CCCCGCCCAG GCCCCGCACC TGCGCTCCCT GACCTCGCTG
ATGCGCACCC GCGAACTCCT CAACCAGCAG GACACCGTCC TGGCCCACGC GGTGGCCACC
AACTCCTTCC CCGTCCAGGC CCACGCACAG TTCGCCGCCG CGGCCGGAGC CCAGCACTAC
ACCTGGGACC GGGTCGTGGT CGAGATGAGC GAGGAGGACG GCCAGGACTA CATCGAGCTG
GAGTCCTACA GCGAGGTGCA GCGCATCTAC CAGCTCCAGG AGAGCATCAT CTCCATGCCC
GTGCGGGGCG GGGCCACCGT GCCGGTCAAC GCCACCGCCT GGCAGGGCGC CGCCGAGGCC
GTGGACGCGC GCATGCGCGA CGTCGAACAG GGCCAGATCG ACCGGGTCAT CGCCTTCAGC
CGCTCGCAGT CGGCCGAGCT GCGCACCAGC GTCCTGCTGG TGAGCGTGCC CGCCCTGGTG
GCCGCCCTGG TCTCCTCCGC GGTCGCGGTG GGCGCCACCC AGCGCCTGGG CAGGCGCCTG
CAGGACCTGC GCACGGCCAC CCTGGAGCAC GCCCGGGTGC GCCTGCCGGA GGTGACCGCA
CGGCTGCGCG CGGGCGGCAG CGTGGACGTG GACGCCGAGG TGCCGCGGCT GCGGGTGGAG
AGCACCGACG AGGTCGGGCA GGTGGCCGAG GCCTTCAACG ACGCCCAGCG CGCGGCCGTG
GCGGCGGCGG TGGAGGAGGC GCAGGTGCGC GCGGGCGTGC GCAACATGTT CCGCAACATC
GCCCGGCGCA CGCAGGCGCT GGTGCACCGC CAGCTGGGGC TGTTGGACGC GCTGGAGCGC
GCCGAGACCG ACCCCAAGGT GCTGGAGTCG CTCTTTCGCA TCGACCACTT CAGCACGCAG
ATGCGGCGCA ACGCCGAGAA CCTGATGCTG CTCAGCGGGG ACGCCCCCGT GCGGCGGGGA
CTGGCGCCGG TGCGCCTGCA CGAGGCGGTG CGGGCCGCGG CCAGCGAGAT CGAGGACTAC
GTGCGGGTGC GGGTGCTGGC GGTGCCGGCG GTGGCGCTGC GCGGCGAGGT GGGCGCGGAC
ACCGTGCGGC TGCTGGCGGA GCTGCTGGAG AACGCCACGT CCTTCTCGCC CCCGGGCACG
GAGGTGACGG TGCGCGGCCG GGCCGGGGAG GCGGGCGGGT GCGTGCTGGA GGTGCGCGAC
CGGGGGCTGG GGATGACGCG GGAGCAGTTG GAGGCGTCCA ACGCGCTGCT GCGCGACCCG
CCGCGCTTTG ATCTGGCCCG GATGCGCGAG GACTCCCAGC TGGGGTTGTT CGTGGTGGCC
ACCATCGCCG CGCGCCACGG CCTGGAGGTG ACGCTGAGCG CCGGGCCCGA GCGCGGCGCG
TGCGCGGTCG TGGCCGTTCC CGCCGACGCG CTGGCGGACC AGGGCGCGGG CGCCGCGGAG
CCGGGCGGGG ACCGGGAGGC CGCGACCGCG CGGTCGCAGC GGCTGGTCCT GGCGGGCGGG
GTCGAGGAGC GGGCGGCGCG GGAGGACGGG GCCCGGGCCG CGCCCGAGAC CGCGGACGCG
GCCGGGGACG AGGACACGGG CGATACCTAC AAGGGTCTGC CCCGGCGGCG GCGCAAGGGC
GTGCAGCCCT CCCCCGCGCC CGCCCCGGCC GCGCCCGGGC ACCCCGGGGA GTCCGGGGGC
CGGGAGGCGG CGGCCGAGCG GTCGCTGACC GAGATCCGGT CGATGATGAG TGCTTTCCAG
TCCGGGAGCC TGCGCGGGCG CGCACAGCCC CTGGACGGGG ATGGAGACGG CGTGCGCGGT
GCGCGCCCCG GTGAGAGCTC GGAAGGGTGA
 
Protein sequence
MPHNKRRSLR ARMITLVLFP STVLLALWAA FTALLVGDIG ELRTTATFTE QVGVPVEETI 
SALQRERRAT MDAAWGTERT ALYLTYSRQR TDEAVDALSR SLDAFDVQDL PEQALAFRAA
VNRVDTLRAR VDASSPTEID LEETSTVYDE VIEQGLRVWD GQVERADPAQ APHLRSLTSL
MRTRELLNQQ DTVLAHAVAT NSFPVQAHAQ FAAAAGAQHY TWDRVVVEMS EEDGQDYIEL
ESYSEVQRIY QLQESIISMP VRGGATVPVN ATAWQGAAEA VDARMRDVEQ GQIDRVIAFS
RSQSAELRTS VLLVSVPALV AALVSSAVAV GATQRLGRRL QDLRTATLEH ARVRLPEVTA
RLRAGGSVDV DAEVPRLRVE STDEVGQVAE AFNDAQRAAV AAAVEEAQVR AGVRNMFRNI
ARRTQALVHR QLGLLDALER AETDPKVLES LFRIDHFSTQ MRRNAENLML LSGDAPVRRG
LAPVRLHEAV RAAASEIEDY VRVRVLAVPA VALRGEVGAD TVRLLAELLE NATSFSPPGT
EVTVRGRAGE AGGCVLEVRD RGLGMTREQL EASNALLRDP PRFDLARMRE DSQLGLFVVA
TIAARHGLEV TLSAGPERGA CAVVAVPADA LADQGAGAAE PGGDREAATA RSQRLVLAGG
VEERAAREDG ARAAPETADA AGDEDTGDTY KGLPRRRRKG VQPSPAPAPA APGHPGESGG
REAAAERSLT EIRSMMSAFQ SGSLRGRAQP LDGDGDGVRG ARPGESSEG
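
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how they might be cleaned up, translated, and checked for GC content follows. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), so a standard codon table is used here. All function names are hypothetical, and only the first line of each sequence is included for brevity.

```python
# Standard codon table in TCAG order; assumed adequate for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
first_dna_line = "ATGCCCCACA ACAAGCGCCG TTCGCTTCGC GCCCGCATGA TCACGCTGGT CCTGTTCCCC"
first_protein_blocks = "MPHNKRRSLR ARMITLVLFP"

print(translate(first_dna_line) == clean(first_protein_blocks))
print(round(gc_fraction(first_dna_line), 2))
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow comparison with the GC content and protein sequence listed in this record.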