Gene Ndas_3088 details


Gene Information

Locus tag: Ndas_3088
Symbol:
ID: 9246944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3695888
End bp: 3697258
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: AAA ATPase central domain protein
Protein accession: YP_003681003
Protein GI: 297562029
COG category:
COG ID:
TIGRFAM ID:
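
The coordinates and lengths in this record are mutually consistent, which a few lines of Python can confirm. This is only a minimal sketch: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that contributes no residue to the protein. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 3695888
end_bp = 3697258

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1371
print(protein_length)  # 456
```

Both values match the listed Gene Length (1371 bp) and Protein Length (456 aa).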


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCTCGG CCGCCTACCC TGGACAGGTG TCCGACACGC TCTTCGACGA CGCCGGCGCG 
GAGGCCGCCA GAGGCCAGGA ACCGCTCGCC GTGCGCATGC GCCCCCGCAC CCTCGACGAG
GTCGTGGGCC AGCGCCACCT GCTCGGCGAG GGCAGCCCGC TGCGCCGCCT GGTCGAGGAC
GACGCGCCCA TGTCCGTGTT CCTGTGGGGC CCGCCCGGCA CCGGCAAGAC CACCCTGGCC
ACCGTGGTCA GCCGCGTCAC CAAGCGCCGC TTCGTCGAGC TGTCCGCCGT CAACGCCGGG
GTCAAGGACG TGCGCGCCGT CATCGACGAC GCCCGGCGCC GCATGGGCAT GCACGGCACC
CGCACCCTGC TCTTCGTGGA CGAGGTCCAC CGCTTCAACA AGACCCAGCA GGACGCGCTG
CTGCCCGCCG TGGAGAACCG CTGGGTCAGC TTCATCGGCG CCACCACCGA GAACCCCTTC
TTCTCCGTGG TCAGCCCCCT GCTGTCGCGT TCGCTGCTGC TGTCCCTGGA GTCGCTGGAG
GACGCCGACG TCCGCGCCCT GGTGGACCGG GCGCTGGCCG ACGAGCGCGG GCTGGACGGG
CGCTACACGC TCTCCGACGA GGGCGCTGAC CACCTCGTAC GCCTCGCGGG CGGCGACGGC
CGCCGCTCCC TGACCTACCT GGAGGCCGCC GCGCTCGTCG CCGGCGCCCC CGGCGCGGAG
CCGGTCACCA TCACGGCCGA ACACGTCGAA CGCGCCGTGG ACCGGCACGC CGTGCGCTAC
GACCGCTCCG GCGACCAGCA CTACGACGTC GTCAGCGCCT TCATCAAGAG CATGCGCGGC
TCGGACCCCG ACGCCGCCCT GCACTACCTC GCCCGCATGA TCGAGGCGGG GGAGGACCCC
CGCTTCATCG CCCGACGCGT GGTCGTGCAC GCCAGCGAGG ACGTCGGCAT GGCCGACCCC
ACCGCCCTGC AGACCGCCGT GGCCGCCGCC CAGGCCGTGG AGCTCATCGG CATGCCCGAG
GCCCGCATCA ACCTCGCCCA GGCCGTCATC CACATCAGCC TGGCCCCCAA GTCCAACGCG
GTCGTCTCCG CCATCGACGC CGCCGCGGCC GACGTGCGCG CGGGCCTGGC CGGGCCCGTC
CCCGCCCACC TGCGCGACGG CCACTACCGG GGCGCCGCCG AACTCGGCCA CGGCAAGGGC
TACCGCTACG CCCACGACTT CCCCGGCGGC GTCGCCCCCC AGAGGCACGC CCCCGACGGC
CTCGCCGACC GCGAGTACTA CCGGCCCACC CAGCACGGCG CCGAACGGCG CTTCGGCGAG
GTCCTCCAGC GCATCAAGGA GGTCCTGCGG GGCGGCGGGC AGCGCGGCTG A
 
Protein sequence
MPSAAYPGQV SDTLFDDAGA EAARGQEPLA VRMRPRTLDE VVGQRHLLGE GSPLRRLVED 
DAPMSVFLWG PPGTGKTTLA TVVSRVTKRR FVELSAVNAG VKDVRAVIDD ARRRMGMHGT
RTLLFVDEVH RFNKTQQDAL LPAVENRWVS FIGATTENPF FSVVSPLLSR SLLLSLESLE
DADVRALVDR ALADERGLDG RYTLSDEGAD HLVRLAGGDG RRSLTYLEAA ALVAGAPGAE
PVTITAEHVE RAVDRHAVRY DRSGDQHYDV VSAFIKSMRG SDPDAALHYL ARMIEAGEDP
RFIARRVVVH ASEDVGMADP TALQTAVAAA QAVELIGMPE ARINLAQAVI HISLAPKSNA
VVSAIDAAAA DVRAGLAGPV PAHLRDGHYR GAAELGHGKG YRYAHDFPGG VAPQRHAPDG
LADREYYRPT QHGAERRFGE VLQRIKEVLR GGGQRG
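
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG can act as an alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a rough illustration under stated assumptions: the `translate` helper and the `START_CODONS` set are hypothetical names, the start-codon set is a simplified subset of those table 11 allows, and the sense-codon assignments are those of the standard code, which table 11 shares.

```python
# Standard sense-codon assignments (shared by translation table 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        # Stop at the first stop codon.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_blocks = "GTGCCCTCGG CCGCCTACCC TGGACAGGTG"
print(translate(first_blocks))  # MPSAAYPGQV
```

The result matches the first ten residues of the listed protein sequence. Note that the second GTG, at an internal position, is translated as V.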