Gene Ndas_5524 details


Gene Information

Locus tag: Ndas_5524
Symbol:
ID: 9249427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 716795
End bp: 718321
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003683409
Protein GI: 297564436
COG category:
COG ID:
TIGRFAM ID:
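
The coordinates and lengths in the record above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is an untranslated stop; both are assumptions that happen to reproduce the listed values, not something the record states. The function names are hypothetical.

```python
# Values copied from the Gene Information record above.
START_BP = 716795
END_BP = 718321


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1527, matching "Gene Length"
print(protein_length(length_bp))  # 508, matching "Protein Length"
```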


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGCG AAAAAGTCGA CGGCTGGGTG ACGGAGAGCC TCCAGGGGAT CATCGAGCGC 
ATGTTCGATC TCCTCATCTA CAGCGGCGCG TCGAGCCCCG TCTACAACGC GGACCGCCGA
GGAGAGGGAC GCCTCAACCT CCAGATGGAC CAGGGAGTCC TGCAGCTCCA GGACGTCTTC
AACCCGCTGG TCCTGTTCCT GGCCACGCTG GGCATCCTGC TGGCGGCCAT CCGCCTGCTG
TGGACCCGGC GCCTCGACCC GATGCTGGAC CTGATCCGCG GTCTGCTGAC GGTCATCGTC
GTCACCTTCG GCGGGCTCTT CCTCATCTAC ATCCTGCGGC AGTTCAGTTT CGGGCTCACC
AACATGGTGC TCACCACCGG GACCTACGGG GACCAGAGCA GCTTCCTGGA GTCCGCGCAG
AGCGTCTACC CGAGCATCAT GCGCGTGGGC GTCGACGGCG CCGACCTGGG GTTCGCGGAC
AACGAGGTCG CCCGCGGGGC CGTGACCTCG CTGGTCGGAC TCGCGGTGAT CCTCGGGCTG
ATCGCGCAGA TGATCGTCCT CGCCCTCCTG GAGGCGGTCA CCTACCTCAT CGCCTGTCTG
CTGCCCCTGG CCGCGGCCAG CAGCATGATC CCGAACCTGA AGATCTTCCC GAAGGTCATC
GGCTGGCTGT TCGCCTGCTT CCTGTACAAG CCCCTCCTCC TGATGATCTA CCTGGTCGGT
CTCGTCGTCC TCGCGAGCGC GGGCCCCGGC GGGGACGACC TGCTCAGGTA CCTCACCGGA
GCGGCGGTCA TGCTCCTGGC CACCGGCGCG CTCCCCGTCC TCATGAAGCT CATGAGCTCG
GGGAGCGTGT GGATGATGGC GCTGGCCGCG GGCGGCCTCA ACTCCGTCAC CGGCGGTGGC
GGCGGCGGTG GCGGCGGCGG CGGGAACAAC GGTGGCGGGA ACACCGCCGG CTTCCAGAGC
AGCGCGGGCT CCGCCGACAG CGGTACCGCC ACCGCCCAGC AGGCGGCGGC CGGGGCCAAC
GAGCGCGCCG CGGGCCTGGC GCGCAACCTC GACGAGGGCG GGCCGCTCCC GGGCGGCGAC
GCGGTCCCGG TGGCGGCGGT GAGCGCGGCC GGCACCGGGT CGGGGGCCGC GGCCACGTCG
GCCCTGGGAA CGCAGACCGG GCAGCTCTCC GGCCTGGGCG GAACCGGAGC CGGAGGCACC
GACGGGGCCG CGGACTCCGC CGGGGGCGCG GTCGGCGGCG CCGACGGCGG CACGGACGCG
GCGGCCTCCG GGGCGACGGA GTCCGTGAGT TCGACGGTCA GCGGCTCGGA ACAGTCCTAC
AGCCACACGA CCGCGTCCGG AGCGGACGAG GTCTCCGGGG GCGGTTCGGC CTCCGGAGCC
GACGCCGGAG GCGCCGACGC GCCGGGCGGC GCGGCCACGG CCCCGGACGG AGGCGGCCCG
GCCGCCTTCG GGGGCGGGTC GGCGCCCGCC CCGGAGGGGG CGTCCGGAGC ACAGTTCGCC
GACGGAGCAG ACGGAAGGAT GGTCTAG
 
Protein sequence
MNGEKVDGWV TESLQGIIER MFDLLIYSGA SSPVYNADRR GEGRLNLQMD QGVLQLQDVF 
NPLVLFLATL GILLAAIRLL WTRRLDPMLD LIRGLLTVIV VTFGGLFLIY ILRQFSFGLT
NMVLTTGTYG DQSSFLESAQ SVYPSIMRVG VDGADLGFAD NEVARGAVTS LVGLAVILGL
IAQMIVLALL EAVTYLIACL LPLAAASSMI PNLKIFPKVI GWLFACFLYK PLLLMIYLVG
LVVLASAGPG GDDLLRYLTG AAVMLLATGA LPVLMKLMSS GSVWMMALAA GGLNSVTGGG
GGGGGGGGNN GGGNTAGFQS SAGSADSGTA TAQQAAAGAN ERAAGLARNL DEGGPLPGGD
AVPVAAVSAA GTGSGAAATS ALGTQTGQLS GLGGTGAGGT DGAADSAGGA VGGADGGTDA
AASGATESVS STVSGSEQSY SHTTASGADE VSGGGSASGA DAGGADAPGG AATAPDGGGP
AAFGGGSAPA PEGASGAQFA DGADGRMV
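
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch rather than the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG. The helper name `translate` is hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, as printed in the record.
FIRST_LINE = "ATGAACGGCG AAAAAGTCGA CGGCTGGGTG ACGGAGAGCC TCCAGGGGAT CATCGAGCGC"
LAST_LINE = "GACGGAGCAG ACGGAAGGAT GGTCTAG"

print(translate(FIRST_LINE))  # MNGEKVDGWVTESLQGIIER
print(translate(LAST_LINE))   # DGADGRMV
```

Both outputs match the start and end of the listed protein sequence. The last line can be translated on its own because it begins at nucleotide 1501, which is codon-aligned.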