Gene Ndas_5318 details


Gene Information

Locus tag: Ndas_5318
Symbol:
ID: 9249218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 483724
End bp: 484902
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003683204
Protein GI: 297564231
COG category:
COG ID:
TIGRFAM ID:
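
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(483724, 484902)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```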


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.262877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCGACG TGATCCTGGT CGTCCTGTTG CTGCTGTTCG CGGTGACGGG GTACCGACAG 
GGTTTCATCG TCGGTGTCTT CAGCTTCGCG GGCTTCGTCG GAGGGGGCGT CCTCGCCGCC
CTGACGGCCC CGGCACCCAT CCAGGCGTGG GTGGAGGACC CCGGCAGGCA GGCGCTGCTG
GCGATCGCCG TGGTGTTCCT GTCCGCGGCG CTCGGCCAGT TCCTGCTCTC CTACCTGGGC
ACCTTCGTCC GCAACAAGGT GACGTGGGAC TCGGCGCGGG TCCTGGACGC CATCGGCGGC
GCCCTGATCA GCGGGCTCTC GGTGCTGCTC GTGGCCTGGT TCATCGGCAG CACGGTGGCC
AACTCGGCGC TGCCGTTCGT CGCGGGCCAG GTCAGGGACT CGCGCATCCT CCACTCGGTG
GACACGCTGA TGCCCGAGGC CGCCCACAGC GGGTTCTCCA CGTTCCGCCG GATCGTGGAC
CAGAGCGCCT TCCCGCAGGT GTTCAGCGGC CTGGGCACCG GTGAGCTGGC CGAGGTGGCG
CCGCCGGACC CCGACGTGCT CACCACCCCG GAGCTGATCG AGTCGAGCCG CAGCGTGGTG
AAGGTGCTGG GCACCGCGCC CTCGTGCCAG CGCCGCGTGG AGGGGACCGG CTTCGCCTAC
GCGGAGGACC GGATCATGAC CAACGCGCAC GTGGTCGCCG GGGTCACCGA CGACCTGCGG
GTGGTCACCC GGGAGGGCTA CCAGCTCGAC GCCACGCTGG TGCTCTTCGA CGCCCAGCAG
GACCTGGCCG TGCTGCACGT GCCGGGCCTG GACCTGGAAC CGCTGGAGTT CACCTACGAG
GCCCCGCAGG GCGGTGACGC GGTCGTGGCG GGCTTCCCGC GCAACAGCGG CTTCACGGCC
GTCCCGGCGC GCGTTCGCGC CCGCCAGACG GCGCAGGGGC CGGACTTCTA CCACTCCCAG
CAGGTGAGCC GGGAGATCTA CCAGGTGCGC GCCGTGGTGC GCCCGGGCAA CTCCGGCGGC
CCGCTGCTGT CGCCGGACGG CACCGTGTAC GGGGTGGTCT TCGCCGCCGC CACGAACGAG
CCCGAGACGG GTTACGTGCT CACCGCCGAC GAGGTCGCGG AGAACGCCCA GAGCGGCCTG
GAGAACGACG AGCAGGTCTC CTCCCAGGCC TGCGACTGA
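
The GC content field can be recomputed from a sequence laid out as above, in space-separated blocks of 10 bases. This is a minimal sketch: `gc_fraction` is a hypothetical helper, and it assumes GC content means the fraction of G and C bases over the whole sequence, which may differ from how the database computes or rounds its figure.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, as an example input.
first_row = "GTGCTCGACG TGATCCTGGT CGTCCTGTTG CTGCTGTTCG CGGTGACGGG GTACCGACAG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1179 bp sequence instead of a single row gives the value to compare with the GC content field.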
 
Protein sequence
MLDVILVVLL LLFAVTGYRQ GFIVGVFSFA GFVGGGVLAA LTAPAPIQAW VEDPGRQALL 
AIAVVFLSAA LGQFLLSYLG TFVRNKVTWD SARVLDAIGG ALISGLSVLL VAWFIGSTVA
NSALPFVAGQ VRDSRILHSV DTLMPEAAHS GFSTFRRIVD QSAFPQVFSG LGTGELAEVA
PPDPDVLTTP ELIESSRSVV KVLGTAPSCQ RRVEGTGFAY AEDRIMTNAH VVAGVTDDLR
VVTREGYQLD ATLVLFDAQQ DLAVLHVPGL DLEPLEFTYE APQGGDAVVA GFPRNSGFTA
VPARVRARQT AQGPDFYHSQ QVSREIYQVR AVVRPGNSGG PLLSPDGTVY GVVFAAATNE
PETGYVLTAD EVAENAQSGL ENDEQVSSQA CD
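
The protein sequence starts with M although the gene sequence starts with GTG rather than ATG. Under translation table 11, GTG can serve as an initiation codon, and an initiator codon is read as methionine. The sketch below illustrates this with the standard codon-to-amino-acid assignments, which table 11 shares with table 1. `translate_cds` and `ALT_STARTS` are hypothetical names, and only GTG and TTG are handled as alternative starts to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these two alternative start codons are handled here.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(sequence: str, is_start: bool = True) -> str:
    """Translate a DNA fragment in frame 1, stopping at the first stop codon.

    If is_start is True and the first codon is an alternative start,
    it is reported as M instead of its usual amino acid.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_start and codon in ALT_STARTS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: the leading GTG is read as M.
print(translate_cds("GTGCTCGACG TGATCCTGGT CGTCCTGTTG"))  # MLDVILVVLL
# Last row of the gene (in frame, since 1140 preceding bases is a multiple of 3).
print(translate_cds("GAGAACGACG AGCAGGTCTC CTCCCAGGCC TGCGACTGA", is_start=False))  # ENDEQVSSQACD
```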