Gene Ndas_5199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5199 
Symbol 
ID9249092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp343648 
End bp345570 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content76% 
IMG OID 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003683085 
Protein GI297564112 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.219359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.800514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACCCCA ACGAACCGAG TGCAGGTACC GAGCCCGGTG CGGCGGCCGG GAGCGGGCTT 
CCCGAGCACC GTCCCGCCGA GCCCGCCGAC CAGGCGGGGC CCTCCGGCCC GCACGAGGGG
GGCTCCTCCA CAGGTGACGG GACCGATGTC CGCGGCGCGT CCGGCGGGGA GCACGGGGGT
GTCCCCGCCG GACAGGACGA CCTGACCCAC GGGGCCGAGC ACGCCGGGCC CCAGCGGTCC
TTCGGGGGTT CCGAGTCGCA GGGGGCCGGC GGGGAGTCCG GGGCCGGGGA GCCGGGCGAG
CGCGTGGAGC GGCCCGCGCA GCACGAGCCC GGCGGGTACA CCGGCGTGTG GCAGAGCGGC
GAGCAGCCCG GCGGGCACCC CGGCGCCGAG CGGAGCGGCG GGTACGCTGC GGCCGGACAG
ACGGGGGAGG GATTCCCGTC CCAGCAGGGC GGACACCCGG GGGCCGAGCG GCACGGGTGG
CCGGAGACCG GCCCCGCGGG GACGGGGCCG ACCTTCTCCC CGCCCGGACA GCCGCCGAGG
TGGGCCACCG GGCCCAACGA CCCGGAGCAC GCCTCCTACA CCTTCCCGCC GCCCGGGGGC
GGCTACGGCG CCGCCGCAGG CGGCCAGCAC GAGCAGTTCT CCGCCCACCA CCCCGCTCCC
CCGCACGGCG GGCAGAACGG CCCCCACGGG CCGGTCGGGC ACGGGCAGCC GCCGTACGGT
GACGGCGGCG CGTTCGGGGC AGGCGGCCCC GGCGGTCCCG GGGACCACGG CGGCCACGGC
GGGCAGCCCC CGTTCGGCGG CGCCTTCCCC GGGTCGGTGC CCCCGCAGGG AAGCGGCGGG
AAACGCGGCT CCGGCAGGAT CGTGACCGTC GCCGCGATCA CCGCCCTGGT CACGAGCCTC
ATCGTGGGCC CGATGACGGC CCTGGGCACC GCCTACCTGT TCCCCAACGG CCTGAGCGGG
CCGATCAGCT CGCTCAACCA GGAGCAGGAG AGCACGCAGA CCGAGGGCGA GGTGGGCGAG
GTCGCCGACA CGGTCCTGCC GAGCGTGGTG TCCATCCGCA CCGCCAACGG CGGCGGCAGC
GGTGTGGTCA TCTCCTCCGA CGGCCAGATC CTCACCAACG CGCACGTCGT GGCCGCCGCC
GAGGGCGGTC CGATCGAGGT GCTGTTCAAT GACGGCAGCT CCGCGCGCGC CGAGGTCCTG
GGATCGGACC CGGTCTCCGA CATCGCGGTG ATCCAGGCGG AGGGGCGCAA CGACCTCACC
CCGGCCGCCC TCGGCGACTC CGAGCAGGTC GGCGTGGGCG CCGAGGTGGT CGCGATCGGT
TCCCCGCTGG GGCTGTCGGG CACGGTGACC ACGGGTGTGG TCAGCGCGCT GAACCGTCCG
GTGAACACCG GGCAGTCCGG GCAGACGTCC ACGGTGATCA ACGCGATCCA GACGGACGCG
GCGATCAACC CCGGCAACTC GGGCGGCCCG CTGGTGAACA TGAACGGCGA GGTCATCGGG
ATCAACACCG CGATCGCGGG CGTCTCGCAG GACAGCGGCT CGGTGGGGCT GGGCTTCGCC
ATCCCGATCA ACCAGGTCCG CCCCATCGCG GAGCAGCTGG TCGAGGACGG CAGCGCGAGC
TACCCGGCGA TCGAGGCGAC CATCACCAAC TCCCGCGTCG GCGGCGCGGA GATCGTGGAG
GTCACCGAGG GCGGCGCGGC CGCCGAGGCC GGGCTCCAGG CCGGTGACGT GGTGGTGTCC
GTGGACGGCG AGCAGGTGTC CACGCCGGAC GAGCTGATCG CGCAGATCCG GATCCGCCAG
CCCGGCGAGG AGGTGACCCT GGGGGTCGTC CCCGACGGCG GCAGCGGCTC CGAGGAGGAG
GTCACGGTGA CGCTCGGGGA GCAGAGCGTG GAGGCGGCCC AGAACGAGGA GGGCGGGAAC
TGA
 
Protein sequence
MNPNEPSAGT EPGAAAGSGL PEHRPAEPAD QAGPSGPHEG GSSTGDGTDV RGASGGEHGG 
VPAGQDDLTH GAEHAGPQRS FGGSESQGAG GESGAGEPGE RVERPAQHEP GGYTGVWQSG
EQPGGHPGAE RSGGYAAAGQ TGEGFPSQQG GHPGAERHGW PETGPAGTGP TFSPPGQPPR
WATGPNDPEH ASYTFPPPGG GYGAAAGGQH EQFSAHHPAP PHGGQNGPHG PVGHGQPPYG
DGGAFGAGGP GGPGDHGGHG GQPPFGGAFP GSVPPQGSGG KRGSGRIVTV AAITALVTSL
IVGPMTALGT AYLFPNGLSG PISSLNQEQE STQTEGEVGE VADTVLPSVV SIRTANGGGS
GVVISSDGQI LTNAHVVAAA EGGPIEVLFN DGSSARAEVL GSDPVSDIAV IQAEGRNDLT
PAALGDSEQV GVGAEVVAIG SPLGLSGTVT TGVVSALNRP VNTGQSGQTS TVINAIQTDA
AINPGNSGGP LVNMNGEVIG INTAIAGVSQ DSGSVGLGFA IPINQVRPIA EQLVEDGSAS
YPAIEATITN SRVGGAEIVE VTEGGAAAEA GLQAGDVVVS VDGEQVSTPD ELIAQIRIRQ
PGEEVTLGVV PDGGSGSEEE VTVTLGEQSV EAAQNEEGGN