Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5199 |
Symbol | |
ID | 9249092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 343648 |
End bp | 345570 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003683085 |
Protein GI | 297564112 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.219359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.800514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACCCCA ACGAACCGAG TGCAGGTACC GAGCCCGGTG CGGCGGCCGG GAGCGGGCTT CCCGAGCACC GTCCCGCCGA GCCCGCCGAC CAGGCGGGGC CCTCCGGCCC GCACGAGGGG GGCTCCTCCA CAGGTGACGG GACCGATGTC CGCGGCGCGT CCGGCGGGGA GCACGGGGGT GTCCCCGCCG GACAGGACGA CCTGACCCAC GGGGCCGAGC ACGCCGGGCC CCAGCGGTCC TTCGGGGGTT CCGAGTCGCA GGGGGCCGGC GGGGAGTCCG GGGCCGGGGA GCCGGGCGAG CGCGTGGAGC GGCCCGCGCA GCACGAGCCC GGCGGGTACA CCGGCGTGTG GCAGAGCGGC GAGCAGCCCG GCGGGCACCC CGGCGCCGAG CGGAGCGGCG GGTACGCTGC GGCCGGACAG ACGGGGGAGG GATTCCCGTC CCAGCAGGGC GGACACCCGG GGGCCGAGCG GCACGGGTGG CCGGAGACCG GCCCCGCGGG GACGGGGCCG ACCTTCTCCC CGCCCGGACA GCCGCCGAGG TGGGCCACCG GGCCCAACGA CCCGGAGCAC GCCTCCTACA CCTTCCCGCC GCCCGGGGGC GGCTACGGCG CCGCCGCAGG CGGCCAGCAC GAGCAGTTCT CCGCCCACCA CCCCGCTCCC CCGCACGGCG GGCAGAACGG CCCCCACGGG CCGGTCGGGC ACGGGCAGCC GCCGTACGGT GACGGCGGCG CGTTCGGGGC AGGCGGCCCC GGCGGTCCCG GGGACCACGG CGGCCACGGC GGGCAGCCCC CGTTCGGCGG CGCCTTCCCC GGGTCGGTGC CCCCGCAGGG AAGCGGCGGG AAACGCGGCT CCGGCAGGAT CGTGACCGTC GCCGCGATCA CCGCCCTGGT CACGAGCCTC ATCGTGGGCC CGATGACGGC CCTGGGCACC GCCTACCTGT TCCCCAACGG CCTGAGCGGG CCGATCAGCT CGCTCAACCA GGAGCAGGAG AGCACGCAGA CCGAGGGCGA GGTGGGCGAG GTCGCCGACA CGGTCCTGCC GAGCGTGGTG TCCATCCGCA CCGCCAACGG CGGCGGCAGC GGTGTGGTCA TCTCCTCCGA CGGCCAGATC CTCACCAACG CGCACGTCGT GGCCGCCGCC GAGGGCGGTC CGATCGAGGT GCTGTTCAAT GACGGCAGCT CCGCGCGCGC CGAGGTCCTG GGATCGGACC CGGTCTCCGA CATCGCGGTG ATCCAGGCGG AGGGGCGCAA CGACCTCACC CCGGCCGCCC TCGGCGACTC CGAGCAGGTC GGCGTGGGCG CCGAGGTGGT CGCGATCGGT TCCCCGCTGG GGCTGTCGGG CACGGTGACC ACGGGTGTGG TCAGCGCGCT GAACCGTCCG GTGAACACCG GGCAGTCCGG GCAGACGTCC ACGGTGATCA ACGCGATCCA GACGGACGCG GCGATCAACC CCGGCAACTC GGGCGGCCCG CTGGTGAACA TGAACGGCGA GGTCATCGGG ATCAACACCG CGATCGCGGG CGTCTCGCAG GACAGCGGCT CGGTGGGGCT GGGCTTCGCC ATCCCGATCA ACCAGGTCCG CCCCATCGCG GAGCAGCTGG TCGAGGACGG CAGCGCGAGC TACCCGGCGA TCGAGGCGAC CATCACCAAC TCCCGCGTCG GCGGCGCGGA GATCGTGGAG GTCACCGAGG GCGGCGCGGC CGCCGAGGCC GGGCTCCAGG CCGGTGACGT GGTGGTGTCC GTGGACGGCG AGCAGGTGTC CACGCCGGAC GAGCTGATCG CGCAGATCCG GATCCGCCAG CCCGGCGAGG AGGTGACCCT GGGGGTCGTC CCCGACGGCG GCAGCGGCTC CGAGGAGGAG GTCACGGTGA CGCTCGGGGA GCAGAGCGTG GAGGCGGCCC AGAACGAGGA GGGCGGGAAC TGA
|
Protein sequence | MNPNEPSAGT EPGAAAGSGL PEHRPAEPAD QAGPSGPHEG GSSTGDGTDV RGASGGEHGG VPAGQDDLTH GAEHAGPQRS FGGSESQGAG GESGAGEPGE RVERPAQHEP GGYTGVWQSG EQPGGHPGAE RSGGYAAAGQ TGEGFPSQQG GHPGAERHGW PETGPAGTGP TFSPPGQPPR WATGPNDPEH ASYTFPPPGG GYGAAAGGQH EQFSAHHPAP PHGGQNGPHG PVGHGQPPYG DGGAFGAGGP GGPGDHGGHG GQPPFGGAFP GSVPPQGSGG KRGSGRIVTV AAITALVTSL IVGPMTALGT AYLFPNGLSG PISSLNQEQE STQTEGEVGE VADTVLPSVV SIRTANGGGS GVVISSDGQI LTNAHVVAAA EGGPIEVLFN DGSSARAEVL GSDPVSDIAV IQAEGRNDLT PAALGDSEQV GVGAEVVAIG SPLGLSGTVT TGVVSALNRP VNTGQSGQTS TVINAIQTDA AINPGNSGGP LVNMNGEVIG INTAIAGVSQ DSGSVGLGFA IPINQVRPIA EQLVEDGSAS YPAIEATITN SRVGGAEIVE VTEGGAAAEA GLQAGDVVVS VDGEQVSTPD ELIAQIRIRQ PGEEVTLGVV PDGGSGSEEE VTVTLGEQSV EAAQNEEGGN
|
| |