Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4790 |
Symbol | |
ID | 9248673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5677263 |
End bp | 5678732 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Propeptide PepSY amd peptidase M4 |
Protein accession | YP_003682680 |
Protein GI | 297563706 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCCC ATGCCCCCGA CCGGCTCGAC TCCCCCGAGA CGCCTCCTCC GGACCGCCGC TCCCCCGTCT GGGCGGCCCT GCGCCAACTC CTCCTACGCC TGCACTTCTA CGCCGGAGTC CTCGTCGCGC CGTTCATCGC CGTGGCCGCC CTCACCGGAC TGCTGTACGC GTACGCGCCG CAGCTGGAGC AGGTGCTCTA CGCCGACCAG CTCCGCGTCA CCCCCGGCGA CAGCGCGCTG CCCCTCCAGG AACAGGTGGA CACCGCGGTC GCGGCACGTC CCGAGGGAAC GCTGAGCGCC GTGCGCCCGG GCACGGCCCC CGACGAGAGC ACCCGCGTCC TGCTGGACGT GGAGGGGCTC CCCGAGAGCT ACCGGCTCGC CGTCTTCGTC GACCCCTACA ACGGCGAGGT CCTCGGCGAG ACGACCAGCT ACGGCAGCAG CGGCGCGCTG CCCGTGCGGT CGTGGCTGTC CGAACTCCAC CGTCACCTGC ACCTGGGTGA GTTCGGCCGC CTGTACGGCG AACTCGCCGC GAGCTGGCTG TGGGTGGTCG CTCTGGGCGG GGTCGTGCTG TGGACCGCGC GCCGGCGCAA GGCCCGCCGC CTGCGCCGCA CCCTGCTGCC CGAGCCCTCC GCCAAGGGGC GGAACCGGAC CATGTCCTGG CACGGGTCGC TCGGCATCTG GGCCGTAGCC GGCCTGCTGA TGGTGTCCGC CACCGGACTG ACCTGGTCCC GGTTCGCCGG GGCCAACGTG ACCGACATGC GGGTGGCCCT CGGCTGGACC ACGCCGTCGC TGTCGGCGCC CGCCCCGTCG GAGCACGCGG ACCACGAGGA GCAGGCGGAC CACGCGGGGC ACGAGGACCA CGGCGGGCAC GGCGACCACG CGGACCACGG CGCGGCCGGA CAGGACGTGG ACCTGGACAC GGTCCTGGCG TCCGTGCACG CCGCGGGGAT CGACGGCCCG ATGGAGATCT CGGTGCCCGT GGAGGAGGGC GCGCCCTTCA CCGTCCAGGG AACCGGGCGG AGCTGGCCCG TGCACCAGGA CGCGGCGGCC GTGGACGCGA CGAGCGGCGA GGTGGTCGAG GAGCTGCGGT TCGAGGACCA TCCCTTCGTC GCCAAGATGG CCACGTGGGG GATCGCCTTC CACATGGGTC TGCTGTTCGG CCTGCCGAAC CAGCTCCTGC TGACGGGGAT CGCCCTGTCC GCGCTCTTCC TCGTGTTCTG GGGCTATCGG ATGTGGTGGC TGCGCCGCCC GACCCGGGAC ACCGCGTTCG CCATGGGACG GCCGCTCGCC CCGCGCGGGA CCTGGCGGGG GCTGCCGTGG TGGTGCCTGG CCCTGGTCGC CGCGGCGGCG GTGGGCGTCG GCCTGTTCCT GCCGGTGTTC GGGGTCTCCC TGCTGGCCTT CCTCGTGGTG GACGCGGCCC TGGGCCTGCG GCGCGGGCGC ACGCGCCCGG AGTCCGTCCC GAAGCCGTGA
|
Protein sequence | MTSHAPDRLD SPETPPPDRR SPVWAALRQL LLRLHFYAGV LVAPFIAVAA LTGLLYAYAP QLEQVLYADQ LRVTPGDSAL PLQEQVDTAV AARPEGTLSA VRPGTAPDES TRVLLDVEGL PESYRLAVFV DPYNGEVLGE TTSYGSSGAL PVRSWLSELH RHLHLGEFGR LYGELAASWL WVVALGGVVL WTARRRKARR LRRTLLPEPS AKGRNRTMSW HGSLGIWAVA GLLMVSATGL TWSRFAGANV TDMRVALGWT TPSLSAPAPS EHADHEEQAD HAGHEDHGGH GDHADHGAAG QDVDLDTVLA SVHAAGIDGP MEISVPVEEG APFTVQGTGR SWPVHQDAAA VDATSGEVVE ELRFEDHPFV AKMATWGIAF HMGLLFGLPN QLLLTGIALS ALFLVFWGYR MWWLRRPTRD TAFAMGRPLA PRGTWRGLPW WCLALVAAAA VGVGLFLPVF GVSLLAFLVV DAALGLRRGR TRPESVPKP
|
| |