Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2877 |
Symbol | |
ID | 9246728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3435846 |
End bp | 3436949 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_003680794 |
Protein GI | 297561820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.129029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACGA GTACGTCCCC GCGCCGGGAC CATCTGGAGT TCGCCGCCGC GCTGGACCGG GTGCTGACCC GGCAGGGGGA ATCGCACGTG TGGTCCCCCC ACTCGGTGGG CACGGTGCTG GCGCTGCTGG CCACCGGCGC CCGGGAGCGC ACCCTGGCCG AGCTGGAGGC GCTGCTGGGC GCGGACGTGC GGGGCCAGCT GGAGGCGCTG GACGCGGCCG TGGCCGCCGA GCCCGGACTC GACCTGGCCA GCCTCAACGG CCTGTACGTG CCCGCCGACC TGGAGGTCCT GCCGGGTTTC ACGTCCCGGG TGCGCGAGCG CGCCGGGGCC GAGGTGGAGC ACGCCGACTT CGAGCACGAC TCCGAGGGCG TGCGCTCACG GGTCAACGCC CGTGTGGCGG AGGTGACCCG CGGCCTGATC GAGGAACTGC TCCCCGCGGG CAGCGTCCAC CCGGGCGTGC GGCTGCTGCT GGTCAACGCG CTGTGGGTGA AACTGGCCTG GCCAGACCCC TTCGACCCCG CCCGGACCCG GGACAGGCCC TTCCACGCAC CGTCGGGCAG GCGGAAGGTG CCCACCATGC ACCGTTCGGC CCGTCTGCCG CACGCGCGGG CCCGCGGGTG GAGCATGGTG AGCCTGGAGG GCGACCACGG CCTGACCCTG GACGTGCTGC TGCCCGACGA GCGCTCCGCC GCTCCCGCCC CGGTGACGGC CGACGCGCTC GCCGACCTGT ACGCCCACCG CTCCTCCCAG CAGGTGGACC TGGCCCTGCC GCGCTTCGAG GTGGAGACCG ACACCTCTCT GCTCGAACCG CTGGCGGCGC TGGGCGTGCG CGACCTGGCC ACGGACGAGG CCCGCTTCGA CGGGATCAGC CCGGAACCGC TGCGCGCCGA CGAGATCCTG CACCAGTCGG TGCTGCGGGT GGACGAGAAG GGCGCCGAGG GCGCCGCCGC GACCGCCGTG ATGATGCTCC GGGCGGCCGC GGTGGCGCCG AGGCCGGTCG CGTTCACCGT GGACCGGCCG TTCGTGTTCG TGCTGCGCCG CGGCGGGGCG GTCCTGTTCC TGGGCCGCGT CACCGACCCC GTGGACCCGG GCCCCGCCTC CTGA
|
Protein sequence | MQTSTSPRRD HLEFAAALDR VLTRQGESHV WSPHSVGTVL ALLATGARER TLAELEALLG ADVRGQLEAL DAAVAAEPGL DLASLNGLYV PADLEVLPGF TSRVRERAGA EVEHADFEHD SEGVRSRVNA RVAEVTRGLI EELLPAGSVH PGVRLLLVNA LWVKLAWPDP FDPARTRDRP FHAPSGRRKV PTMHRSARLP HARARGWSMV SLEGDHGLTL DVLLPDERSA APAPVTADAL ADLYAHRSSQ QVDLALPRFE VETDTSLLEP LAALGVRDLA TDEARFDGIS PEPLRADEIL HQSVLRVDEK GAEGAAATAV MMLRAAAVAP RPVAFTVDRP FVFVLRRGGA VLFLGRVTDP VDPGPAS
|
| |