Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4224 |
Symbol | |
ID | 9248098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5042228 |
End bp | 5043334 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | PTS system, mannitol-specific IIC subunit |
Protein accession | YP_003682122 |
Protein GI | 297563148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.110658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC AGCAGAGCAC GGACAGACTC GCCTCGGTGC GCTCCGGAGT ACAGCGCTTC GGAGGGTTCC TGTCGAGCAT GGTGATGCCC AACATCGGCG CGTTCATCGC CTGGGGCCTG ATCACCGCCC TGTTCATCCC CGACGGGTGG TGGCCCAACG AGCAGATGGC CGGGCTCGTC GACCCGATGA TCAAGTACCT GCTGCCGCTG CTGATCGCCT ACACCGGCGG CGCGCTGGTG CACGACAGGC GCGGCGGCGT GGTCGGCGCG GCCGCGACCA TGGGCGTGAT CGTCTCCGCG GACATCCCCA TGTTCCTGGG CGCGATGTTC ATGGGTCCGT TCGCGGCCTA CCTCATGAAG CACTTCGACC GGGTCGTCCA GCCGCGCATC AAGGCCGGCT TCGAGATGCT GGTCAACAAC TTCAGCGCCG GCATCCTCGC CGCGATCCTG GCCGCCCTGG GCGTCTACGC GGTCGGACCG GTCGTGGAGG GCATCGCCAC CGGCCTGGGC AAGGGCGTGC AGTTCCTCAT CGACCTGAGC CTGCTGCCGC TGGTCTCGGT CATCGTCGAG CCCGCCAAGG TGCTGTTCCT CAACAACGCC ATCAACCACG GCGTCTTCAC CCCGCTGGGC ACGGCCCGCG CGGTCGCCGA CGGCAGGGCC ATCGAGTTCC TCATCGAGTC GAACCCCGGA CCGGGCCTGG GCATCCTGCT GGCCCTGATG TTCTTCGGCT CCAAGGTCAG CCGCGCCACC GCGCCCGGCG CGGCCGTCAT CCACTTCTTC GGCGGGATCC ACGAGATCTA CTTCCCGTAC ATCCTCGCCC AGCCGAAGCT GATCCTCGCC GCGATCGGCG GCGGTATGTC CGGCGTCGCG ACCTTCATGA TCATGGACGC CGGGCTCGTC TCCGCCGCCT CCCCCGGCAG CATCATCGCG ATCATGGCGG TCACCCCGCA GGGAGGCCAC CTGTCGGTCC TGGCCGGGGT CGTCGCCGCC ACCATCGTCT CCTTCGTCAT CGCCTCGCTC CTGCTCGGCT TCGGCCGGTC CGAGCGCAAG GCCGAGCGCG AGGAGAAGGC CAAGCAGGAA GCCGCTCAGA ACCAGGAGAA CAGCTGA
|
Protein sequence | MTTQQSTDRL ASVRSGVQRF GGFLSSMVMP NIGAFIAWGL ITALFIPDGW WPNEQMAGLV DPMIKYLLPL LIAYTGGALV HDRRGGVVGA AATMGVIVSA DIPMFLGAMF MGPFAAYLMK HFDRVVQPRI KAGFEMLVNN FSAGILAAIL AALGVYAVGP VVEGIATGLG KGVQFLIDLS LLPLVSVIVE PAKVLFLNNA INHGVFTPLG TARAVADGRA IEFLIESNPG PGLGILLALM FFGSKVSRAT APGAAVIHFF GGIHEIYFPY ILAQPKLILA AIGGGMSGVA TFMIMDAGLV SAASPGSIIA IMAVTPQGGH LSVLAGVVAA TIVSFVIASL LLGFGRSERK AEREEKAKQE AAQNQENS
|
| |