Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1552 |
Symbol | |
ID | 9245402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1902056 |
End bp | 1903105 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_003679487 |
Protein GI | 297560513 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.233578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.142881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC CCTCGCATGC CCCCGATTCG GCGGAGCAGG CGGAGCAGAA CGACGCCGTG AACGTCGCTG CGCAGGCCCA CGGGGTCCGC GGAGACTCCG ACAACCGGTC CCTGCGCCAG ATCGCCTGGC AGCGGCTGCG CAAGGACAAG GTCGCGATGG TCTCCGGCGT GGTCGTGGTC CTGCTCATCC TCGCCGCGAT CCTCGCGCCC CTCCTGGCCA AGTGGTTCGG GCATCCGCCC ACCCAGTTCC ACCAGGACCT GATCGAGCCC GGCACCGGCC TGCCGGCCAA CGACCCGGCC AACCCGAGCC CGTTCGACAC CGACCCCTGG GGCGGTATCA GCGCCGACCA CCTGCTCGGC GTGGAGCCGG TGACCGGACG CGACCTGTTC AGCCGCATCC TCTACGGCGC CCAGATCTCC CTGCTGGTGG CCTTCCTGTC CACGCTGCTG TGCGTGTTCA TCGGCACTGT CCTGGGCATC GTCGCCGGGT ACAAGGGCGG CTGGGTCGAC ACCCTCATCA GCCGGGCCAT GGACATCTTC CTGGCCTTCC CGCTGATGCT CTTCGCCATC GCGCTCGTGG GCGTCATCCC CGACGGCGTC CTGGGCCTGA GCGGCAACGG CCTGCGCATC GGCGTCATCG TCTTCATCAT CGGCTTCTTC AACTGGCCCT ACATCGCGCG CATCGTCCGG GGGCAGACGC TCTCGCTGCG CGAGCGGGAG TTCGTGGAGG CCGCCAGGAG CCTGGGCGCC AGCAACCGGC ACATCCTCTT CCGGGAGATC CTGCCCAACC TGGTCACGCC GATCATCGTC TACTCGACCC TGCTCGTCCC CACGAACATC CTGTTCGAGG CGGCCCTGAG CTTCCTGGGC GTCGGTATCA ACCCGCCCAC GCCGAGCTGG GGCAAGATGC TCTCCGACGC GGTGCCGCTG TACGAGAAGG CGCCCTACTT CGTGGTCTTC CCGGGTCTGG CCATCTTCAT CACCGTCCTG GCGTTCAACC TGTTCGGCGA CGGGCTGCGC GACGCCTTCG ACCCCAAGAC CTCCGACTGA
|
Protein sequence | MSAPSHAPDS AEQAEQNDAV NVAAQAHGVR GDSDNRSLRQ IAWQRLRKDK VAMVSGVVVV LLILAAILAP LLAKWFGHPP TQFHQDLIEP GTGLPANDPA NPSPFDTDPW GGISADHLLG VEPVTGRDLF SRILYGAQIS LLVAFLSTLL CVFIGTVLGI VAGYKGGWVD TLISRAMDIF LAFPLMLFAI ALVGVIPDGV LGLSGNGLRI GVIVFIIGFF NWPYIARIVR GQTLSLRERE FVEAARSLGA SNRHILFREI LPNLVTPIIV YSTLLVPTNI LFEAALSFLG VGINPPTPSW GKMLSDAVPL YEKAPYFVVF PGLAIFITVL AFNLFGDGLR DAFDPKTSD
|
| |