Gene Information |
Locus tag | Ndas_5564 |
Symbol | |
ID | 9249467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 764528 |
End bp | 765568 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_003683449 |
Protein GI | 297564476 |
COG category | |
COG ID | |
TIGRFAM ID | |
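
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_5564.
length = gene_length(764528, 765568)
print(length, protein_length(length))  # prints: 1041 346
```

Both results agree with the Gene Length and Protein Length rows of the record.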
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.271716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.315175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGAGC GGACCATGAC CAAGGGCGAA CCGCCCTCCC CGGCCGACGC GTCCCGCGGG CGCGGGGGCC GGGGGCGGTC CCCGGCCGTC CCGGACCGCC GGGGCGGGTC CGCGTCCCCG AGCCGCAGGC GGGTCCCGTT CGGTGCGAGG CTGCGCCGCG ACTGGCAGCT GCTGCTGATG ACGGTCCCGG CGATCGGCCT GCTGGCGGTG TTCCACTACA CGCCGACCCT CGGCAACATC ATCGCCTTCC AGGACTACAA CCCCTGGGAC GGGGTGTGGG GCAGCCCGTG GGTGGGGCTG GCGCACTTCG AGCGTCTGTT CACCGACCCC CGCTTCTGGT CCGCGGCGGG CAACACGCTG GTCATCGCCG CCGTCCAATT GGTGTTCTTC TTCCCCATCC CGATCGCGCT GGCCATCCTG CTGGACAGCG TCCTCAGCCC CAGGCTGCGG ATGGTGCTCC AGAGCATCGT GTACATGCCG CACTTCTTCT CGTGGGTCCT GGTCGTCACC CTGTTCCAGC AGATCCTGGG CGGCGCCGGA CTGTTCTCGC AGATCCTGCG GCAGAACGGG TACGCGCCGC TGGAGGTCAT GTCCGATCCC GACGCGTTCC TGTTCGTGGT CACCTCCCAG ATGGTCTGGA AGGACGCCGG GTGGGGCACG ATCATCTTCC TGGCGGCGCT GGCGGCCGTG AACCAGAACC TCTACGAGTC CGCGGCCGTG GACGGCGCCG GGCGATGGCG GCGGATGTGG CACATCACCC TGCCGGGCCT GCGCCCGGTG ATCGTCCTGC TGCTCATCCT CAAGATCGGC GACATCCTCA ACGTCGGCTT CGAGCAGTTC TACCTCCAGC GCGACGCGTT CGGATCGGGC GTGTCGGAGG TGCTGGACAC CTTCATCTAC CACCAGACCC TGGTGACGGG GAACTTCAGC GCGGGAGCGG TCGCGGGCCT GGTCAAGGGC GTGGTCGGAC TGGTCCTCAT CGTTCTGGCC AACAAGCTGG CCCACAAGAT GGGTGAGAAC GGAATCTACC GACGAGCATG A
|
Protein sequence | MTERTMTKGE PPSPADASRG RGGRGRSPAV PDRRGGSASP SRRRVPFGAR LRRDWQLLLM TVPAIGLLAV FHYTPTLGNI IAFQDYNPWD GVWGSPWVGL AHFERLFTDP RFWSAAGNTL VIAAVQLVFF FPIPIALAIL LDSVLSPRLR MVLQSIVYMP HFFSWVLVVT LFQQILGGAG LFSQILRQNG YAPLEVMSDP DAFLFVVTSQ MVWKDAGWGT IIFLAALAAV NQNLYESAAV DGAGRWRRMW HITLPGLRPV IVLLLILKIG DILNVGFEQF YLQRDAFGSG VSEVLDTFIY HQTLVTGNFS AGAVAGLVKG VVGLVLIVLA NKLAHKMGEN GIYRRA
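
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string might be cleaned and sanity-checked is shown below. The helper names are hypothetical, the start-codon set is only a subset of the initiators usually listed for translation table 11, and the example strings are short fragments copied from the beginning and end of the gene sequence rather than the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a common subset of bacterial start codons, not the full table 11 list.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def terminal_codons(seq: str) -> tuple[str, str]:
    """Return the first and last three bases of the cleaned sequence."""
    s = clean(seq)
    return s[:3], s[-3:]


# First two display blocks and the final blocks of the gene sequence in the record.
head = "GTGACTGAGC GGACCATGAC"
tail = "GACGAGCATG A"

first, _ = terminal_codons(head)
_, last = terminal_codons(tail)
print(first in COMMON_START_CODONS, last in STOP_CODONS)  # prints: True True
print(gc_fraction(head))  # GC fraction of the 20-base fragment only
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content row of the record.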