Gene Information |
Locus tag | Ndas_4000 |
Symbol | |
ID | 9247872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4783872 |
End bp | 4784828 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_003681903 |
Protein GI | 297562929 |
COG category | |
COG ID | |
TIGRFAM ID | |
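
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and both inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4783872
END_BP = 4784828


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates with both ends included."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 957 318
```

Under those assumptions the result agrees with the reported Gene Length (957 bp) and Protein Length (318 aa).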
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.224944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.135837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGCTG TGACCGCGCC CGCGCGCGAG GCGCCCGCGG CGGCGACGCC GCCCGAGCGG CCGAGGCGGA GCCGCGGCGG GGGCGGATGG GCCCGCCGGG CCCCGCTCCT GCCCGCCCTG GTGTTCATGC TGGTCGTCAC GCAGCTGCCG TTCCTGGCGA CGGTCGTGTA CTCGCTGCGC TCGTGGAACC TGCTGCGGCC CGACTCCCAG GCGTGGGTGG GCCTGGCCAA CTACGCGGCG GTCTTCACCG ACCGGCAGTT CCTGGGCGCG GCGGCGAACA CGGTGCTGAT CACCGCCTCC TGCGTGGTGG TGGCGATGCT GCTGGGCATC GGGCTCGCGC TGCTGCTGGA CCGGAAGTTC AGAGGGCGAG GCGTGGTGCG CACGCTCGTC ATCACCCCGT TCCTGATCCT GCCCGTGGCC ACGGCGCTGC TGTGGAAGCA CATCATGCTG GAGCCGGTGT TCGGCCTGGT GAACTTCGTG CTGTCGCCGT TCGGGGTCGA GTCGTTCGAC TGGGTCTCCC AGGCGCCCGT GTTCTCCGTG GTGGCCGCGC TGGTCTGGCA GTGGACGCCG TTCATGATGC TGCTGGTGCT GGCGGGACTC CAGAGCCAGG GCTCCGACGT ACTGGAGGCG GGCCAGGTCG ACGGCGCGTC CCGGTGGCAG ACGTTCGCCT GGATCACCCT GCCGCACCTG CGCCGCTACA TCGAGCTGGG CGTGCTGCTG GGGTCGGTCT ACGTGGTCAA CACCTTCGAC ACGATCTACA TGATGACCCA GGGCGGGCCG GGCACCGCCA GCTCCAACCT GCCCTTCTAC GTCTACCAGC GGACCTTCCT CGGGTTCGAC ATCGGCCAGT CCGCGGCCAT GGGCGTCGTG GTGGTGGTCG GCACCATCAT CGTGGCGACG CTGGCCCTGC GGCTGATCTT CCGCACGTTC ATGAACGCAC AGGAGGCGAC ATCGTGA
Protein sequence | MSAVTAPARE APAAATPPER PRRSRGGGGW ARRAPLLPAL VFMLVVTQLP FLATVVYSLR SWNLLRPDSQ AWVGLANYAA VFTDRQFLGA AANTVLITAS CVVVAMLLGI GLALLLDRKF RGRGVVRTLV ITPFLILPVA TALLWKHIML EPVFGLVNFV LSPFGVESFD WVSQAPVFSV VAALVWQWTP FMMLLVLAGL QSQGSDVLEA GQVDGASRWQ TFAWITLPHL RRYIELGVLL GSVYVVNTFD TIYMMTQGGP GTASSNLPFY VYQRTFLGFD IGQSAAMGVV VVVGTIIVAT LALRLIFRTF MNAQEATS
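
The two sequence fields are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the sequences are pasted as space-separated 10-character blocks as shown above, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGAGCGCTG TGACCGCGCC CGCGCGCGAG"))  # prints: MSAVTAPARE
```

Passing the full Gene sequence field to `translate` and stripping the spaces from the Protein sequence field allows a direct comparison of the two, and `gc_fraction` on the same input can be checked against the reported GC content.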