Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1987 |
Symbol | |
ID | 9245837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2407986 |
End bp | 2409644 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | putative integral membrane protein |
Protein accession | YP_003679919 |
Protein GI | 297560945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.285044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000714658 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAATTACT TTTATCGCCT GTGGGGTGCG CTACGCCGCA CCCTCACCAC CCGCGGGGCC TTCGCGCCCA GGCCCGTGCG CGGCAACGTC GCCGTCACCG CGCTGCGCGC CGGAGTGTGC GTGGCCCTGC CCCTGCTGGC GCTGCACGCC GCGGGGCGCA TCGACCTGGC CCCCTACGCC GCCATGGGCT GCTTCACCGC CCTGTACGCG CGCGACGACA CCTACGCCCG CCGCGCCCGG CTCCTGGCCG TGGTCGGCGC GACCCTGACC CTGGCCGTGG CCCTGGGCGC GCTGGTCTCG GCCGCGCTCC CCCACCCCCT GGCCGCCATC ACGGTCGTGG CCCTGGTCGC CGGAGGGGCC AAGTACCTCT CCGACGCCCT GGAGTTCGGC GCCCCGGCCG GGCTGATGTT CGTCTTCGCC GCCGGGGTGG CCGCCTACAA CCCCCAGCCC CTGACCGCGC TGCCGGTGGT GGCGGCCACC ACCGCGGCCG CGGCCGCCCT GTGCTGGGCC CTGGCCCTGG TCGGCGCCCT GGTCCATCCC ACCGCCCCCG AACGCCTGGC GGTCGCCCGC GCCCTGCACG CCCTCGCCCG CCACCTGCGC CACCCCGCAC CCCCCGCCCG CACCGGTGCC GAGACCGCCC TCCACCAGGC CTGGCACGTC CTGCTGTCCT CCCCCGGGCA CACCCCGACC CGCCAGGCCC TGGAGGTCCT CACCGCCCGC GCCGAAACCC TGCTCACCGG AACCGGAGAC CGCGTCACCG ACGCCCGCGC CGCCGCCGAG ATGAGCGAAC TGGCCCGCCG CCTGCGCACC GAACGCGCCG TCGAACCCCT GGTCGGCGCC GACGAGCACG CCGCCCTGTC CCAGGCCGCC GCACACCTGC GCCGCCACCA CGCACCCCCG CCCGGGGCGG TCGAGCGCCT GCGCGCCGCA CTGCGCGCGC CCTCGCCCAC CCCGGTGTCG GTGGTGCGCA TCGTCCTGGC CTGCCTGCTG GCCGGTGCCC TGGCCTGGGC GCTGGGCATG GGACACGGCT ACTGGGCCGC CGTCTCGGCC GGATCGGTCC TGCAGGCCGT CAACGTCACC ACCACCTGGC ACCGCACCCT GCAACGCGGC GCGGGCACGG TCGTGGGCGT GGCGCTGGCC GCCGTGCTCT TCAGCCTGGA CTACTCCCCG CTGGGCGTCA TCGCGCTGGT GGTGGCGTGC CAGATGGCGG CCGAACTGGT GGTGACCACC AACTACTCCT ACGCCATCGT GTTCGTCACC CCCCTCACCC TGGCCCTGTC GGGCCTGGCC GACGCCGACC CGGGCGCGGA CGGGCTGGCG GCCGAGCGGC TGTGGGCCAC CGTGCTGGGA GCCGCGGTCG GGCTCCTGGT GTGCGCGGCG GTCGCCAACC GCCGCGCCGG GGACCACCTG AGCCAGGCCC TGGCCGCGTG CGAGCGCGAA CTCGCCCGGG CGCGGGAGTG CGCGGGCGCC GACGCGGCCG CGCACCGACG CCTGGCGCGC AGCCTGGTGG CGCTGCGCGC CTCCCACGCC CTGGCCGAGG GGGAGCCGTG GCTGGCCGGG GCGTGCGCGC GGGAGGTCCA GGACGTGGAA CGCCGCGCCC GCGCCCTGCT GGACGGGGCC GCCCTGCCCG GCCGAGCGGT CGGCGCCCTT CCGGAGTGA
|
Protein sequence | MNYFYRLWGA LRRTLTTRGA FAPRPVRGNV AVTALRAGVC VALPLLALHA AGRIDLAPYA AMGCFTALYA RDDTYARRAR LLAVVGATLT LAVALGALVS AALPHPLAAI TVVALVAGGA KYLSDALEFG APAGLMFVFA AGVAAYNPQP LTALPVVAAT TAAAAALCWA LALVGALVHP TAPERLAVAR ALHALARHLR HPAPPARTGA ETALHQAWHV LLSSPGHTPT RQALEVLTAR AETLLTGTGD RVTDARAAAE MSELARRLRT ERAVEPLVGA DEHAALSQAA AHLRRHHAPP PGAVERLRAA LRAPSPTPVS VVRIVLACLL AGALAWALGM GHGYWAAVSA GSVLQAVNVT TTWHRTLQRG AGTVVGVALA AVLFSLDYSP LGVIALVVAC QMAAELVVTT NYSYAIVFVT PLTLALSGLA DADPGADGLA AERLWATVLG AAVGLLVCAA VANRRAGDHL SQALAACERE LARARECAGA DAAAHRRLAR SLVALRASHA LAEGEPWLAG ACAREVQDVE RRARALLDGA ALPGRAVGAL PE
|
| |