Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1685 |
Symbol | |
ID | 9245535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2057488 |
End bp | 2059137 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Oligopeptide transporter OPT superfamily protein |
Protein accession | YP_003679620 |
Protein GI | 297560646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0732845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.334429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCGT CGGCACCTGG TTCGTCCACC GAACCGCAGA CCCCGGGAAG CGCCCGCCAC CCGCGCGCGT TCGAACCCGT CGTCGTCATC GTCACCGTCC TGGTGAGCCT CCTCGGGGCG GTGATCGGCA TCCACATGAT CACGACGCTC GGGGTCTCGC CCAACACCAG CGTCATCGGC GCGGTCGTGG CCATGCTCAT CGGCCGGATC GGGTTCCTGG GGCTGCGCTC GATGCGCAAC ACCAACCGGC AGAACCTCAT CCAGTCCTCG ATCTCCGGCG CGACCTTCGC CTCGGCGAAC TCCCTGCTCA CCCCGATCGC CATCCCGTTC CTCTTCGGCC GCCCCGACCT GGTGTGGCCG ATGCTGCTGG GCGCGTCCCT GGGCCTGCTC ATCGACGTGT TCGTGCTCTA CAAGGCGTTC GGCTCCCGGT TCCTGCCCGC CGACGCGGCC TGGCCGCCCG GGGCGGCGGC GGCCGAGACC ATCAAGGCCG GTGACCGGGG CGGACGCCAG GCGGCCATCC TGGTGGGCGG CGGCGCGGTC GGGCTCGGCG CCTCCTTCCT GGGCATGCCG ATGTCGGCGG CGGGCATCGC CATGATCGGC AACGTCTGGG CCCTGCTCAT GTTCGCCGTG GGCCTGCTCG TCGCCCAGTA CTCCCCGGCG GTCATCGGGA TCGACCTCAA CTCGATCTAC GTGCCGCACG GCGTCATGAT CGGCGCGGGC GTGGTCGCGC TGGCGCAGAT CGTGGTCATC CTCGCCGGCC GCCAGAGCCG CAGGGAGAGG GAGCGCGAGG CCGCCCGCGA CCGCGCCGCC CAGGACGACC CGTCCCTGGC CTACACCGTG GACCGCGCCA CCCTGGGCCG GGCCCTGGGC TCGGGCTACG TGCTGTTCGC CCTCGGTGCC CTCGTGCTCG CGGTCACCGG CGGGATCTGG GCGGACATGA GCTGGCTGGG CATCCTCGGA TTCGTCCTGT TCGCCGCCGT GGCCGCCCTG GTCCACGAAC TCATCGTCGG CCTGGCCGCC ATGCACGCGG GCTGGTTCCC CGCCTTCGCG GTCACCCTCA TCTTCCTCAT CCTCGGCCTG GCGCTGGGCA TCCCCGGGGT GCCGCTGGCC CTGCTCGTGG GCTACTGCGC GGCCACCGGC CCCGCCTTCG CGGACATGGG CTACGACTTC AAGGCCGGGT GGGTGCTGCG CCGCGACCGC CGCCCCTACA CCGCCTTCGA GCTCGACGGA CGCCGCCAGC AGCTCATCTC CTCCATGATC GGGTTCGCCG TCGCCATCGG CATGGTCGCG CTGCTCTGGC AGGGCCTGTT CGAGGACGGC GCCGTGCCGC CCACCTCGAT CGTCTACGCC GACACCATCA AGGCCGGGCT GAGCGACCCC TCCGTCCTGC TCCAGCTCGC CCTGTGGGCC GTGCCCGGCG CGATCGTGCA GCTCCTGGGC GGCCCCCGGC GCCAGATGGG CGTCCTGCTC GCCACCGGCC TGCTCGTGGC CACGCCCAAC GCCGGATGGC TCGTCCTGGC CGGGCTGGCG ATCCGCCTGG TGTGGGAGCG CCGCCGCGGC GAGAAGGGCG AGCAGGAGAT CGCCCTGGTC GGCGCCGGGC TCATCGCCGG GGACTCCGTC CACTCCGTCG GCACCGTCTT CAGCCGCTGA
|
Protein sequence | MEPSAPGSST EPQTPGSARH PRAFEPVVVI VTVLVSLLGA VIGIHMITTL GVSPNTSVIG AVVAMLIGRI GFLGLRSMRN TNRQNLIQSS ISGATFASAN SLLTPIAIPF LFGRPDLVWP MLLGASLGLL IDVFVLYKAF GSRFLPADAA WPPGAAAAET IKAGDRGGRQ AAILVGGGAV GLGASFLGMP MSAAGIAMIG NVWALLMFAV GLLVAQYSPA VIGIDLNSIY VPHGVMIGAG VVALAQIVVI LAGRQSRRER EREAARDRAA QDDPSLAYTV DRATLGRALG SGYVLFALGA LVLAVTGGIW ADMSWLGILG FVLFAAVAAL VHELIVGLAA MHAGWFPAFA VTLIFLILGL ALGIPGVPLA LLVGYCAATG PAFADMGYDF KAGWVLRRDR RPYTAFELDG RRQQLISSMI GFAVAIGMVA LLWQGLFEDG AVPPTSIVYA DTIKAGLSDP SVLLQLALWA VPGAIVQLLG GPRRQMGVLL ATGLLVATPN AGWLVLAGLA IRLVWERRRG EKGEQEIALV GAGLIAGDSV HSVGTVFSR
|
| |