Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4076 |
Symbol | |
ID | 9247948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4874248 |
End bp | 4875882 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | glycosyl transferase family 39 |
Protein accession | YP_003681978 |
Protein GI | 297563004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.639567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.932979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA CCGCACCTTT CTCCGAGGCA GAGCCCTCGC CCCCGCCGCG GTCGGCGCGG ATACGCGCCC GCTTCCTGCC CGTCAACCCC GCGCCCCGCT GGCTGGGCTG GCTGGGCGCC TTCGCCGTCG CGCTGTTCGC CGGGGTGCTG AGGTTCTTCA ACCTCGGCCA GCCCGACCGG ATCTACTTCG ACGAGACCTA CTACGCCAAG GACGCCTACG GGCTCTGGAA CTTCGGCTAC GAGCACGAGA CCGTCGAGGA GCCCGTCGAG GCCGACGACC TGCTCGCCCA GGGGTACCAG GACATCTTCA CCGGCACGGG CGACTTCATC GTGCACCCGC CGGTGGGCAA GTGGATGATC GCCCTGGGCG ACTGGCTGTG GTCGCTCCTG CCGTTCGGCA CGACCATGAC CCCCGAGGCC TGGCGGTTCG CCTCCGCCGT GGCCGGGGTG CTCTCGGTCC TCATCCTGGT GCGGCTGGCC ACGCGGATGA CCCGCTCGGT GCTGCTGGGC TGCACGGCGG GGCTGATCAT GGCGCTGGAC GGCCTGCACT TCACGCTGAG CCGCATCGCC ATGGTGGACA TCTTCCTGAC CCTGTGGATC CTGGCCGGAT TCGCGTGCCT GGTGATCGAC CGGGACAGCA CCCGGGAGCG GATGGCCCGC CTGGCGGAGG CGGGAGGGGA CCTGGCGTCG GTGGGCTGGC TGGGGATGCG CTGGTGGCGG CTGGCGGCCG GTCTGTGCTT CGGCCTGGCG GTGGGCACCA AGTGGTCGGC CCTGTTCTTC GTCGCGGCGT TCGGCCTGCT CACGGTGGCG TGGGACTACG GGGCCCGCAG CAGCGTCGGC CAGCGCGGCT TCTTCTGGCG GTGGCTGGGC GTCGACGCCG TCCCGGCGTT CGTGCAGACC GTGGTGGTCG CGGGGGTCGT CTACCTGGTC TCGTGGTCGG GGTGGCTGTT CACCCGCGGC GGCTACAACC GCGACTTCGC GGACGGCATG GCGCCCGAGT GGGTGCCCGG GTTCCTGCGG GCGCCGGTGG AGGCGCTGTG GAGCCTGGTG GACTACCACC AGCGGATGAT GACCTTCCAC ACCGACCTGA CCAGCGACCA CGCCTACATC TCCGCGCCGT GGGAGTGGCT GGTCATGCGC ACCCCGGTGA TGTTCCACTA CAACGGCGAG GTCGCCTCGT GCGACACGGG CGACTGCGTC ACCTCCGTGG TCTCCATCGG CACGCCGGTC ATCTGGTGGT CCAGCCTGCT CGCGCTGGCG GTGATGCTCG GCTGGTGGGT GACCTTCCGC GACTGGCGGG CCGGGGCGGT GCTGCTGGCC GTGGCCGCGG GGTGGCTGCC GTGGTTCGCC TACCCGGACC GGCCCATGTT CCTGTTCTAC GCCGTCCCGC TGCTGCCCTT CCTGGTCCTG GCGATCGTGC TGGCGCTGGG CCTGGCGATG GGGGCCGGGG AGGACAGTCC GCGCTTCGCC CCCTACACGC GTGCGGTGGG CGGCATCGTC TACGGCGTGG TCCTGCTGTT GATCGTGGCC AATTTCGCCT ACTTCTACCC GGTGTTGTCG GCGTATCCGA TCGACGAGGG TATGTGGCGT GAACGCATGT GGTTCGACGT GTGGATCTAC GGCAGCGGCG GTTAG
|
Protein sequence | MTTTAPFSEA EPSPPPRSAR IRARFLPVNP APRWLGWLGA FAVALFAGVL RFFNLGQPDR IYFDETYYAK DAYGLWNFGY EHETVEEPVE ADDLLAQGYQ DIFTGTGDFI VHPPVGKWMI ALGDWLWSLL PFGTTMTPEA WRFASAVAGV LSVLILVRLA TRMTRSVLLG CTAGLIMALD GLHFTLSRIA MVDIFLTLWI LAGFACLVID RDSTRERMAR LAEAGGDLAS VGWLGMRWWR LAAGLCFGLA VGTKWSALFF VAAFGLLTVA WDYGARSSVG QRGFFWRWLG VDAVPAFVQT VVVAGVVYLV SWSGWLFTRG GYNRDFADGM APEWVPGFLR APVEALWSLV DYHQRMMTFH TDLTSDHAYI SAPWEWLVMR TPVMFHYNGE VASCDTGDCV TSVVSIGTPV IWWSSLLALA VMLGWWVTFR DWRAGAVLLA VAAGWLPWFA YPDRPMFLFY AVPLLPFLVL AIVLALGLAM GAGEDSPRFA PYTRAVGGIV YGVVLLLIVA NFAYFYPVLS AYPIDEGMWR ERMWFDVWIY GSGG
|
| |