Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4711 |
Symbol | |
ID | 9248593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5591296 |
End bp | 5592585 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682603 |
Protein GI | 297563629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.890895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTATCT TGTTCGCAAC GTTCTCCGAG AAGACCCACT TCATCGGGAT GACCCCCCTG GCATGGGCGC TGCGCGCCGC CGGGCACGAG GTGCGCGTCG CCAGCCAGCC CGAACTCGCG CCGACGGTGG CCGCGACCGG GCTGCCGTTC GTCGCCGCGG GGTCGGACCA CGTGCTCCCC CAGGTGATCG CCTGGGTCGG GCGCATGGCG CGGGACATGC GCCCCGACTT CGACATGATG CGCGTGGCGG CTCCGGAGGT CCCCTCCGGG GAGGAGCTGC GGGCCGCCTA CCGCGACGTG CTGGTGCCGC TGTGGTGGAA GGTCGTCAAC GACCCGATGC TGGAGGACCT GGTCGCCTTC TGCCGCGAGT GGCGCCCCGA CCTGGTCGTG TGGGAGCCCA TCACCTTCTC CGCGGCGATC GCCGCGGAGG CGTGCGGTGC GGCGCACGTG CGCTTCCTGT GGAGCCTGGA CCTGTTCGCC GCGATGCGCG AACAGTACCT GCGCCACATG GAACGACAGC CCCCACAGGA ACGCGACGAC CCCCTCGCCG CATGGCTGGG CGACCGCGCC GCCCGCCACG GCGTCGACTT CTCCGAAACC CTCGTCCGCG GCCAGGCCAC CCTGGACTAC CTGCCCGCCT CCCTGGGCGT GCCCGCCCCC ACCGGAGCCC GCCGCCTGCC CATCCGCTAC GTGCCCTACA ACGGACGCGC CGTCGTCCCC GACTGGCTGC GCACACCCCC CACCCGCCCC CGCGTCTGCC TCAGCCTCGG GACGACGGCC ACCCAGCGCC TGGGCGGCTA CACGGTCGAC GTCGCGACCC TCCTGGAGGG CCTGGCCGAC CTGGACGTGG AGGTCGTGGC CACCCTGCCC GCCCGCGAGC AGGAGAAGCT GGGCGCCGTC CCCGACAACG CCCGCCTGGT CGAGTACGTC CCCCTGCACG CCCTGACCCC CACCTGCGCC GCCATGATCA CCCACGGCGG GGCGGGCACC GTGATGTCCG GCCTGGTGCA CGGGGTCCCG CAGTCGGCCG TGCCGCACCA CATGTACGAC GAGCCCCTGC TGGCCTCACT GGTGGCCGCG CAGGGCTCGG GGGTGGTCGT GGACCCCTCC CGGGTCACCC CCGAGGCCGT CCGGGAGAGC ACCCGGAGGC TGCTGGAGGA CCCCTCCCAC GCCGAGGCGG CGCGACGCCT GCGCGGGGAG GTGGACGCCA TGCCCTCCCC CGCCGAGGTC GCGCGCCGGC TGGCGCGGGC CGCGGGGGAG GGCGGGCGGG TGGACCTCAC ACGGTGGTGA
|
Protein sequence | MRILFATFSE KTHFIGMTPL AWALRAAGHE VRVASQPELA PTVAATGLPF VAAGSDHVLP QVIAWVGRMA RDMRPDFDMM RVAAPEVPSG EELRAAYRDV LVPLWWKVVN DPMLEDLVAF CREWRPDLVV WEPITFSAAI AAEACGAAHV RFLWSLDLFA AMREQYLRHM ERQPPQERDD PLAAWLGDRA ARHGVDFSET LVRGQATLDY LPASLGVPAP TGARRLPIRY VPYNGRAVVP DWLRTPPTRP RVCLSLGTTA TQRLGGYTVD VATLLEGLAD LDVEVVATLP AREQEKLGAV PDNARLVEYV PLHALTPTCA AMITHGGAGT VMSGLVHGVP QSAVPHHMYD EPLLASLVAA QGSGVVVDPS RVTPEAVRES TRRLLEDPSH AEAARRLRGE VDAMPSPAEV ARRLARAAGE GGRVDLTRW
|
| |