Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3122 |
Symbol | |
ID | 9246978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3737500 |
End bp | 3738789 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_003681037 |
Protein GI | 297562063 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGT CGCTTGAGTC CGCGCCGCTC CCCGCAGCCC TGGCCGGGTC CGACGCCGGG CTCGGCGAGC TCTGCGTGCT CATGAAGCTC GGCGAGATCG TCCTCAAGGG CTCCAACCGC AAGCTGTTCG AGCGGCGGCT GCACAACAAC ATCCGCGCCT CCGTGCGCGA CCTGGGCGAG GTCCGCCTCT CCCAGCGCGG CGCGGGCGTC ATCATCGTGC GCAAGCCCGA CGCCTCCGAC CTGGAGGTCG CCGAGATCGC CGACCGCATG GCCAACGTCA TGGGTGTGGT CTGGGTGCAC CTGGTCCGCC GCGTGCCCAA GGACCTGGAC GCCGTCACCG ACATCGGCGT GCGCGTCATG CAGGGCCGCG AGGGCACCTT CGCCGTGCGC GCCCGGCGCC GCGACAAGCG CTTCGAGATG ACCTCGTCGG AGCTGGCCGG GTACCTGGGC TCGAAGATCA TCGAGGCGCA CGGCTACAAG GTCAACCTCA AGCGCCCCGA CAACACCCTG TTCGTCGAGG TGGACAAGGA CGAGGCGTTC GTGTTCACCG ACGGCGTGCC CGGCCAGGGC GGCCTGCCCG CCGGGATGAG CGGCCGCGGC CTGGTGCTGA TGTCGGGCGG GATCGACTCA CCCGTCGCCG CGCACCGGAT GATCCGGCGC GGGCTCAAGG TCGACTTCCT GCACTTCTCC GGCATGCCGT TCACCGGCCC GGAGTCGATC TACAAGGCCT ACAGCCTCGT CCGCCAGCTC GACCGCTACC AGGTGGGCTC GCGGCTGTTC GTCATCCCCT TCGGCAAGGC CCAGCAGCAG CTGAAGAGCT CGGGGATCGA GCGGCTCCAG ATCGTCGCCC AGCGCCGCCT CATGCTCAAG ACCGCGGAGG CCCTGGCCGA CGACCTGGGC GCGGAGTGCC TGGTCACGGG GGACGCGCTG GGCCAGGTGT CCAGCCAGAC CATGACCAAC CTGACCGCCC TGGACGACGC GGTGGACCTG CCGATCCTGC GTCCGCTCAT CGGCATGGAC AAGACCGAGA TCATGGACCA CGCCCGCCGG ATCGGGACCC TGTCCATCTC GGAGCTGCCC GACGAGGACT GCTGCACCAT GCTGACCCCG CGCCAGGTGG AGACCGCGGC CAAGATCCCG GACCTGCGCC AGATCGAGAA GCGCCTGGAC GCCGAGGAGC TGGCCGAGCA CCTGGTCACC ACCGCGCAGG TGCACAAGCC CAGCTTCCTG GGCGACGCCG CGCCCAAGCG CGTGGCGCCC GCCTCCGCGT CGGTGGCCGC CACCGCCTAG
|
Protein sequence | MSASLESAPL PAALAGSDAG LGELCVLMKL GEIVLKGSNR KLFERRLHNN IRASVRDLGE VRLSQRGAGV IIVRKPDASD LEVAEIADRM ANVMGVVWVH LVRRVPKDLD AVTDIGVRVM QGREGTFAVR ARRRDKRFEM TSSELAGYLG SKIIEAHGYK VNLKRPDNTL FVEVDKDEAF VFTDGVPGQG GLPAGMSGRG LVLMSGGIDS PVAAHRMIRR GLKVDFLHFS GMPFTGPESI YKAYSLVRQL DRYQVGSRLF VIPFGKAQQQ LKSSGIERLQ IVAQRRLMLK TAEALADDLG AECLVTGDAL GQVSSQTMTN LTALDDAVDL PILRPLIGMD KTEIMDHARR IGTLSISELP DEDCCTMLTP RQVETAAKIP DLRQIEKRLD AEELAEHLVT TAQVHKPSFL GDAAPKRVAP ASASVAATA
|
| |