Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3978 |
Symbol | |
ID | 9247849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4758448 |
End bp | 4759716 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | sulfate adenylyltransferase, large subunit |
Protein accession | YP_003681881 |
Protein GI | 297562907 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.819909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACG ACATCCTGCG GTTCGCGACG GCGGGTTCCG TCGACGACGG CAAGAGCACC CTGATCGGCC GACTGCTGTT CGACTCCAAG TCCATCTTCG AGGACCAGCT CGACGCGGTG GAGCGCACCA GCGTCGCGCG CGGCGAGGAG CAGACCAACC TGGCGCTGCT CACCGACGGC CTGCGCGCCG AGCGCGAACA GGGCATCACC ATCGACGTGG CGTACCGCTA TTTCGCCACC CCCAAGCGCA CCTTCATCAT CGCCGACACC CCCGGGCACA TCCAGTACAC CCGGAACATG GTCACGGGCG CCTCCACGGC CGACCTCGCC ATCATCCTGG TGGACGCGCG CAAGGGCCTG CAGGAGCAGA GCCGCCGCCA CGCCTTCCTC ACCACCCTGC TCCAGGTGCC CCACCTGGTG CTGGCGGTCA ACAAGATGGA CCTGGTGGAC TACTCCCAGG AGCGCTTCGA GGAGATCCGG GCCGAGTTCG CCGACTTCGC CACCAAGCTG GACGTGTGCG ACCTGACCTT CGTGCCGATC TCGGCCCTGA ACGGCGACAA CGTGGTGAGC CGCTCGGAGA ACATGCCCTG GTACACCGGG CCCTCCCTGC TCCACCACCT GGAGAACGTG CACATCGCCT CCGACCGCAA CCTCATCGAC GCGCGCTTCC CCGTGCAGTA CGTGATCCGG CCGCACAGGT CGGCCGACCC CGAGCTGCAC GACTACCGGG GTTACGCGGG CCAGATCGCG GGCGGCGTCC TCAAGCCGGG CGACGAGGTC ACCCACCTGC CGTCCGGGCT GAACACGCGG ATCGCCAGGA TCGTCACCGC CGACGGCGAC GTCACCGAGG CCTACTCGCC GATGTCGGTC ACCCTCCTGC TGGAGGACGA GATCGACATC TCCCGGGGCG ACATGATCTG CCGCCCCAAC AACGCCCCCG CGGTCACCCA GGACCTGGAG GCGATGGTGT GCTGGATGAC GGACGCGCGC AAGCTCACGC CGCGCTCCAA ACTCATCCTC AAGCACACCA CCCGCACCGC GAAGGTCATG GTCAAGGACC TGCGCTACCG GCTGGACGTC AACACGCTGC ACCGCGACGA GCAGGCCGAC CACCTGTCGC TCAACGAGAT CGGCCGGGTG CGGCTGCGCT CCACCCAGCC CCTGTTCGTG GACGAGTACG CCAAGAACCG CCAGACGGGC GGGTTCATCC TCATCGACGA GTCCACCAAC ACCACGGTCG CCGCGGGGAT GGTCGTCAAG ACCGACTGA
|
Protein sequence | MSNDILRFAT AGSVDDGKST LIGRLLFDSK SIFEDQLDAV ERTSVARGEE QTNLALLTDG LRAEREQGIT IDVAYRYFAT PKRTFIIADT PGHIQYTRNM VTGASTADLA IILVDARKGL QEQSRRHAFL TTLLQVPHLV LAVNKMDLVD YSQERFEEIR AEFADFATKL DVCDLTFVPI SALNGDNVVS RSENMPWYTG PSLLHHLENV HIASDRNLID ARFPVQYVIR PHRSADPELH DYRGYAGQIA GGVLKPGDEV THLPSGLNTR IARIVTADGD VTEAYSPMSV TLLLEDEIDI SRGDMICRPN NAPAVTQDLE AMVCWMTDAR KLTPRSKLIL KHTTRTAKVM VKDLRYRLDV NTLHRDEQAD HLSLNEIGRV RLRSTQPLFV DEYAKNRQTG GFILIDESTN TTVAAGMVVK TD
|
| |