Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2959 |
Symbol | |
ID | 9246812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3534861 |
End bp | 3536162 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003680875 |
Protein GI | 297561901 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAG TCCGCGAGTT CGGTCGGGCC GGGGCGGCCC CGCTCGACGT CGCGGCGATC CGGGAGGACT TCCCGATCCT CTCCCGGACC GTCCGCGACG GGCGGCCGCT GGTGTACCTC GACTCCGGGG CCACCTCGCA GAAGCCCCGT CAGGTGCTCG ACGCCGAACG TGAGTTCTAC GAACGCCACA ACGCGGCGGT GCACCGTGGC GCGCACCAGC TCGCGGAGGA GGCGACCGAC GCCTACGAGG CGGCCCGGGA GACCATCGCC CGGTTCATCG GGGCCGACGC CGGGGAGGTG GTGTTCACCA AGAACGCCAC CGAGGGCGTC AACCTGGTGG CCTACGCGCT GAGCAACTCC GCGACCGCCG ACGAGGAGCT GCGCCGCTTC CAGGTGGGGC CCGGCGACGA GGTCGTGGTG ACCGAGATGG AGCACCACGC CAACCTGGTG CCCTGGCAGC AGCTGTGCCA GCGCACCGGG GCGACCCTGC GCTGGTTCCC GGTGACCGAC GAGGGGCGCC TGGACCTGTC GGGGATCGCC GACCTGGTCA ACGAGCGCAC CAAGGTCGTG GCGTTCAGCC ACCAGTCCAA CGTGCTGGGC ACCGTCAACC CGGTCGGGGA GATCGTCGCC CGGGCCCGCG AGGTCGGCGC GCTGACCGTG CTGGACGCCT GCCAGTCGGT CCCGCACATG CCGGTCGACG TGGCCGCGCT GGGCGTGGAC TTCCTGGTCT TCTCCGGCCA CAAGATGCTC GGTCCCAACG GGATCGGGGT GCTGTGGGGG CGCCGGGAGC TGCTGGAGGC CATCCCGCCG TTCATCACCG GCGGTTCCAT GATCGGCGTG GTCCACATGG AGTACTCCAC CTGGGCCGAC CCGCCCCAGC GCTTCGAGGC CGGTGTGCCG ATGGCCCCGC AGGCCGTCGG CCTGGCCGCG GCCTGCGACT ACCTGTCCCG CGTGGGCATG GAGCGGGTCG CCGAGCACGA GCACGCGCTG ACCGAGTACG CGCTGGAGCG GATCGGCGCC CTGGAGGGCG TGCGGATCGT CGGCCCCGCC GAGGCGGTGG ACCGCGGCGG CGCCGTGTCC TTCGCCGTGG ACGACATCCA CCCGCACGAC GTGGGCCAGG TCCTGGACGA CCGGGGCGTC GAGGTCCGGG TGGGCCACCA CTGCGCCTGG CCGCTGCACC GCAGGCTGGG CGTCGTCGCG ACCACGCGCG CGTCCTTCTA CCTGTACAAC ACGCGGGAGG ACGTGGACGC GCTCGTGGAG GCCATCAGGG CCGCGCAGAA GTTCTTCGGA ACCCGGCCCT AG
|
Protein sequence | MTTVREFGRA GAAPLDVAAI REDFPILSRT VRDGRPLVYL DSGATSQKPR QVLDAEREFY ERHNAAVHRG AHQLAEEATD AYEAARETIA RFIGADAGEV VFTKNATEGV NLVAYALSNS ATADEELRRF QVGPGDEVVV TEMEHHANLV PWQQLCQRTG ATLRWFPVTD EGRLDLSGIA DLVNERTKVV AFSHQSNVLG TVNPVGEIVA RAREVGALTV LDACQSVPHM PVDVAALGVD FLVFSGHKML GPNGIGVLWG RRELLEAIPP FITGGSMIGV VHMEYSTWAD PPQRFEAGVP MAPQAVGLAA ACDYLSRVGM ERVAEHEHAL TEYALERIGA LEGVRIVGPA EAVDRGGAVS FAVDDIHPHD VGQVLDDRGV EVRVGHHCAW PLHRRLGVVA TTRASFYLYN TREDVDALVE AIRAAQKFFG TRP
|
| |