Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2043 |
Symbol | |
ID | 9245893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2464253 |
End bp | 2465245 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003679975 |
Protein GI | 297561001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.109712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTTC CGCACCGTGT CGTCATCGGG GTCTTCCCCG ACGTCGACCT CCTTGACGTC ACCGGCCCGG CCGAGGTCTT CGCGCTGGCC AACCAGGAGG CCCCGGGCCG TGCCGACTAC CGGGTCCTCC TCGCCGGGCC GACCCGCGGC GAGGTGCGGA CGTCGGCGGG CGTGCGGCTG CTCACCGACG TGTCCTTCGA CGACGTCGGC GGACAGGTGG ACACGCTGCT GGTACCCGGC GCGGTCGACA TGGGCGACGA CGGCCCCGTG GCCCGGATCG ACTCCGACGT CGTGGCGTGG GTGCGGGAGA CCGCCCCCTG CGCCCGGCGG GTGGCGTCGG TGTGCGTGGG CGCGCACGTA CTGGCGGCGG CCGGACTGCT GGACGGCAGG ACCGCGACCA CGCACTGGTC GACCGCCGCG CAGCTCGCCG CCGACCATCC GGCCGTCACG GTCGACCCGG ACCCGATCTT CGTCCGCGCC GACCGCGGAC GGCTGTGGAC GGGCGCCGGG ATCAGCGCCT GCCTGGACCT CGCACTCGCC CTGGTGGCCG AGGATCTGGG TGAGGACGTC GCGCTGGCGG TGGCCCGGCA GCTGGTGATG TACCTCAAGC GGCAGAGCGG GCAGAGCCAG TTCTCCGTGC CGCTCAGCCG GCCCGCCTCC GCCCGCCGCG ACATCGACGA GCTGCTGCTG TGGATTTCCG ACCACCTCGA CGAGGACCTG TCCGCGGAGG TGCTGGCGGC CCGGATGCAC CTGAGCGAAC GGCACTTCGC CCGCGTCTTC GCCCAGGAGA CCGGCACCGG TCCCGCCGCC TACGTCGAGG GCGTCCGGGT CGAGGCCGCC CGGCGCCTGC TGGAGACCAC CGACGACCCG CTCGACCGGG TCGCGGCCAG GGCCGGGTTC GGCTCGACGG AGACCCTGCA CCGGGCGTTC CGGCGACAGC TCGCCACCAC CCCCGCCGCC TACCGCCGCC GCTTCCGCAC CCAGGCCGCC TGA
|
Protein sequence | MSLPHRVVIG VFPDVDLLDV TGPAEVFALA NQEAPGRADY RVLLAGPTRG EVRTSAGVRL LTDVSFDDVG GQVDTLLVPG AVDMGDDGPV ARIDSDVVAW VRETAPCARR VASVCVGAHV LAAAGLLDGR TATTHWSTAA QLAADHPAVT VDPDPIFVRA DRGRLWTGAG ISACLDLALA LVAEDLGEDV ALAVARQLVM YLKRQSGQSQ FSVPLSRPAS ARRDIDELLL WISDHLDEDL SAEVLAARMH LSERHFARVF AQETGTGPAA YVEGVRVEAA RRLLETTDDP LDRVAARAGF GSTETLHRAF RRQLATTPAA YRRRFRTQAA
|
| |