Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3703 |
Symbol | |
ID | 9247572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4441769 |
End bp | 4443463 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003681607 |
Protein GI | 297562633 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCTC TCGACGGCCG CGACGTCAGT TCCACCGCCG ACCACCGGGT GACCACCGGT CCCATCATGG GATCACGCAA GGTCTACCGG GAGGTCGCCA CCCCCGAGGG GCACACCCTC CGCGTCCCGC AGCGCCGGGT GGAACTCAGC AACGGCAGCC ACTTCGACCT GTACGACACC TCGGGTCCGT ACACCGACGA CACCGCGCAC ATCGACGTCC ACCGGGGCCT GGCCCCCACC CGCGGGGAGT GGGCCCACGC GCCCGCCCCG TCCGGCGGCG CGACCACCCA GCTGGCCCAC GCCAAGGCCG GGACCATCAC CCCCGAGATG CGGTTCGTGG CCGCCCGGGA GGGCGTGGAC CCCGAGTTCG TCCGCCAGGA GGTGGCCGTG GGCCGGGCGG TCATCCCCGC CAACCGGTGC CACCCCGAGT CCGAGCCGAT GATCATCGGC AAGAACTTCC TGGTCAAGAT CAACGCCAAC ATCGGCAACT CCGCCGTCAC CTCCTCCGTG CCCGAGGAGG TGGAGAAGAT GGTGTGGGCC ACCCGCTGGG GGGCCGACAC CGTCATGGAC CTGTCCACCG GCAAGCGCAT CCACGAGACC CGCGAGCGCA TCCTGCGCAA CTCGCCCGTC CCGATCGGGA CCGTACCCAT CTACCAGGCC CTGGAGAAGG TCAACGGCGA CCCGGCGGCG CTGAGCTGGG AGGTCTACCG CGACACCGTC ATCGAGCAGT GCGAGCAGGG CGTGGACTAC ATGACCGTCC ACGCGGGCGT GCTGCTGCGC TACGTGCCGC TCACCGCCCG CCGGGTCACC GGCATCGTCT CGCGCGGCGG CTCCATCATG GCGGCCTGGT GCCTGGCCCA CCACAGGGAG AGCTTCCTGT ACACCCACTA CGAGGAGCTG TGCGAGATCC TCCGGGAGTA CGACGTCACC TTCTCCCTCG GCGACGGGCT GCGGCCGGGG TCGATCGCCG ACGCCAACGA CGAGGCCCAG TTCGCCGAGC TGCGCACCCT GGGCGAACTC ACCCACATCG CCCGCGCGCA CGACGTGCAG GTGATGATCG AGGGGCCCGG GCACGTGCCG ATGCACAAGA TCGCGGAGAA CGTGCGCCTG GAGGAGGAGC TGTGCGGCGA GGCGCCGTTC TACACGCTCG GCCCCCTGGC CACCGATGTC GCGCCCGGCT ACGACCACAT CACCTCCGCG ATCGGCGCCG CCCAGATCGG CTGGCTCGGT ACGGCGATGC TGTGCTACGT CACCCCCAAG GAGCACCTGG GGCTGCCCGA CCGGGACGAC GTCAAGACCG GTGTGATCAC CTACAAGCTC GCCGCGCACG CCGCCGACCT GGCCAAGGGG CACCCCCGGG CGCAGGAGTG GGACGACGAG CTGTCCAAGG CGCGGTTCGA GTTCCGCTGG CAGGACCAGT TCCACCTGGC CCTGGATCCC GAGACCGCGC AGTCCTTCCA CGACCAGACC CTGCCCGCCG AACCCGCCAA GACCGCGCAC TTCTGCTCCA TGTGCGGGCC GAAGTTCTGC TCCATGAAGA TCACGCAGGA CGTGCGCAGG TACGCCGAGG AGCACGGGCT GGAGACGGTG GCCGCCATCG AGAAGGGCAT GGCCGACAAG TCCGCGGAGT TCGCCGAACA GGGTAAGCGG GTCTACCTGC CCCTGGCGGA CCAGGAGACC GCGCAGCACC AGTGA
|
Protein sequence | MTALDGRDVS STADHRVTTG PIMGSRKVYR EVATPEGHTL RVPQRRVELS NGSHFDLYDT SGPYTDDTAH IDVHRGLAPT RGEWAHAPAP SGGATTQLAH AKAGTITPEM RFVAAREGVD PEFVRQEVAV GRAVIPANRC HPESEPMIIG KNFLVKINAN IGNSAVTSSV PEEVEKMVWA TRWGADTVMD LSTGKRIHET RERILRNSPV PIGTVPIYQA LEKVNGDPAA LSWEVYRDTV IEQCEQGVDY MTVHAGVLLR YVPLTARRVT GIVSRGGSIM AAWCLAHHRE SFLYTHYEEL CEILREYDVT FSLGDGLRPG SIADANDEAQ FAELRTLGEL THIARAHDVQ VMIEGPGHVP MHKIAENVRL EEELCGEAPF YTLGPLATDV APGYDHITSA IGAAQIGWLG TAMLCYVTPK EHLGLPDRDD VKTGVITYKL AAHAADLAKG HPRAQEWDDE LSKARFEFRW QDQFHLALDP ETAQSFHDQT LPAEPAKTAH FCSMCGPKFC SMKITQDVRR YAEEHGLETV AAIEKGMADK SAEFAEQGKR VYLPLADQET AQHQ
|
| |