Gene Ndas_3703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3703 
Symbol 
ID9247572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4441769 
End bp4443463 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content70% 
IMG OID 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003681607 
Protein GI297562633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTC TCGACGGCCG CGACGTCAGT TCCACCGCCG ACCACCGGGT GACCACCGGT 
CCCATCATGG GATCACGCAA GGTCTACCGG GAGGTCGCCA CCCCCGAGGG GCACACCCTC
CGCGTCCCGC AGCGCCGGGT GGAACTCAGC AACGGCAGCC ACTTCGACCT GTACGACACC
TCGGGTCCGT ACACCGACGA CACCGCGCAC ATCGACGTCC ACCGGGGCCT GGCCCCCACC
CGCGGGGAGT GGGCCCACGC GCCCGCCCCG TCCGGCGGCG CGACCACCCA GCTGGCCCAC
GCCAAGGCCG GGACCATCAC CCCCGAGATG CGGTTCGTGG CCGCCCGGGA GGGCGTGGAC
CCCGAGTTCG TCCGCCAGGA GGTGGCCGTG GGCCGGGCGG TCATCCCCGC CAACCGGTGC
CACCCCGAGT CCGAGCCGAT GATCATCGGC AAGAACTTCC TGGTCAAGAT CAACGCCAAC
ATCGGCAACT CCGCCGTCAC CTCCTCCGTG CCCGAGGAGG TGGAGAAGAT GGTGTGGGCC
ACCCGCTGGG GGGCCGACAC CGTCATGGAC CTGTCCACCG GCAAGCGCAT CCACGAGACC
CGCGAGCGCA TCCTGCGCAA CTCGCCCGTC CCGATCGGGA CCGTACCCAT CTACCAGGCC
CTGGAGAAGG TCAACGGCGA CCCGGCGGCG CTGAGCTGGG AGGTCTACCG CGACACCGTC
ATCGAGCAGT GCGAGCAGGG CGTGGACTAC ATGACCGTCC ACGCGGGCGT GCTGCTGCGC
TACGTGCCGC TCACCGCCCG CCGGGTCACC GGCATCGTCT CGCGCGGCGG CTCCATCATG
GCGGCCTGGT GCCTGGCCCA CCACAGGGAG AGCTTCCTGT ACACCCACTA CGAGGAGCTG
TGCGAGATCC TCCGGGAGTA CGACGTCACC TTCTCCCTCG GCGACGGGCT GCGGCCGGGG
TCGATCGCCG ACGCCAACGA CGAGGCCCAG TTCGCCGAGC TGCGCACCCT GGGCGAACTC
ACCCACATCG CCCGCGCGCA CGACGTGCAG GTGATGATCG AGGGGCCCGG GCACGTGCCG
ATGCACAAGA TCGCGGAGAA CGTGCGCCTG GAGGAGGAGC TGTGCGGCGA GGCGCCGTTC
TACACGCTCG GCCCCCTGGC CACCGATGTC GCGCCCGGCT ACGACCACAT CACCTCCGCG
ATCGGCGCCG CCCAGATCGG CTGGCTCGGT ACGGCGATGC TGTGCTACGT CACCCCCAAG
GAGCACCTGG GGCTGCCCGA CCGGGACGAC GTCAAGACCG GTGTGATCAC CTACAAGCTC
GCCGCGCACG CCGCCGACCT GGCCAAGGGG CACCCCCGGG CGCAGGAGTG GGACGACGAG
CTGTCCAAGG CGCGGTTCGA GTTCCGCTGG CAGGACCAGT TCCACCTGGC CCTGGATCCC
GAGACCGCGC AGTCCTTCCA CGACCAGACC CTGCCCGCCG AACCCGCCAA GACCGCGCAC
TTCTGCTCCA TGTGCGGGCC GAAGTTCTGC TCCATGAAGA TCACGCAGGA CGTGCGCAGG
TACGCCGAGG AGCACGGGCT GGAGACGGTG GCCGCCATCG AGAAGGGCAT GGCCGACAAG
TCCGCGGAGT TCGCCGAACA GGGTAAGCGG GTCTACCTGC CCCTGGCGGA CCAGGAGACC
GCGCAGCACC AGTGA
 
Protein sequence
MTALDGRDVS STADHRVTTG PIMGSRKVYR EVATPEGHTL RVPQRRVELS NGSHFDLYDT 
SGPYTDDTAH IDVHRGLAPT RGEWAHAPAP SGGATTQLAH AKAGTITPEM RFVAAREGVD
PEFVRQEVAV GRAVIPANRC HPESEPMIIG KNFLVKINAN IGNSAVTSSV PEEVEKMVWA
TRWGADTVMD LSTGKRIHET RERILRNSPV PIGTVPIYQA LEKVNGDPAA LSWEVYRDTV
IEQCEQGVDY MTVHAGVLLR YVPLTARRVT GIVSRGGSIM AAWCLAHHRE SFLYTHYEEL
CEILREYDVT FSLGDGLRPG SIADANDEAQ FAELRTLGEL THIARAHDVQ VMIEGPGHVP
MHKIAENVRL EEELCGEAPF YTLGPLATDV APGYDHITSA IGAAQIGWLG TAMLCYVTPK
EHLGLPDRDD VKTGVITYKL AAHAADLAKG HPRAQEWDDE LSKARFEFRW QDQFHLALDP
ETAQSFHDQT LPAEPAKTAH FCSMCGPKFC SMKITQDVRR YAEEHGLETV AAIEKGMADK
SAEFAEQGKR VYLPLADQET AQHQ