Gene Ndas_4120 details


Gene Information

Locus tag: Ndas_4120
Symbol:
ID: 9247994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4919616
End bp: 4921322
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: thiamine pyrophosphate protein TPP binding domain protein
Protein accession: YP_003682021
Protein GI: 297563047
COG category:
COG ID:
TIGRFAM ID:
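
The coordinates and lengths in the table above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length(4919616, 4921322)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.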


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACA AGGCAACCAG CGAGTCCACC AGCAGCGAGA AGACCACGAT CTCCGGCGGC 
CACCTGGTCG CCAAGGCCCT CAAGGCCGAG GGGATCGACG TCATCTTCAC GCTCTGCGGC
GGACACATCA TCGACATCTA CGACGGATGC GCGGACGAGG GCATCGACGT GGTCGACGTG
CGGCACGAGC AGGTCGCCGC GCACGCCGCC GACGGCTACG CCCGCGTCAC CGGCAAGCCC
GGGTGCGCCG TCGTGACCGC CGGACCGGGG ACCACCGACG CGGTCACCGG CATCGCCAAC
GCCTACCGCG CGGAGAGCCC GATGCTGGTC ATCGGCGGCC AGGGCGCCCT GAGCCAGCAC
AAGATGGGCT CGCTCCAGGA CCTGCCGCAC GTGGACATGA TCAACCCGAT CTCCAAGTTC
GCCGCCACCG TGCCCCACAC CGAGCGGGTC GCCGACCTGG TCTCGATGGC CTTCCGCGAG
GCCAACAGCG GCGCCCCCGG CCCGGCCTTC CTGGAGATCC CCCGGGACGT CCTGGACGCC
GAGGTGCCCG TGGAGCGGGC CCGCGTCCCC GCCAAGGGCC GCTACCGCGC CTCCACCCGG
CAGGCGGGCG ACCCCGCCGC GATCGAGCGG CTCGCCGACC TGATCGTGCG CTCCGAGAAG
CCCAGCATCC TGCTCGGCAA CCAGGTGTGG ACCACCCGGG CCACGCAGTC CGCCACCGAC
CTGGTGCGCG CGCTCAACAT CCCCGCCTAC ATGAACGGCG CGGGCCGGGG CACCCTGCCG
CCCGGGGACC CGCACCACTT CCAGCTCTCC CGGCGCTACG CCTTCACCAA CTCCGACCTG
ATCATCATCG TCGGCACCCC CTTCGACTTC CGGATGGGCT ACGGCAAGCG CCTCTCGCCC
ACCGCCACGG TGGTGCAGAT CGACCTCAAC TACGCCACCG TCGGCAAGAA CCGCGACGTG
GACCTGGGGC TGGTCGGGGA CGCCGACGTG ATCCTGTCCT CGGTGCTCCA GGCGACCTCG
GGCTACGGGG ACAACGGCGC CCAGAGCCGC AAGACCTGGC TGGAGGAGCT GCGCACCCAG
GAGCAGGCCG CGCTGGACAA GCGGGCGCAC CTGCTCACCT CCGACTCCAC GCCCATCCAC
CCCTACCGGC TGGTCAGCGA GATCAACCAG TTCCTCACCG AGGACTCCAT CTACGTCGGC
GACGGCGGCG ACATCGTCAC CTTCTCCGGC CAGGTGGTCC AGCCCAAGTC GCCGGGCCAC
TGGATGGACC CCGGGCCCCT CGGCACGCTG GGCGTGGGCG TCCCGTTCGT GATGGCGGCC
AAGTACGCCC GCCCGGACAA GGAGGTGGTG GCCCTCTTCG GCGACGGCGC GTTCAGCCTG
ACCGGCTGGG ACTTCGAGAC CCTGGTCCGG TTCGACCTGC CCTTCGTCGG CATCGTGGGC
AACAACTCCT CGATGAACCA GATCCGCTAC GGCCAGATCG CCAAGTACGG CGCGGACCGG
GGCGAGATCG GCAACACCCT GGGCGACGTC AACTACGCCG AGTTCGCCCG GATGCTGGGC
GGCCACGGCG AGGAGGTCCG GGACCCGGCC GACATCGCCC CGGCGCTGCG CCGCGCCCGC
GAGTCCGGCA AGCCCTCGCT GATCAACGTC TGGATCGACC CCGAGGTCTA CGCCCCGGGA
ACGATGAACC AGACCATGTA CAAGTAG
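
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind the GC content field; the helper names are hypothetical and the record does not say how its own figure was computed or rounded.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, as formatted in the record.
first_blocks = "ATGGCCGACA AGGCAACCAG CGAGTCCACC"
print(len(clean_sequence(first_blocks)))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.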
 
Protein sequence
MADKATSEST SSEKTTISGG HLVAKALKAE GIDVIFTLCG GHIIDIYDGC ADEGIDVVDV 
RHEQVAAHAA DGYARVTGKP GCAVVTAGPG TTDAVTGIAN AYRAESPMLV IGGQGALSQH
KMGSLQDLPH VDMINPISKF AATVPHTERV ADLVSMAFRE ANSGAPGPAF LEIPRDVLDA
EVPVERARVP AKGRYRASTR QAGDPAAIER LADLIVRSEK PSILLGNQVW TTRATQSATD
LVRALNIPAY MNGAGRGTLP PGDPHHFQLS RRYAFTNSDL IIIVGTPFDF RMGYGKRLSP
TATVVQIDLN YATVGKNRDV DLGLVGDADV ILSSVLQATS GYGDNGAQSR KTWLEELRTQ
EQAALDKRAH LLTSDSTPIH PYRLVSEINQ FLTEDSIYVG DGGDIVTFSG QVVQPKSPGH
WMDPGPLGTL GVGVPFVMAA KYARPDKEVV ALFGDGAFSL TGWDFETLVR FDLPFVGIVG
NNSSMNQIRY GQIAKYGADR GEIGNTLGDV NYAEFARMLG GHGEEVRDPA DIAPALRRAR
ESGKPSLINV WIDPEVYAPG TMNQTMYK
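
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares for internal codons; it deliberately ignores the table's alternative start codons, which is sufficient here only because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, "*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be pasted directly.
    Alternative start codons are not given special treatment.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGGCCGACA AGGCAACCAG CGAGTCCACC"))
print(translate("ACGATGAACC AGACCATGTA CAAGTAG"))
```

The two fragments correspond to the first ten and last eight residues of the protein sequence listed in the record.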