Gene Ndas_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2974 
Symbol 
ID9246827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3549623 
End bp3551716 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content72% 
IMG OID 
Producttransketolase 
Protein accessionYP_003680890 
Protein GI297561916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCCC ACACCCCGCA AACCCTGGAG TGGTCGGACC TCGACCTCCG CGCCGTCAAC 
ACGGTCCGGG CCCTGGCGAT GGACGCGGTC GAGAAGTCGG GTAACGGACA CCCCGGCACC
GCGATGAGCC TGGCGCCAGC CGCCTACCTG CTCTTCCAGA AGATCATGCG CCACGACCCC
TCGGACCCCG AGTGGACGGG CCGGGACCGC TTCGTGCTGT CGATCGGCCA CTCCAGCCTC
ACCCTGTACA TCCAGCTCTA CCTCGCCGGG TACGGGCTCG AACTGGACGA CCTCAAGAAC
CTGCGCCAGT GGGGCAGCCG CACGCCCGGC CACCCCGAGT TCAGCCACAC GCCGGGGGTG
GAGACCACCA CCGGTCCGCT GGGCCAGGGC GTGGGCAACG CGGTCGGCAT GGCCATGGCC
GCCCGCCGCG AGCGCGGCCT GTTCAACCCC GAGGCCGGCC CGGGCGCCAG CCCCTTCGAC
CACCACATCT ACGCGTTCTG CTCCGACGGC GACGTCCAGG AGGGCGTGAG CCACGAGGCC
AGCGCCCTGG CCGGGACCCA GCAACTGGGC AACCTCATCA TGATCTGGGA CGACAACCGG
ATCTCCATCG AGGACGACAC CCGCATCGCG CACTCCGAGG ACGTGGCCGA GCGCTACCGC
GCCTACGGCT GGCACGTGGA GGAGGTCGAC TGGAGCGCCA CCGGCGAGTA CGTCGAGGAC
GTCGAGGCCC TGTTCCAGGC GATCGTGCGC GGCAAGGCCG AGACCCAGCG CCCGACCTTC
ATCCGCCTGC GCACCGTCAT CGGCTGGCCC GCGCCCAACA AGCAGAACAC CGGCGCCATC
CACGGCGCGG CCATCGGCGC CGACGAGATC TCCGCCACCA AGGCGATCCT CGGCCTGCCC
GACGAGCCCT TCGCCGTCGA GGACGCGGTG ATCGAGCACA CCCGCCGCGC CGTGGACCGG
GGCCGCGAGG CCCGCGCCGC CTGGGAGGTG GAGTTCAGGG CCTGGCACGA GAGCGCCGGG
GAGCACGCCG AACTGTTCGA CCGCCTGGTC GAGAAGCGGC TGCCCGAGGG CTGGGAGAAG
GCCCTGCCGA CCTTCGAGGC CAGCGAGAAG GGGATGGCCA CCCGCAAGGC CAGCGGCGAG
GTGCTCTCGG CCCTGGCGCC GCTGCTGCCC GAGCTGTGGG GCGGCTCGGC CGACCTGGCC
GGGTCCAACA ACACCACGCC CAAGGGCGAG CCGTCCTTCC TGCCCTTCGA CCGCGCCAGC
GAGATGTTCC CGGGCAGCCC CTACGGGCGC GTCCTGCACT TCGGCGTGCG CGAGCACGGC
ATGGGCTCCA TCCTCAACGG CATGGCCCTG CACGGCCCGA CCCGGCCCTA CGGCGGCACC
TTCCTCGTCT TCAGCGACTA CATGCGCCCC GCGGTCCGGC TGGCCGCGAT CATGCAGCTG
CCGGTCACCT ACGTGTGGAC GCACGACTCC ATCGGCCTGG GCGAGGACGG CCCCACCCAC
CAGCCGGTCG AGCACCTGTG GGCGCTGCGC GCCATCTACG GCCTGGACGT GATCCGCCCC
GCCGACGCCA ACGAGACCGC CGTGGTGTGG CGCGAGGTGA TCGAGCGGGG CGACCGCCCG
TCCGCGCTGG CGCTGACCCG CCAGAACCTG CCCGTCCTGG ACCGCGAGGA GTACGCCTCG
GCCGAGGGCG CGGTCAAGGG CGGCTACGTG CTGGCCGAGG CCGACGGGGG CTCCCCCGAG
GTCATCATCA TGGCCACCGG CAGCGAGGTG CAGATCGCCC TGGACGCCCG CAAGGCGCTC
CAGGAGGCGG GCACGCCCAC CCGCGTGGTG TCCATGACGT GCGTGGAGTG GTTCGAGCGC
CAGAGCGAGG AGTACCGCGA GCAGGTGCTG CCCTCCTCCG TGCGCGCCCG CGTGTCGGTG
GAGGCCGGGA TCGCCCTGGG CTGGCGCGAG TACGTCGGCG ACGCCGGCGA GTCGGTGAGC
CTGGAGCACT ACGGCGCCTC CGCCCCCTAC CAGGTCCTGT ACGAGAAGTT CGGCTTCACG
ACCGAGGCGG TCGTCGAGGC GGCCCGCAAG AGCATCGCCA GGGCCGGCAG CTGA
 
Protein sequence
MNAHTPQTLE WSDLDLRAVN TVRALAMDAV EKSGNGHPGT AMSLAPAAYL LFQKIMRHDP 
SDPEWTGRDR FVLSIGHSSL TLYIQLYLAG YGLELDDLKN LRQWGSRTPG HPEFSHTPGV
ETTTGPLGQG VGNAVGMAMA ARRERGLFNP EAGPGASPFD HHIYAFCSDG DVQEGVSHEA
SALAGTQQLG NLIMIWDDNR ISIEDDTRIA HSEDVAERYR AYGWHVEEVD WSATGEYVED
VEALFQAIVR GKAETQRPTF IRLRTVIGWP APNKQNTGAI HGAAIGADEI SATKAILGLP
DEPFAVEDAV IEHTRRAVDR GREARAAWEV EFRAWHESAG EHAELFDRLV EKRLPEGWEK
ALPTFEASEK GMATRKASGE VLSALAPLLP ELWGGSADLA GSNNTTPKGE PSFLPFDRAS
EMFPGSPYGR VLHFGVREHG MGSILNGMAL HGPTRPYGGT FLVFSDYMRP AVRLAAIMQL
PVTYVWTHDS IGLGEDGPTH QPVEHLWALR AIYGLDVIRP ADANETAVVW REVIERGDRP
SALALTRQNL PVLDREEYAS AEGAVKGGYV LAEADGGSPE VIIMATGSEV QIALDARKAL
QEAGTPTRVV SMTCVEWFER QSEEYREQVL PSSVRARVSV EAGIALGWRE YVGDAGESVS
LEHYGASAPY QVLYEKFGFT TEAVVEAARK SIARAGS