Gene Ndas_2196 details

Gene Information

Locus tag: Ndas_2196
Symbol:
ID: 9246046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2623378
End bp: 2625726
Gene Length: 2349 bp
Protein Length: 782 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Phosphoketolase
Protein accession: YP_003680124
Protein GI: 297561150
COG category:
COG ID:
TIGRFAM ID:
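
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2623378
END_BP = 2625726


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, stop_codons: int = 1) -> int:
    """Residue count, assuming the CDS ends in `stop_codons` stop codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 2349 bp
print(length_aa, "aa")  # 782 aa
```

Under these assumptions the computed values match the listed Gene Length (2349 bp) and Protein Length (782 aa).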


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0118493
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000352137
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCCTGC AAGAGGACGA ACTACGAGGC GTGGACGCCT ACTGGCGGGC CGCGAACTAC 
CTGTCGGTCG GGCAGATCTA CCTGCTGGAC AACCCGTTGC TGCGTGAGGA ACTGCGGCCC
GAGCACATCA AGCCGCGCCT GCTGGGGCAC TGGGGCACCA CCCCGGGGCT GAACTTCTGC
TACGCGCACC TGAACCGGGC GATCCGTGAT CGGGACCTGG AGATGATGTA CGTGATGGGG
CCGGGGCACG GCGGTCCGGC CGCGGTGGCC AACGCCTGGC TGGAGGGCAC CTACAGCGAG
GTGTACCGGC ACGTGAACCA GGACGTGGAG GGGATGCGGC GGCTGTTCCG GCAGTTCTCC
TTCCCCGGCG GGGTGCCCAG CCACGTGGCG CCCGAGACGC CGGGGTCGAT CCACGAGGGC
GGGGAGCTGG GGTACTCGCT GGCGCACGCC CAGGGCGCGG CCTTCGACAA CCCCGACCTG
GTGGTGGCGT GCGTGGTGGG CGACGGGGAG GCCGAGACGG GCCCCCTGGC GGGCAGTTGG
CGCGGCAACG CGTTCCTGGA CCCGGTGCAC GACGGCGCGG TGCTGCCGAT CCTGCACCTG
AACGGGTACA AGATCGCCAA TCCCACGGTG CTGGCGCGGA TCCCCGAGCG GGAACTGCTG
TCCTACTTCG AGGGGCTGGG GTACCGGCCG CTGCTGGTCA GCGGTGAGGA TCCGGAGTAC
ATGCACAGGC GGATGGCCGA GGCGGTGGAC GAGGCTCTGG AGGAGATCGC GCGGATCCAG
CACCGGGCGC GGGTGTCGGG TGTGCGCGGG CGCGCGGTGT GGCCGGTGGT GATCCTGCGT
TCGCCCAAGG GGTGGACGGG GCCGGCGGAG GTGGACGGTG TGCCGGTGGA GGGGACGTGG
CGTTCCCACC AGGTGCCGTT GGGCCAGGTG CGTTCGGACG AGGGGCACCT GAGGCAGTTG
GAGGAGTGGA TGCGCTCCTA CCGGCCCGAG GAGTTGTTCG AGGAGTCGGG TGCCCCGGTG
GCCGAGGTGC GCGCGCAGGC CCCGGAGGGC GAACGGCGGA TGAGTGCGAG TCCGCACGCC
AACGGCGGGG TGCTGCGTCG GGAGCTGGTG CTGCCGGACT TTCGCGGGTA CGCGGTGGAC
GTGGGCCGGA ACGGGACGCA GACCGGTGAG GCGACTCGGG TGCTGGGCCG TTTCCTGCGG
GACGTGATCC GGTCCAATCC GCGCACGTTC CGGTTGATGG GGCCTGACGA GACGGCCTCC
AACCGGTTGG ACGCGGTGTT CGAGGCCTCG GACAAGGAGT GGCAGGCCGA GCGGCTGGCG
ACCGATGAGC ACCTGGGGCC CGGCGGCCGG GTGATGGAGG TGCTCAGCGA GCACCTGTGC
CAGGGGTGGT TGGAAGGGTA CCTGTTGACC GGGCGGCACG GGCTGTTCGG CTGTTACGAG
GCGTTCGTGC ACATCGTGGA CTCGATGTTC AACCAGCACG CGAAGTGGTT GAAGGTGTCC
CGGGATCTGC CGTGGCGGCG GCCGATCTCG TCGTTGAACT ACCTGTTGAC CTCGCACGTG
TGGCGTCAGG ACCACAACGG GTTCACCCAC CAGGATCCGG GGTTCCTGGA CGTGGTGCTC
AACAAGCCCG CGGAGGTGGT GCGCGTGTAC CTGCCGCCGG ACGCCAACAC GCTGTTGTCG
GTGGCCGACC ACTGCCTGCG CTCCAGCGAC TACGTGAACG TGGTGGTGGC GGGCAAGCAA
CCGGGGCTGG GGTACCTGTC GATGGAGGAG GCGGTGGCGC ACTGCGCGCG CGGGATCGGG
ATCTGGGAGT GGGCCAGTAC CGACGGTGGG GCGGATCCGG ACGTGGTTCT GGCCTGTGCG
GGTGACGTGC CCACGTTGGA GGTTTTGGCG GCCGCGGACC TGGTGCGCCG CTGGTTGCCG
GGGGTGCGGG TACGGGTGGT CAACGTGGTG GACCTGATGC GGTTGCTGCC CGACAGCGAT
CACCCGCACG GGTTGCCCGA CGCCGAGTAC GACGCCCTGT TCACCACGGA CAAGCCGGTG
GTTTTCGCCT TCCACGGGTA TCCGTGGCTG GTGCACCGGT TGACCTACCG CAGGGCGGGC
CACGCGAACC TGCACGTGCG CGGATACCGG GAGCGGGGCA CGACGACCAC ACCGTTCGAC
ATGGTGATGC TCAACGACCT GGACCGGTTC CATCTGGTCA TGGACGTGAT CGACCGGGTT
CCGGGACTGG GCCAGCGGTC GGCCCAGGTG CGTCAGCGGA TGGTGGACGA GCGGTTGCGG
CACCGCGCCC ACACCCGGGA GTTCGGTGAG GATCCGGCGG AGATCCGTGA GTGGGTGTGG
CGGTACTGA
 
Protein sequence
MALQEDELRG VDAYWRAANY LSVGQIYLLD NPLLREELRP EHIKPRLLGH WGTTPGLNFC 
YAHLNRAIRD RDLEMMYVMG PGHGGPAAVA NAWLEGTYSE VYRHVNQDVE GMRRLFRQFS
FPGGVPSHVA PETPGSIHEG GELGYSLAHA QGAAFDNPDL VVACVVGDGE AETGPLAGSW
RGNAFLDPVH DGAVLPILHL NGYKIANPTV LARIPERELL SYFEGLGYRP LLVSGEDPEY
MHRRMAEAVD EALEEIARIQ HRARVSGVRG RAVWPVVILR SPKGWTGPAE VDGVPVEGTW
RSHQVPLGQV RSDEGHLRQL EEWMRSYRPE ELFEESGAPV AEVRAQAPEG ERRMSASPHA
NGGVLRRELV LPDFRGYAVD VGRNGTQTGE ATRVLGRFLR DVIRSNPRTF RLMGPDETAS
NRLDAVFEAS DKEWQAERLA TDEHLGPGGR VMEVLSEHLC QGWLEGYLLT GRHGLFGCYE
AFVHIVDSMF NQHAKWLKVS RDLPWRRPIS SLNYLLTSHV WRQDHNGFTH QDPGFLDVVL
NKPAEVVRVY LPPDANTLLS VADHCLRSSD YVNVVVAGKQ PGLGYLSMEE AVAHCARGIG
IWEWASTDGG ADPDVVLACA GDVPTLEVLA AADLVRRWLP GVRVRVVNVV DLMRLLPDSD
HPHGLPDAEY DALFTTDKPV VFAFHGYPWL VHRLTYRRAG HANLHVRGYR ERGTTTTPFD
MVMLNDLDRF HLVMDVIDRV PGLGQRSAQV RQRMVDERLR HRAHTREFGE DPAEIREWVW
RY
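
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned, translated, and scored for GC fraction. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; the helper names are hypothetical. Only the first three and the final blocks of the gene sequence are used as sample input, so the GC value it yields applies to that fragment, not to the whole gene.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the final block of the gene sequence above.
first_blocks = clean_sequence("ATGGCCCTGC AAGAGGACGA ACTACGAGGC")
last_block = clean_sequence("CGGTACTGA")

print(translate(first_blocks))  # MALQEDELRG
print(translate(last_block))    # RY
print(gc_fraction(first_blocks))
```

The two translated fragments agree with the start (MALQEDELRG) and end (RY) of the listed protein sequence.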