Gene Ndas_5220 details


Gene Information

Locus tag: Ndas_5220
Symbol:
ID: 9249113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 370954
End bp: 373065
Gene Length: 2112 bp
Protein Length: 703 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Polyphosphate kinase
Protein accession: YP_003683106
Protein GI: 297564133
COG category:
COG ID:
TIGRFAM ID:
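The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon, and the function name check_cds_lengths is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported gene and protein lengths.

    Assumes inclusive coordinates and one terminal stop codon that is
    not counted in the protein length.
    """
    span = end_bp - start_bp + 1  # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields above.
report = check_cds_lengths(370954, 373065, 2112, 703)
print(report)
```

With the values in this record, the span is 2112 bp, which is 704 codons: 703 amino acids plus one stop codon.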


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.17424
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACGG AAACAGCGCC AACACCCGAT GGGCCCGCCG CCGGTACGCC CGAGCTTCCC 
GCCGACAGGT TCATGGACAG GGAGGAGGGG TGGCTGCGCT TCAACCAGCG CGTGCTCGAA
CTCGCCGAGG ACGAGGACAT CCCCCTCCTG GAACGCGCGC GGTTCCTGGC GATCTTCTCC
AGCAACCTGG ACGAGTTCTT CATGGTGCGC GTGGCCGGGC TCAAACGCCG CCTGGCCACC
GGCCTGTCCG TCGCCTCCTC CAGCGGACAC CACCCCCGCG CGCTGCTCGG GCGCATCTCC
CGGTTCACCC GCGAACTCAT GCTCCGCCAG GCCGCCTGCT TCCATGACAG CGTCGCGCCC
GCCCTGCGCG AGGTCGGCAT CCGCGTCGTG CGCTGGAACG AGCTGGAGAC CGTCGAGCGC
GAGAACCTGC GCGGCTACTT CCAGCGCTCG GTCTACCCGC TGCTGACACC CTTCGCGGTC
GACTCCGCGC ACCCCTTCCC CTACATCTCC GGCCGCTCGC TCAACCTCGC GGTCGCCGTC
CGCGACCCCC ACGACGGGCG CCGCATGTTC GCCCGGGTCA AGATCCCCAG CTCCCTGCCC
CGCTTCATCG AGCTGGGCGA CCGCGAGGGC GGCCGGTTCG TCCCCGTCGA GGACATCGTC
GCCGCCCACC TGCCCCAGCT CTTCGAGGGC ATGCAGATCC TGGAGCACAA CGCCTTCCGG
GTCACGCGCA ACGCCGACCT GGAGGTCGAC GAGGACGAGA CCGACGACCT CGTCACCTCC
CTGGAGAACG AGCTGCTGCG CCGCCGCTTC GGCCCGCTCG TGCGCCTGGA GGTCGAACAC
GACATCAGCG ACGAGGCCCT GGCCATACTC ACCGAGGAGC TGGGCGCCGA GGAGGAGGAG
ATCTACCGGG TCCCCGGTCC GCTCAACCTC GCCGGACTGT CCCAGATCGC CGACACCGAC
CGGCCCGAGC TGCGCTACCC GCCCATGGTC CCCGTCGAAC CGCGCGCCCT GACCTCCGGC
GACCTCTTCG CCGCCGTCCG CGAGAACGAG GTCCTCGTCC ACCACCCCTA CGAGTCCTTC
GCCACCACCA CCGAGCGCTT CCTGGCCCTG GCCGCCACCG ACCCCGAGGT CATGGCGATC
AAGCAGACCC TGTACCGCAC CAGCGGCGAC TCGCCCATCG TGGAGTCCCT CATCGAGGCC
GCCCGCGAGG GCAAGGAGGT CGTGGTCCTG GTCGAGATCA AGGCCCGCTT CGACGAGCAG
AACAACATCC AGTGGGCCCG CAAGCTCGAA CAGGCGGGCT GCCACGTCGT CTACGGCGTC
GTCGGCCTCA AGACCCACTG CAAGCTGTCG ATGGTCGTGC GCCGCGACGA CGACGGGCTG
CTGCGCCGCT ACTGCCACGT GGGCACCGGC AACTACAACC CGAGCACCGC CCGCATGTAC
GAGGACCTCG GCCTGTTCAG CGCCGACCCC GAGGTGGGCG AGGACGTCAG CGACCTGTTC
AACAGCCTCA CCGGCTTCTC CCGCAAGAAG CACTACCAGC GCCTGCTCGT GGCGCCCGGC
GCCCTGCGCG AGAGCCTGCT CGAACAGATC GCCGTCGAGA TCGACAACAG CCTCAGGGGC
GAGCCCGCCC GCATCCGGAT CAAGGTCAAC TCCCTGGTCG ACCTGGAGAT CATCGACGCC
CTGTACCGGG CCTCCCAGGC CGGGGTGAGC GTCGACCTGT GGGTGCGCGG CAGCTGCGTC
CTGCGCGCCG GGGTGCCCGG ACTGTCCGAC AACATCCGGG TGCGCAGCAT CCTGGGCCGC
TTCCTGGAGC ACTCCCGCAT CTTCGTCTTC GCCAACGGCT GGCGGCCCCA GGTGTGGATC
GGCAGCGCCG ACCTGATGCC CCGCAACCTC GACCGCCGGG TCGAGGCCCT GGTACGCGTG
GCCGACACCG AGCAGTGCCG CAGGCTGGTG CGCCTCATGG ACCTGGCGAT GGACGACGGC
ACCTCGTCCT GGCGGCTCGA ACCCGACGGG ACGTGGACCC GGTTCACCCG CGACGACGAC
GGGCGACCGC TCACCGACCT CCAGGACAGC CTGCGCGGCG ACCGCCACCT GCGCGTGGTG
GAGGGCGGGT GA
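A GC content figure such as the one in the Gene Information section could be computed from a sequence laid out in space-separated blocks like the one above. The following is a minimal sketch under that assumption. The function name gc_fraction is hypothetical, and the demonstration uses only the first ten-base block of the gene sequence, not the full gene.

```python
def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines between blocks) is ignored and
    the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence above (illustration only).
first_block = "GTGAGCACGG"
print(gc_fraction(first_block))
```

Passing the complete gene sequence as a single string, with its spaces and line breaks left in, would give the whole-gene value.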
 
Protein sequence
MSTETAPTPD GPAAGTPELP ADRFMDREEG WLRFNQRVLE LAEDEDIPLL ERARFLAIFS 
SNLDEFFMVR VAGLKRRLAT GLSVASSSGH HPRALLGRIS RFTRELMLRQ AACFHDSVAP
ALREVGIRVV RWNELETVER ENLRGYFQRS VYPLLTPFAV DSAHPFPYIS GRSLNLAVAV
RDPHDGRRMF ARVKIPSSLP RFIELGDREG GRFVPVEDIV AAHLPQLFEG MQILEHNAFR
VTRNADLEVD EDETDDLVTS LENELLRRRF GPLVRLEVEH DISDEALAIL TEELGAEEEE
IYRVPGPLNL AGLSQIADTD RPELRYPPMV PVEPRALTSG DLFAAVRENE VLVHHPYESF
ATTTERFLAL AATDPEVMAI KQTLYRTSGD SPIVESLIEA AREGKEVVVL VEIKARFDEQ
NNIQWARKLE QAGCHVVYGV VGLKTHCKLS MVVRRDDDGL LRRYCHVGTG NYNPSTARMY
EDLGLFSADP EVGEDVSDLF NSLTGFSRKK HYQRLLVAPG ALRESLLEQI AVEIDNSLRG
EPARIRIKVN SLVDLEIIDA LYRASQAGVS VDLWVRGSCV LRAGVPGLSD NIRVRSILGR
FLEHSRIFVF ANGWRPQVWI GSADLMPRNL DRRVEALVRV ADTEQCRRLV RLMDLAMDDG
TSSWRLEPDG TWTRFTRDDD GRPLTDLQDS LRGDRHLRVV EGG
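The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method used to produce the record. It assumes that translation table 11 uses the standard codon assignments, that an initiator codon (here a subset: ATG, GTG, TTG) is read as methionine, and that translation stops at the first stop codon. The names CODON_TABLE, START_CODONS, and translate are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 is assumed to share them.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Subset of initiator codons, assumed sufficient for this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence):
    """Translate a coding sequence, reading an initiator codon as M."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("GTGAGCACGG AAACAGCGCC AACACCCGAT"))
print(translate("GAGGGCGGGT GA"))
```

The first call yields MSTETAPTPD, matching the start of the protein sequence, and the second yields EGG, matching its end before the TGA stop codon.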