Gene Ndas_2119 details


Gene Information

Locus tag: Ndas_2119
Symbol:
ID: 9245969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2537482
End bp: 2539098
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Alkaline phosphatase
Protein accession: YP_003680050
Protein GI: 297561076
COG category:
COG ID:
TIGRFAM ID:
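
The length fields in this table follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (both assumptions, though they agree with the values listed), the arithmetic can be checked like this. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2537482
end_bp = 2539098

# Assumption: coordinates are inclusive, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1617 bp
print(protein_length, "aa")   # 538 aa
```

Both results match the Gene Length and Protein Length entries above.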


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGGAA CCCCTCGCAC CCCCGCCCCC CTTCCCGTCC CCGACCCGAC CGTCAGCAGC 
ACCACACGGC GCCAGGCCCT GGTCGGCGGC GCGGCCACTC TGGGCGCCGT CGCGCTGGGC
GCCTCCTGGA GTCCCGGCGT GCGGGCCGAC GCCTCCGTCC GCGCCGACGC CCCCGCCCGG
TCGGGCGGCA TCGGTGAGCC CTTCACCCTG GGCGTCGCCT CCGGCGACCC CTTCCACAGC
AGCGTCGTGC TCTGGACCCG TCTGGCCCCC AACCCCTTCG CCGAGGACGG ACTGGGCGGC
ATGCCCGACC GGCGGGTGGA GGTCGAGTGG CAGGTCTCAC GGGAGGAGGG CTTCGGTCTG
CTCTCCGCCT CCGGAACCGT GGAGACGGGC CCCGACGCCG CCCACTCGGT GCACGTGGAG
GCGCAGGGGC TGCGGCCCGG CACCGAGTAC TTCTACCGGT TCCGCGTGGG CAACGAGATC
AGCCAGGTCG GCCGTACCAG GACCGCCCCT CCCCCCGGGA TCCGCACGGA CCGGTTCTCC
TTCGCGTTCG CCAGCTGCCA GAGCTACACC GCCGGGCACT ACAACGCCCA CGCGCACCTG
GCCGAGGAGG ACCTGGACCT GGTCGCGTTC CTGGGCGACT ACATCTACGA GACGGGCGGA
CAGGGGAGCC TGGGCCGGGG CCACCTCCCC GACCGGGAGG TCCGCACCCT GGCCGAGTAC
CGCGTGCGGC ACGCCCAGTA CAAGAGCGAC GCCAACCTCC AGGCCGCGCA CGCCGCCTTC
CCGTGGGCGG TGGTCTTCGA CGACCACGAG CTGGAGAACA ACTGGGCCGA CGACCGCTCC
AACGGCGAGG ATGTCCCGCC CGAGGAGTTC CTGCGCCGGC GCGCCCAGGC GCTGAAGGCC
TACCACGAGC ACATGCCGCT GCGCTTCGCC CAGACGCCGG TCGGCCCCGA CATGCAGCTC
TACCGCAGAC TGGCCTTCGG CGACCTGGTG GACATGCACC TGCTCGACAC CCGCCAGTAC
CGCGACCCGC AGGTGTCCGA CGCCGAGCGC GGCGACCCCT CGCGCACCCT GCTCGGCGCC
CGGCAGAAGC AGTGGCTGCG GGAGGGGCTG TCCTCGTCGC GGGCCCGCTG GAACGTGCTC
GCCCAGCAGG TGTTCTTCTC CCAGCGCGAC TTCGCCGAGG GCGGGGCGAC CGACTTCAGC
AACGACGCCT GGGACAACTA CCTCGTGGAC CGCGACGAGG TCCGCGACCA GCTGGCGCGC
ACGCGCAACG GGGTGGTGAT CACCGGGGAC GTGCACGCCA ACTACGTGTG CGACGTCAAG
GCCGACTTCG ACGCCCCCGA GTCGCCGACG GTGGCCACCG AGCTGGTGGG CACGTCCGTC
ACCAGCGGCG GCGACGGCAC CGAACAGGCC CCCGGCGACG AGGTTCAGCT GCGGGAGAAC
CCGCACATCA GGTTCGTCAA CCGCAAGCGG GGCTACGTGC GCAACGTCGT CACGCCCACC
GAGTGGACGG CCGACTACCG CGTCGTGGAC CACGTGAGCG AGCCGGGCTC GCCCATCCGC
GACCGTGCGC GGTTCGTGAT CGAGGACGGC GTGGCGGGCG TGCGCATGGA GGGGTGA
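
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input; paste the full block to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCGGGAA CCCCTCGCAC CCCCGCCCCC CTTCCCGTCC CCGACCCGAC CGTCAGCAGC"
dna = clean_sequence(first_line)

print(len(dna))                             # number of bases on that line
print(f"{gc_fraction(dna):.0%} GC")         # GC fraction of that line only
```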
 
Protein sequence
MPGTPRTPAP LPVPDPTVSS TTRRQALVGG AATLGAVALG ASWSPGVRAD ASVRADAPAR 
SGGIGEPFTL GVASGDPFHS SVVLWTRLAP NPFAEDGLGG MPDRRVEVEW QVSREEGFGL
LSASGTVETG PDAAHSVHVE AQGLRPGTEY FYRFRVGNEI SQVGRTRTAP PPGIRTDRFS
FAFASCQSYT AGHYNAHAHL AEEDLDLVAF LGDYIYETGG QGSLGRGHLP DREVRTLAEY
RVRHAQYKSD ANLQAAHAAF PWAVVFDDHE LENNWADDRS NGEDVPPEEF LRRRAQALKA
YHEHMPLRFA QTPVGPDMQL YRRLAFGDLV DMHLLDTRQY RDPQVSDAER GDPSRTLLGA
RQKQWLREGL SSSRARWNVL AQQVFFSQRD FAEGGATDFS NDAWDNYLVD RDEVRDQLAR
TRNGVVITGD VHANYVCDVK ADFDAPESPT VATELVGTSV TSGGDGTEQA PGDEVQLREN
PHIRFVNRKR GYVRNVVTPT EWTADYRVVD HVSEPGSPIR DRARFVIEDG VAGVRMEG
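
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure: translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), so a standard codon table is built here. The function name `translate` is hypothetical, and the sketch assumes the sequence begins in frame and contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    sequence = "".join(dna.split()).upper()
    protein = []
    for position in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[position:position + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence, copied from the record.
print(translate("ATGCCGGGAA CCCCTCGCAC C"))  # MPGTPRT
```

The output matches the first seven residues of the protein sequence above. Passing the full gene sequence should give a string that can be compared with the listed protein after removing its spaces.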