Gene Ndas_5336 details


Gene Information

Locus tag: Ndas_5336
Symbol:
ID: 9249239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 510690
End bp: 511949
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: phosphoserine phosphatase SerB
Protein accession: YP_003683222
Protein GI: 297564249
COG category:
COG ID:
TIGRFAM ID:
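
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 510690
END_BP = 511949


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates start..end."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is an untranslated stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bases = gene_length(START_BP, END_BP)
    print(n_bases, "bp")                   # 1260 bp
    print(protein_length(n_bases), "aa")   # 419 aa
```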


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.414905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.119609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAGA AATCCACGCT GTTGGTGACG GTGACCGGCC ACGACCGTCC GGGCATCAGT 
GCTCGTCTGC TGAGCACCCT CTCCGTCTTT CCGGTGACCA TCGCCGACCT GGAACAGGTG
GTCCTCGCGG GCCGCCTCGT GCTGGGCGCG GTCCTGGAGG TGGACGAGCG GGTGGCTCCC
GGTGTGAGTC CCTCGCGGGT CTTCGAGGAG GTCCGCAACG CGCTCGACAA GACCGCCATC
GACCTCGACA TGGAGGTCGG CTACGGCAAG AGCGGAGGGA AGGACAACGG CCGCGTGAAG
GCGGTCGTCC ACGACCGGCT GCACGTGACC GTCCTGGCCG ACCCGCTGCG TCCCGGCGCC
CTGGGCGCCC TCACCTCGTG CGTCGCGCGC GCGGGCGCGA ACATCGACCG GATCGAGCGG
CTGTCCAGCT TCCCGGTGAC CTCCGTGGAG ATGGAGATCT CCGGCGGTGA CGCCGACCAG
CTGCGCGCCG AACTCGCCAT GGAGGGGTCC ACCCAGGGGG TGGACGTGGC CGTGCAGCCC
AGCGGCCTGC ACCGGCGGGC CAAGCACCTC ATCGTCATGG ACGTGGACTC CACGCTGATC
CAGGGCGAGG TCATCGAGCT GCTGGCCGCG CACGCCGGAT GCGCGGACGA GGTCGCCCGG
GTCACCGAGG AGGCCATGCG CGGCGAGCTG GACTTCGAGG AGTCGCTGCG CCGCCGGGTG
ATGCTGCTGA GGGGCCTGGA CGCCTCCGCC ATTCCCAAGG TGTGCGAGGA GATCCAGCTG
ACACCGGGCG CCAGGACGCT GGTCCGCACC CTGAAGCGCC TGGGGTACGA GTGCGGGATC
GTCAGCGGCG GCTTCACCCA GTTCACGGAC GTGCTCGTGG AGCGCCTCGG GTTGGACTAC
GCCGCCGCGA ACACCCTGGA GATCGTCGAC GGCAAGCTCA CCGGCGAACT GGTGGGGCCG
ATCATCGACC GCAAGGGCAA GGCGACCACC CTGGAGCGGT TCGCCGCGGA GGCCGGTGTG
CCCCTGGAGC AGACCGTGGC CGTGGGCGAC GGCGCGAACG ACCTGGACAT GCTGCAGGCG
GCGGGGCTGG GCGTGGCGTT CAACGCCAAG CCGGTCGTGC GGCAGCAGGC CGACACCTCG
GTGAGCGTGC CCTACCTGGA CACGATCGCG TTCATCCTCG GAATCACCCG GGAGGAGATC
GAGGCCGCGG ACATGCGGGA CCAGATCAAT CCGGTGTCGG ATTCGGTGCC GCACGACTGA
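
The GC content field can in principle be recomputed from the gene sequence. The sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed first. This is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample input is the first displayed line of the gene sequence only, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence, for illustration only.
    first_line = (
        "ATGAATGAGA AATCCACGCT GTTGGTGACG "
        "GTGACCGGCC ACGACCGTCC GGGCATCAGT"
    )
    print(round(gc_percent(first_line), 1))
```

Passing the full 1260-base sequence instead of the first line gives the value to compare against the GC content reported in the record.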
 
Protein sequence
MNEKSTLLVT VTGHDRPGIS ARLLSTLSVF PVTIADLEQV VLAGRLVLGA VLEVDERVAP 
GVSPSRVFEE VRNALDKTAI DLDMEVGYGK SGGKDNGRVK AVVHDRLHVT VLADPLRPGA
LGALTSCVAR AGANIDRIER LSSFPVTSVE MEISGGDADQ LRAELAMEGS TQGVDVAVQP
SGLHRRAKHL IVMDVDSTLI QGEVIELLAA HAGCADEVAR VTEEAMRGEL DFEESLRRRV
MLLRGLDASA IPKVCEEIQL TPGARTLVRT LKRLGYECGI VSGGFTQFTD VLVERLGLDY
AAANTLEIVD GKLTGELVGP IIDRKGKATT LERFAAEAGV PLEQTVAVGD GANDLDMLQA
AGLGVAFNAK PVVRQQADTS VSVPYLDTIA FILGITREEI EAADMRDQIN PVSDSVPHD
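
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. For internal codons, table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons. The sketch below builds the codon table by hand and stops at the first stop codon. It does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, with the matching amino acids
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGAATGAGA AATCCACGCT GTTGGTGACG"))  # MNEKSTLLVT
```

Translating the full gene sequence and stripping the spaces from the displayed protein sequence allows the two to be compared directly.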