Gene Ndas_1249 details


Gene Information

Locus tag: Ndas_1249
Symbol:
ID: 9245099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1551450
End bp: 1552919
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: phosphoesterase PA-phosphatase related protein
Protein accession: YP_003679194
Protein GI: 297560220
COG category:
COG ID:
TIGRFAM ID:
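
The start, end, gene length, and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1551450
END_BP = 1552919


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1470 489
```

Both results match the Gene Length (1470 bp) and Protein Length (489 aa) fields.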


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.195733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.362635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGAC TCCAACGGGC TGACAAGAAG GTCTACGACC ACGTGACGGG GTTGGGACCC 
GCGTCCCTGG ACGCCTACAC CCCCAAGTTC GTGCAGGCGA CGGACAACAT GGCGCCGTGG
TTCCTCATGT CGGCCACCCT CGCCGCCACC GGCGGCCCCA GGCTGCGGCG CACCGCCCTG
CGCGCCATCC TGGCCGCGGG CACCGCCAAC CTGGTCTCGG CGGGGATCAA GCAGATCTCG
GGGCGCACCC GCCCGGACAA CTCGGCCGTC CCGGCCGCGC GCAGCCCCTA CCGCTCCTAC
CCGAGCACCT CCTTCCCCTC CGGGCACACC GCCGCGGCCG CGGCCTACGC GGCCGGGGTC
ATGACCGACG CGCCCAAGCC GCTGGCCGGG CTGGTCGCCC TCCTGGCGGG CGGGGTGGCG
TTCTCCCGCG TGCACAGCGG CGTCCACTAC CCCGGCGACG TGCTCGCCGG AGTGGCCATC
GGGTGCGGCG CGGCCCTGCT CGCCGGAACG GTGGTGCCGC CCCGGCCCGA ACTGGTCTTC
GGCGCGCGCA CCGTCGCCGA CGGGGAGACC GACGTGGACC GCGAGGGCGG CGGGGTGACG
GTGGTGGTCA ACCCGCGCTC GGCCTCGGGC ACGGTGCCCG GGCTGACCGC GGCCGACGTC
ACGAGCAGGG TGGCGCGGGC GCTGCCCAAG GCGCGGATCA TCCCGCTGTC GGCCGACGAC
GACGTGGTCG GGGTCATGGA CCAGGCCGCG CGCACCAGCG AGGTGCTGGC GGTGGCGGGC
GGCGACGGCA CCGTCAACGC CGGGGCGCAG GCGGCGCTCG ACCACGACCG CCCGCTGCTG
GTGCTGCCGG ACGGCACCCT CAACAACTTC GCCCGCACCC TGGGGCTGTC CTCGGTGGAC
ATCGCGCTGC GGGCCTTCGA CGACGGGCGG CTGGCCCGGG TGGACGTGGG CGAGGTGGAC
GGCCGGATCT TCCTCAACAC GTCCTCCTTC GGCTCCTACC CGCGCATGGT GGACCGGCGC
GACAAGTGGG CCGAGCGGAT CGGCAAGTGG CCCGCGTTCG CGCTGGCCCT GTGGCAGGAC
CTGCGGGAGG TGAGCCCGAT CTCCGCGGTC GTGGACGGCG AGCCCGCCAA GGTGTGGTGG
GCGTTCGTGG GCAACTGCCA GTACCGCACG CACGGCCGGG TGCCCGCGCT GCGCGAGCAG
CTGGACGACG GGCGGCTGGA CGTGCGGGTG CTCACCGCGC GGGCGCCCTT CCCGAGGCTG
CGCGCCGTCG CGGACGTGCT GCTGGGCAAG TTCGCGCACG GCGAGGGGTA CTCCGAGCGG
CTGACCACGG GGCTGACGCT GACCATCCCG GGCGAGCCCA GGCTGCTCGC CGTGGACGGC
GAGGTCGTGG AGGGCTCGCG CACGGTGGTC TTCACCAAGC GGCACGCGGC CCTGCGGGTG
TTCGTGCCCG CCGTCGAGAC CGACCGGTGA
 
Protein sequence
MSRLQRADKK VYDHVTGLGP ASLDAYTPKF VQATDNMAPW FLMSATLAAT GGPRLRRTAL 
RAILAAGTAN LVSAGIKQIS GRTRPDNSAV PAARSPYRSY PSTSFPSGHT AAAAAYAAGV
MTDAPKPLAG LVALLAGGVA FSRVHSGVHY PGDVLAGVAI GCGAALLAGT VVPPRPELVF
GARTVADGET DVDREGGGVT VVVNPRSASG TVPGLTAADV TSRVARALPK ARIIPLSADD
DVVGVMDQAA RTSEVLAVAG GDGTVNAGAQ AALDHDRPLL VLPDGTLNNF ARTLGLSSVD
IALRAFDDGR LARVDVGEVD GRIFLNTSSF GSYPRMVDRR DKWAERIGKW PAFALALWQD
LREVSPISAV VDGEPAKVWW AFVGNCQYRT HGRVPALREQ LDDGRLDVRV LTARAPFPRL
RAVADVLLGK FAHGEGYSER LTTGLTLTIP GEPRLLAVDG EVVEGSRTVV FTKRHAALRV
FVPAVETDR
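
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be analysed. The sketch below shows one possible way to do that, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to generate this record. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-residue assignments; table 11's alternative start codons are not handled here, and this gene starts with ATG in any case.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
first_codons = "ATGAGTCGAC TCCAACGGGC TGACAAGAAG"
print(translate(first_codons))  # MSRLQRADKK
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.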