Gene Ndas_5389 details


Gene Information

Locus tag: Ndas_5389
Symbol:
ID: 9249292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 568076
End bp: 569251
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: phosphate ABC transporter, periplasmic phosphate-binding protein
Protein accession: YP_003683274
Protein GI: 297564301
COG category:
COG ID:
TIGRFAM ID:
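
The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(568076, 569251)
print(length)                  # 1176
print(protein_length(length))  # 391
```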


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACG GGCGTTCCAA GCTCGCCGTC CTGGCGGCCA TCCCCCTGCT CGCGGCGGCG 
TGCAGCAGCG GAGGCGACAC CGGCGGCGAC GGCTCCGGCG GTGGCGGGGG CGACCTCAGC
GCCTCCCTGA CCGGTGCCGG GGCCAGCTTC CCCGACCCCC TGTTCCAGGA CTGGATCTAC
ACCTACTCCA ACGACGTCCA GCCCGGCGTC AGCGTCAACT ACCAGAGCGT GGGCTCCGGT
GCCGGCGTGG AGCAGTTCCT GGAGCAGACC GTCGACTTCG GCTCCTCCGA GGAGCCGCTG
GGCGAGGAGG ACCTCGCGGC CGCCTCCGAG GCCCGCGGCT GTGAGGCCGT GCAGTTCCCG
GTCGTCTTCG GCGCGGTCGT CATCGCCTTC AACAACCCCG AGCTGGACGG CCTGGTCCTG
GACGCGGCGA CCATCGCGGC CATCTACGAC GGCCAGATCA CCACGTTCGA CGACCCGGCC
ATCGCCGAGC TCAACCCGGA CATGGAGCTG CCCGGCGACG AGATCATCCC CGTCCACCGC
TCCGACAGCT CCGGCACCAC CTACGTGTTC TCGCACTACC TGAGCACCGA GGTCGACTCC
TGGGCCGAGA AGTACGGCGA GGGCAAGGAG ATCGAGTGGG CCGACGGCCT GGTCGGCGGC
CAGCAGAACG ACGGCGTCGC CCAGGGCATC ACCCAGAACC CCGGCGGCAT CGGCTACGTG
AACCAGTCCT TCGCGCAGGA GGCGGGCCTG TCCGTCGCCC ACATCGTGAA CGAGGACGGC
AACCCGATCG AGCCGACCCT GGAGTCCACC ATCGCGGCCT CCGAGGAGGC CGAGATCCCG
GACAACTTCC AGTTCGCGAT CGACAACATC GGCGGCGAGG GTTACCCGAT CGCCGGTTCC
AACTGGATCT TCACCTACAC CTGCGGCTAC GAGCAGGCCA GCGCCGACGC CATCATCGAC
TTCTGGACCT GGGCCCTGAC CGACGAGGGC GCCCGCGAGC TGGCCGGTGA GCTGGGCTAC
GCGCCGCTGG GCCAGGAGCT GACCGACCGC GTCGTCGCCG AGCTGGAGAA GACCAACGCC
GAGAACGGCG GCGCCGACGC CGGCGCCGAG GAGGAGGCCG GAGCCGAGGA GAGCGCCGAG
GAGAGCGCTC CCGAGGCCGA AGCCACCACC GAGTAG
 
Protein sequence
MRNGRSKLAV LAAIPLLAAA CSSGGDTGGD GSGGGGGDLS ASLTGAGASF PDPLFQDWIY 
TYSNDVQPGV SVNYQSVGSG AGVEQFLEQT VDFGSSEEPL GEEDLAAASE ARGCEAVQFP
VVFGAVVIAF NNPELDGLVL DAATIAAIYD GQITTFDDPA IAELNPDMEL PGDEIIPVHR
SDSSGTTYVF SHYLSTEVDS WAEKYGEGKE IEWADGLVGG QQNDGVAQGI TQNPGGIGYV
NQSFAQEAGL SVAHIVNEDG NPIEPTLEST IAASEEAEIP DNFQFAIDNI GGEGYPIAGS
NWIFTYTCGY EQASADAIID FWTWALTDEG ARELAGELGY APLGQELTDR VVAELEKTNA
ENGGADAGAE EEAGAEESAE ESAPEAEATT E
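
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is illustrative only: it assumes the sequences are pasted in with the 10-base spacing shown above, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted, which this sketch does not model). The names translate and gc_content are hypothetical helpers, not functions from any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGCGCAACG GGCGTTCCAA G"))  # MRNGRSK
print(translate("ACCACC GAGTAG"))            # TTE
print(gc_content("ATGCGCAACG"))              # 0.6
```

Passing the full gene sequence to translate and gc_content allows the result to be compared against the Protein sequence and GC content fields of this record.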