Gene Ndas_5282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5282 
Symbol 
ID9249180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp447976 
End bp449388 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003683168 
Protein GI297564195 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCC ACCCGACGAG CCGGATCACC ATCCCGTCGT GGGCATGGGA ACGCGAGCAC 
ACGGCGGAGT TCCTGAGAAG GCGCGATGTC CAGGCACTTT TCCGGTTCGG GCGCCAGTAC
GGCGGTGCGA GCCAGATGGC GATCTCGGCG GCGACCGGCC TCTCACAGGG GCGGGTGAGC
GAGGTACTGA ACGGCCGCAA GACCGTCACG GCCTTCGAGG TGTACGAACG GATCGCCGAC
GGCTTCGGGA TGCCCGACCG CGCGCGGATG CTGTTCGGCC TTGCACCGAA GAACCCCGGC
ACGTTCACTG GATCCGACAG CGCGGAGTCC TCGGCACCAC CGAGGCGACC CGCACCCAGG
AGCCCGGGCC CGGGGGAGGA TGAGAAGGTG CGACGTCGCG CATTCGTCGG ACTGGCCGGA
ACCGCCCTGT TCCAGGCCAC CACAGGTTCA GTGCTGGACA TGGACGAGAT CGCCGCAGCA
CTCACCAGGT ACGCGGGAGG TCGTCGGTCG ACTGTGCGAG CGGCGACGAG TCTCGGCGAG
CTCGCCAAAC GGACGCACGC GGCCAAGGCC GGATACCAGG CGTGCCATTA CACGGCGGTG
GCCAAGCGGC TACCCGAGCT GCTGCGCGGC CTGGATGACG CGGTGTCCGG TGTCGAAGAC
GACGACCTCT CCCGGGTGCA CGCGCTTCGG GCGGAGGCGT ACCACGTGGC GGCGAGTCTC
CTGCTCAAGT GCGAGGAGCG CGGTTTGTCC TACCTCGCGG CAGACCGAAG CATGCGCGCG
GCCGAGGCGA GCGAGAGCCC CGCGGTCGTG GGTGCGAGCG CCAGGGCGTT CACCCGGGCG
CTGATGCGTG AGCGCCACTA CAAGGCGGCG ACGGAACTGG CGACGAGCAC CGCCGAACGG
GTGGACAGCG CCACGGACGA GGCCACGCCG GAATCCCTGT CGGTGTACGG GTCGCTTCTG
CTGAGCGGTG CGGTCTCGGC GGCCAAGCGC GAAAACCGGG GCCAGGCGCT CAGCCTGCTT
GACGAGGCCA GCGACGCGGG TGAACGGCTC GGCGGGGATT TCAACTACCA GTGGACTGCC
TTCGGACCGA CCAACGTCCT GCTGCACCGG GTGAGCGCGG CAGTCGAGTT GGGCGACGCG
GGCAGCGCGA TCGACTACGC GCGCCAGGTG CGGTTGGACA ACATCGACGT GATGGAACGC
CGGGTGACGC TGTTCATCGA CGCGGCGCGT GCGTACCAGC AGTGGGGCAA GACCGAGCAC
GCCTACCACG CGTTGCGCAA CGCCGAGGAG ATGGCGAGCG AGGAACTGAC CGCCCGTCCG
GTGGTGCACC AGTTGATCAA CGACATCCGT ACCCGGGCGA GTGGTCACCT GTACGACAAC
GTCAGCGAAC TAGCGGAAAG GGTCGGTGCC TGA
 
Protein sequence
MPGHPTSRIT IPSWAWEREH TAEFLRRRDV QALFRFGRQY GGASQMAISA ATGLSQGRVS 
EVLNGRKTVT AFEVYERIAD GFGMPDRARM LFGLAPKNPG TFTGSDSAES SAPPRRPAPR
SPGPGEDEKV RRRAFVGLAG TALFQATTGS VLDMDEIAAA LTRYAGGRRS TVRAATSLGE
LAKRTHAAKA GYQACHYTAV AKRLPELLRG LDDAVSGVED DDLSRVHALR AEAYHVAASL
LLKCEERGLS YLAADRSMRA AEASESPAVV GASARAFTRA LMRERHYKAA TELATSTAER
VDSATDEATP ESLSVYGSLL LSGAVSAAKR ENRGQALSLL DEASDAGERL GGDFNYQWTA
FGPTNVLLHR VSAAVELGDA GSAIDYARQV RLDNIDVMER RVTLFIDAAR AYQQWGKTEH
AYHALRNAEE MASEELTARP VVHQLINDIR TRASGHLYDN VSELAERVGA