Gene Ndas_5269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5269 
Symbol 
ID9249167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp433833 
End bp435026 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content78% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003683155 
Protein GI297564182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.827708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCGG GACTGGGCGC GCTCGCCCTC GGCTGGCACG CGCTGTCCAG GACCCGTGCG 
GTCGGCAGCG AGTCCCAGGC ACTCGCCCAG CGGGCGGCCG CGGTCAGCGC CTCGGGCGTG
GACCCCTTCG CCGTGCGCGA CGTGGCGGTG CTCCACTACG ACGCCCTGGA GGAGATGTCG
GGCGCCCGCT CCTTCTCCCT GGCGCTGCTC AACAGCGAGG GTGACGGCGT CGTCGTCACC
TCCATCAACG GGCGCACCGA GTCGCGCACC TACGCCAAGG CGGTGGTGGG CGGGGAGTGC
GACACCCTGC TCAGCCCGGA GGAGTACCGG GTCGTCCGGT CGGCACGCCT GGGGGAGGGC
GTCGGCGCCG CCGCGACCGC GGGAGGCCCC CCGGCGCGGG CGGCTTCCTC CGCCGGGGGC
CGGCCGGTGA CTCCGTCCGC GCCTCGGGAG GAACCCGGGT CCCCGCCAGC CCGGGACGAG
GAGCGGGGCG CGCGTGAGGG CGCGGAGGCG GTTGACGAGG GATCGGACCG GGAGCGGCGC
GAGGCCCACC CGGCAGCCTC AACCGGTACG GGCGACTCGG CTGCCCGGAC CGCCTCGGCC
GCTCCGACAA CCTCAGCCGT CCCGTCCGTC GCGGACGACG GTCGGGAGGA CGTGGAAGAC
GCTCCGGCAG CCTCAGTCGC CCCGAACGGC ACGGGCGCGA CCCGGGAGAA CGCGGAACCG
GCTCCGGCGG GCCCGTCAGC ACGGACGGCC CGCGCCACCG AGCCCGTCGC GGATGACAGC
CGAGAGGCTT CCGAGGTCAT GCCGGCCTCA ACCGGCACGG GAGTCTCGGC CGTCCGGCCC
GCCTCAACCG CTCCGGCAGC CTCGTTCAAC CCGGCAGTCC CGGGCGGCGG CCGGGAGGAG
CAGGAGCCCC GCGGCCTCGT CTCCGTGATC CGCAGGACCA TCGGGCGCGC CGGCGCCGAC
CGTCGGCCCC CCGTCCAGGC CGTGCGGTCG GGTGTCGGAG CGGCGAGGCC CATCGCCTCC
GCCAACGTCA CGGTGCGGCC GGGGCCCTCC GCCGCCGCGG CGGGGAGCAC GGGGACTTCG
GCCACCGGGA CCGCGACGGC CGCCGAGCCG CGCCCGGAGG CCCCGACCGA CGAGGGGACC
GGCGGGGAGG GCACCCGGGA GGCGGAGGCC CCCGCGCCGG AGGCGCGCGG ATGA
 
Protein sequence
MLSGLGALAL GWHALSRTRA VGSESQALAQ RAAAVSASGV DPFAVRDVAV LHYDALEEMS 
GARSFSLALL NSEGDGVVVT SINGRTESRT YAKAVVGGEC DTLLSPEEYR VVRSARLGEG
VGAAATAGGP PARAASSAGG RPVTPSAPRE EPGSPPARDE ERGAREGAEA VDEGSDRERR
EAHPAASTGT GDSAARTASA APTTSAVPSV ADDGREDVED APAASVAPNG TGATRENAEP
APAGPSARTA RATEPVADDS REASEVMPAS TGTGVSAVRP ASTAPAASFN PAVPGGGREE
QEPRGLVSVI RRTIGRAGAD RRPPVQAVRS GVGAARPIAS ANVTVRPGPS AAAAGSTGTS
ATGTATAAEP RPEAPTDEGT GGEGTREAEA PAPEARG