Gene Ndas_1546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1546 
Symbol 
ID9245396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1892986 
End bp1894311 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content79% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679481 
Protein GI297560507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.348015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGCTG CCACAATCCG GGACATGCAG ACAGGACCGG ACGTACTCCA CACCACCGAC 
TGGACCCGCG CCTTCCACGC CTACGGCTCC GGCGCCGACA CCCCCGGGCA CCTCGCCAGC
CTGCTCACCG GCGACGCGCG TGAGCGCGAG CGCGCGCTGG ACCACCTCCA CGGCGCGCTC
CTGCACCAGG GCACGGTCTA CCCCGCCACC GTCCCCGCCG CCCTGTTCGT CGCCGGGATC
CTCGACCGCC CCGAACCCGA CGACCCGGCC CGGGGCCCGC TCCCCGAGGC GTCGGACGCG
GCCCCGGGTG CCCCGCGCCG CGTCCTGCTC GACTTCCTGG CCGCCGCGGC CGAGGGCGCG
CTGCACCAGA CGCCCCCCGG CCCCGTGCCC GAACCCCCCG CCGGGGCCGA GCTGGACCGG
GTCTACGCCG CACTGGCCTC CGACGACGAG GACGAGGCGG TCGGGGTCTG GGAGACCCCC
GCCGTGGACG CGCTCATGCG GAGGGTGGGC CCGGACATGC GCGCCGCCGC GCCCGTGCTG
TACGCGGCGG TCGAACCCCA CCTGACCGCC TCCGACGCGC ACACGCGCAT GTGCGCGGTG
GAGGCCGCCT CCGCCCTGGC CCGGCTCGGC GGGCTGGAGC CCGACCTGTC CGGCGCGGCC
GACATGGCCG AGACCCGCGA CGAGGGCGCG GTGATCGTCC TGGCCCTGGG CGCCTGCGGC
GCCGACACCA CCGAGTTCCT CGCCCACGCC GACCCCGCCA TCCGCGCCTG CGCCGCGCTG
GCGCCCGGCC AGCGAGCCAA CCCCGCGGCC ACCGCGGAGC TGGCCGCCGC GCTCGCGGAC
CCGGAGGCGG CCGACGCCTG GTTCACCAGG CGTCCCGCCC ACTTCACCGG CCACGTGCGT
TTCGCCCTGG TCCGGGAACT CGCCGAGCGC TCCACCGCCG AGGACGCCGC GCGCCTGCTC
CCGGTGCTGC GCGCGCTGGC CCCGCTGACC TCACCCCTCA CCGCGGCGGC CGACGCCGGT
CCGCTGCTGG ACCTGGCCTT CCGCGCCGCC GACACGGGCC GCGGCGCGGC CGCCGACGCG
GACGACGGCA CCACCGCTGA CGCGGGCGCC GAAACGGGTC CCGACGGGTC GGCGGCCCCG
TCCGCCACGG GGCCCGCCGC GCCGCGCGAC CCCGCGGAGC TGACCGCCGT CCAGCGCGAC
TACCTCCGGG TCCTGGCCGA CCACGACGGC TTCTGGGACG GCCGGTTCGC CAACTTCCTG
GTCGTGCTCG CCCGGCTGGG GCTGCCCCGC GAGCGCCGCG GGCTCCGCGC GCTGCTCGCG
GCCTGA
 
Protein sequence
MGAATIRDMQ TGPDVLHTTD WTRAFHAYGS GADTPGHLAS LLTGDARERE RALDHLHGAL 
LHQGTVYPAT VPAALFVAGI LDRPEPDDPA RGPLPEASDA APGAPRRVLL DFLAAAAEGA
LHQTPPGPVP EPPAGAELDR VYAALASDDE DEAVGVWETP AVDALMRRVG PDMRAAAPVL
YAAVEPHLTA SDAHTRMCAV EAASALARLG GLEPDLSGAA DMAETRDEGA VIVLALGACG
ADTTEFLAHA DPAIRACAAL APGQRANPAA TAELAAALAD PEAADAWFTR RPAHFTGHVR
FALVRELAER STAEDAARLL PVLRALAPLT SPLTAAADAG PLLDLAFRAA DTGRGAAADA
DDGTTADAGA ETGPDGSAAP SATGPAAPRD PAELTAVQRD YLRVLADHDG FWDGRFANFL
VVLARLGLPR ERRGLRALLA A