Gene Ndas_1745 details

Gene Information

Locus tag: Ndas_1745
Symbol:
ID: 9245595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2123254
End bp: 2125131
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: amino acid adenylation domain protein
Protein accession: YP_003679679
Protein GI: 297560705
COG category:
COG ID:
TIGRFAM ID:
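
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the CDS ends in one stop codon that is not counted in the protein length. Both readings are assumptions rather than something the record states, and the function name below is hypothetical. A minimal sketch of the check:

```python
def lengths_from_coords(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with a single stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = lengths_from_coords(2123254, 2125131)
print(gene_len, protein_len)  # 1878 625
```

The result matches the Gene Length and Protein Length fields of this record.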


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.825969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACCA ACAGTGCGGT GAGCGGCCTG GAGGGGAAGA CCTTCCTCGC CGGCCTTCTC 
GAACGCCAGG CCAGGAACCA GCCGGACGCG ACCGGCCTCG TGTATGCGGG CCGCGCCCAC
ACCTACGCCG ACCTCAACGC GAGGGCCAAC CGGCTCGCGC GTGCGCTCAT CGACGCCGGG
GTGGGGCCGG AGACGCGCGT GGCGGTGTCG ATGCGGCGCT GCCCGGGTGC GATCGTCGCC
CTCTTCGCCG TGCTCAAGGC GTGCGGCGTC TACCTTCCGA TCGACGCCAC CCACCCCCGA
GAGCGCATCG GATACGTCCT GGCCGACAGC GCCCCGAAGG TCGCCGTCAC CGATGACCGG
GGCGCCGACG CCCTCGGCCG CCACGGTGTG CCGATGACGT TCCTGCGCCT GGGAGAGGAC
GGCGACACCG GCGGACACCC CGCCGACCAC GACGTCCGCG ACGAGGAGCG CCGGTCACCG
CTGCGCCCCG ACAACCTCGC GTACGTCATG TACACCTCCG GCTCCACGGG CAGACCCAAG
GGCGTGCAGA TCTCCCAGTC CAGCCTGGGC CTGTACAGCC GCCACTACGG CCGGTTCTTC
GACGAGGTGG ACGCCGGACG GCGCCTGCGC ATCGCCCACA CCGCCGCGCT GACCTTCGAC
GTCGGCTGGA ACTCCGTCAT CGGCTTGGCC GCGGGCCACG AGATGCACCT CTACGCGGAG
GAGGACTACC GGGACGTCGA TCGCTTCGTG CGGATCATGA GCCGACACCG ACTCGACTGC
GTCGTGTTCA CCGCGTCCTA CTGGGGGGCG CTGGTGCAGT CCGCGGAGTG GGGCAGGGGG
GAGCACACGC CGCGGGTGCT GCTCTCCTGC GGGGAGGCGT TCCCGAACGC CCTCTGGCAG
CGGCTCCGGG GGATCGAGGG AACCCGCGTG ATGAACACCT ACGGGCCGAC CGAGGCCACC
GTGGAGGCGG TGGCCACCGA CACCGACGCC ACCCCCCGCC CCACGCTGGG CACACCGATC
CCGGACACGG GCATCCACGT CCTGGACGAC GCGCTCGCGC CCGCGCCGAC CGGGTCGCCC
GGCGAGCTCT ACATCACCGG CGCCCGCCTC GCCCGCGGGT ACCTCAACCG ACCCGGGCTG
ACCGCCGAGC GGTTCGTCGC CTCGCCGTTC GCCCCGGGAG AGCGCATGTA CCGGACGGGT
GACGTCGTCC GGCGGAACGA CGTCGGCGAT CTGGAGTTCC TCGGGCGCGT CGACGACCAG
GTGAAGATCC GCGGCTTCCG TGTCGAGCCG GGGGAGGTCG AGGCGGTGCT CGCCTCCCAC
CCCGCCGTCT CCCGGGCCGC GGTCGTGGTG CGGGAGGACC GGGACGGGGC GCGCGGTCTC
GTCGGCTACT TCGTCGTGGA CGGTGGCGGC GTGGACGAGG CCGAACTGCG CCGGCACCTG
GGCAGGGCAC TGCCGGACTA CATGGTGCCC TCCGCCCTGT TGAGGGTGGA CGAGATGCCG
CTCAACGCCA ACGGCAAGCT CGACAGGGGA GCGCTCCCGG AGCCGACGAG GAACGCGGAG
AGCGCGCCGG ACCGGGCGGA CACGGTCGAG GAGGTCCTCC TCCACATCCT TCGCGAGGTG
CTGGAGGAAC CCGGGCTCGG GCCCGGGGAC CTCTTCACCG AACGCGGAGG CGACAGCATC
CGGGCGTTCC GCGTGGTCAC CCGGGCCCGG GACTCCGGAG TCGTGGTCTC CACGACCGAC
GTCCTCAGGC ACCAGTCCGC GACGGCGATC GCAGGGGCGG CCACCGTGGA CGCCGGGGCC
GGGTCGGACG GGGCCGGGCC GACCACGCGT GTCCCGGACC GCGAGGTCAG CGAACTCCAG
AAGGAACTCG GCCTGTGA
 
Protein sequence
MATNSAVSGL EGKTFLAGLL ERQARNQPDA TGLVYAGRAH TYADLNARAN RLARALIDAG 
VGPETRVAVS MRRCPGAIVA LFAVLKACGV YLPIDATHPR ERIGYVLADS APKVAVTDDR
GADALGRHGV PMTFLRLGED GDTGGHPADH DVRDEERRSP LRPDNLAYVM YTSGSTGRPK
GVQISQSSLG LYSRHYGRFF DEVDAGRRLR IAHTAALTFD VGWNSVIGLA AGHEMHLYAE
EDYRDVDRFV RIMSRHRLDC VVFTASYWGA LVQSAEWGRG EHTPRVLLSC GEAFPNALWQ
RLRGIEGTRV MNTYGPTEAT VEAVATDTDA TPRPTLGTPI PDTGIHVLDD ALAPAPTGSP
GELYITGARL ARGYLNRPGL TAERFVASPF APGERMYRTG DVVRRNDVGD LEFLGRVDDQ
VKIRGFRVEP GEVEAVLASH PAVSRAAVVV REDRDGARGL VGYFVVDGGG VDEAELRRHL
GRALPDYMVP SALLRVDEMP LNANGKLDRG ALPEPTRNAE SAPDRADTVE EVLLHILREV
LEEPGLGPGD LFTERGGDSI RAFRVVTRAR DSGVVVSTTD VLRHQSATAI AGAATVDAGA
GSDGAGPTTR VPDREVSELQ KELGL
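
The sequences above are printed in space-separated blocks of ten, so the spacing has to be removed before any computation. The sketch below shows one way to clean a block, compute its GC fraction, and translate it. The helper names are hypothetical. The codon table is written out by hand and uses the standard amino acid assignments, which translation table 11 shares. One caveat is that table 11 also allows alternative start codons, which this sketch does not handle. That does not matter for this gene, since it starts with ATG.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCGACCA ACAGTGCGGT GAGCGGCCTG GAGGGGAAGA CCTTCCTCGC CGGCCTTCTC"
print(translate(clean(first_line)))  # MATNSAVSGLEGKTFLAGLL
```

The printed residues match the first two blocks of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.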