Gene Ndas_5356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5356 
Symbol 
ID9249259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp531645 
End bp533216 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003683242 
Protein GI297564269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTCG CCCTGACTCC CCTGGAGTTC GCCCGCCGCA CCCGAAGACT GCACCCCCGG 
CGCGAGGCGG TGGTGGACCG TGGCCTCCGG CTCACCTACG AGGAGTTCTT CGACCGCTGC
GACCGCTGGT CCCGCGCCCT CCAGAACATG GGGGTCTCCA AGGGCGACCG CGTCGCCTAC
ATCTCCCCCA ACACCCACGC CCAGCTGGAG TCCTTCTACG CCGTGCCGCA GCTCGGCGCG
GTGCTGGTAC CGGTCAACTT CCGGCTGTCG GCGGACGACT TCGTCTACAT CGTCAACCAC
TCCGGGGCCC GGGTCCTGTG CGTGCACGCC GACCAGCTCG ACGCCGTGGA CGGCGTCCGC
GACCAGATGC CGGACGTGGA GCGGTTCGTC GCCCTGGAGG GGGCGCGCCC CGGCTGGGAG
GACTACGAGA CCCTCGTCGC GCAGTCCACG GCGGACTACA CCCGTCCCGA GATCGACGAG
ACCGACCTGC TGACCATCAA CTACACCAGC GGGACCACGG CCCGGCCCAA GGGCGTCATG
ATCACCCACC GCAACGCGTA CATGAACTCG GTGGGCACCC TCCTCCACCT GCGGATCGGC
ATCGACGACC GGTACCTGTG GACGCTGCCG ATGTTCCACG CCAACGGCTG GACCTACACG
TGGACGGTGA CCGCCGCGGC CGCCGCCCAC GTGTGCCTGC CCGCGATGGA CCCGGCCACC
GTGTACGAAC TGATCCGCTC CGAGGGGGTC ACCTGGCTGT GCGCGGCCCC CACCGTGCTG
ATCATGCTGT CCAACGCCCC CGAGGAGGTG CGCGGCGAGG TTCCGTCCGG CGTGCACGTG
GTGACGGCCG GGGCCTCGCC CGCCGCCGAC ACCATCGAGC GCTTGGAGGA CGGTTTCGGG
TGGACCGTCA CCCATGTCTA CGGGCTGACC GAGACCACGC CGTTCATCAC CGTGTGCGAG
CCGCGCGCCG AGCACGGCGG CCTGTCCCCG CGCGACCGCG CCGCCGTCAA GGCACGCCAG
GGGGTCGAGC TGATCACCTC CGGCGAGCTG CGGGTGGTGG ACGCCGACGG CGTCGAGGTG
CCCTGGGACG GCACGACGGT CGGGGAGATC ACCGTGCGCG GGAACGTGGT GATGAAGGGC
TACTACAACG ACCCGGAGGC CACTCGGAAG GCCATGGGCG ACGGCTGGTT CCACACCGGC
GACGCGGCGG TCACCCACCC GGACGGGTAC GTGGAGATCC AGGACCGGAT CAAGGACGTC
ATCATCTCCG GCGGGGAGAA CATCTCGTCC GTCGAGGTGG AGGGGGTCCT GCTGCGGCAC
CCGGCCGTGC TGGAGGCGGC CGTCGTGGGC GTGCCGCACG AGCGCTGGGG CGAGTCGCCG
AAGGCGTCCG TGGTGCTGCG CGAGGGCGCC GCGGCCACGG AGGAGGAGCT GATCGCCTTC
GCCCGCGACA ACCTGGCGCA CTTCAAGGCG CCCACACAGG TGGAGTTCGT GGAGCAGCTT
CCCAAGACGG CCACCGGCAA GATCCAGAAG TTCGTGCTGC GCGGGGGCGC GTCCGCGGTG
TCGCGGCAGT AG
 
Protein sequence
MELALTPLEF ARRTRRLHPR REAVVDRGLR LTYEEFFDRC DRWSRALQNM GVSKGDRVAY 
ISPNTHAQLE SFYAVPQLGA VLVPVNFRLS ADDFVYIVNH SGARVLCVHA DQLDAVDGVR
DQMPDVERFV ALEGARPGWE DYETLVAQST ADYTRPEIDE TDLLTINYTS GTTARPKGVM
ITHRNAYMNS VGTLLHLRIG IDDRYLWTLP MFHANGWTYT WTVTAAAAAH VCLPAMDPAT
VYELIRSEGV TWLCAAPTVL IMLSNAPEEV RGEVPSGVHV VTAGASPAAD TIERLEDGFG
WTVTHVYGLT ETTPFITVCE PRAEHGGLSP RDRAAVKARQ GVELITSGEL RVVDADGVEV
PWDGTTVGEI TVRGNVVMKG YYNDPEATRK AMGDGWFHTG DAAVTHPDGY VEIQDRIKDV
IISGGENISS VEVEGVLLRH PAVLEAAVVG VPHERWGESP KASVVLREGA AATEEELIAF
ARDNLAHFKA PTQVEFVEQL PKTATGKIQK FVLRGGASAV SRQ