Gene Ndas_2198 details

Gene Information

Locus tag: Ndas_2198
Symbol:
ID: 9246048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2628015
End bp: 2629634
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003680126
Protein GI: 297561152
COG category:
COG ID:
TIGRFAM ID:
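
The coordinates and lengths in this record can be cross-checked against each other. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the 1620 bp gene length includes one terminal stop codon. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information record above.
START_BP = 2628015
END_BP = 2629634
GENE_LENGTH_BP = 1620
PROTEIN_LENGTH_AA = 539


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1620
print(expected_protein_length(GENE_LENGTH_BP))  # 539
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.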


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.861325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000258347
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGTGC CCGACCCGGA GCACCACGTC CGCGACTGGC TCGCCACTTA CGACACACCC 
ACCACATCGG TGGCCCACCT CCTCTGCGAC CGCCACGACC CCCACGCCAC CGCCACCACC
GAGATCGGCC CCACCCTGGA GGCCACCACC CTCACCTTCG GCGAACTCGC CCGGCGCTCC
CGCGACCTGG CCACCGGTCT GGCCGACCTG GGCATCACCA GCGGGGACCG CGTCGCCACC
CTCATCCCCA AGGGCGTGGA CCTGACCGTC ACCGCCCTGG CCGTGTGGCG TCTGGGCGCC
GTCCTGGTCC CCCTGCTCTC CTCCTTCGCC CCCTCGGCCA TCAACGAGCG CCTCACCGAC
TCCGGCGCCC GCCTGGTGGT GTGCGACGCC GAGTACCGCG CCAAGCTCGT CCCCGGCGCC
GACCGCCCCT GGCACATCGC CACCACCGCC GCCGAGCCCG CCCACGAGGG CGACCACACC
CTCACCGGCC TGGCCGCCCG CGCAGCCGCG GCGCCGTCCG TCCCCGACGC CGCCGTGGGC
GGGGACGGCC CGCTGGCCGT CGTCTACGTC TCCGGGGTCA TCGGCCCGCC CCGGGGCGTG
CGGGTGCCCG TGCGCGCCCT GGCCGCCATG CACGCCTACC ACCACTACGG CCTGGGCGTG
CACGACGACG ACGTCTACTG GAACACCGCC GACCCCGGCG CGGCCTACGG CCTCTACCAC
GGGCTCATCT CGCCGCTGCT GGCCGGGCAC AACTCCCTGG CGCTGCGGGC CGGGTTCTTC
GACCCGGAGC TGACCCTGGA CGTGCTGGGC GTGCACGGCG TCACCAACCT GGCGGCCGAT
CCCACCACCT ACCGCACGCT GCGCGCGGCC ACCAAGACCC TGCCGCCCGA GGTGATGGTG
CAGAGCCTGG CCAGTGCGGG CGAGCCGCTG GCCCCCGACG TCATCGACTG GGTCACCGAC
GTGTTCGGCG TCCCGGTGCG CGACCACTAC GGGCAGACCG AGCTGGGCTG GTGTGTGGGC
GTGCCCAACG GCGACACCGG TCAGGGGCCG GCCGATCCGC CGCCGGGGGC GATCGGTCCC
GCGCTGCCGG GCTGGCGGGT GCAGATCCTG GAGGCGATCT CCGACGACCC CGCTCCCCTG
GGCGCCTACG GGCGTGTGGC GGTGGACCTG GAGCGCAGTC CGCTGGCCTG GTTCGAGGGC
TACGTCGGTC AGGAGGGGGC CTCGCAGGTC AGGTTCACCC CCGACCGCGC CTACTACCTG
ACCGGGGACA CCGGTATCCA GGACCGGCAG GGGTCGTTGT TCTTCTCCAC CCGTGACGAC
GGCGCCATCT TGACCTACGG GTACCGGATC GGTCCGAGTG AGGTGGAGTC GGTGCTCAAC
GCCCACCCGG CGGTGGAGGA GTGCGGGGTG TACTCGATCC CCGACGAGCT CGCCGGGCAG
GTGATCGGTG CGCGGGTGGT TCTGGGCGCC GGTCACGAGG CCACTCCGGA GCTGGCCGAG
GAGCTCAAGG GGTGGGTGGG CGAGCGGTTC GCCGCGCACG CCGCGCCGCG GGTGGTGGAC
TTCGTGGAGG AGCTGCCTCG TACGGCCTCG GGCAAGATGC GCCGGGCGCG CCTGCGCTGA
 
Protein sequence
MPVPDPEHHV RDWLATYDTP TTSVAHLLCD RHDPHATATT EIGPTLEATT LTFGELARRS 
RDLATGLADL GITSGDRVAT LIPKGVDLTV TALAVWRLGA VLVPLLSSFA PSAINERLTD
SGARLVVCDA EYRAKLVPGA DRPWHIATTA AEPAHEGDHT LTGLAARAAA APSVPDAAVG
GDGPLAVVYV SGVIGPPRGV RVPVRALAAM HAYHHYGLGV HDDDVYWNTA DPGAAYGLYH
GLISPLLAGH NSLALRAGFF DPELTLDVLG VHGVTNLAAD PTTYRTLRAA TKTLPPEVMV
QSLASAGEPL APDVIDWVTD VFGVPVRDHY GQTELGWCVG VPNGDTGQGP ADPPPGAIGP
ALPGWRVQIL EAISDDPAPL GAYGRVAVDL ERSPLAWFEG YVGQEGASQV RFTPDRAYYL
TGDTGIQDRQ GSLFFSTRDD GAILTYGYRI GPSEVESVLN AHPAVEECGV YSIPDELAGQ
VIGARVVLGA GHEATPELAE ELKGWVGERF AAHAAPRVVD FVEELPRTAS GKMRRARLR
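
The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a minimal sketch, not the database's own method: it builds the codon table by hand (the sense-codon assignments of translation table 11 match the standard genetic code), ignores the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last 30 bases of the gene are used here; pasting in the full sequence should work the same way.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGCCCGTGC CCGACCCGGA GCACCACGTC"  # start of the gene sequence
last_30 = "GGCAAGATGC GCCGGGCGCG CCTGCGCTGA"   # end of the gene sequence

print(translate(first_30))  # MPVPDPEHHV
print(translate(last_30))   # GKMRRARLR
print(gc_content(first_30))
```

The two translated fragments match the first and last residues of the listed protein sequence. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.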