Gene Ndas_4579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4579 
Symbol 
ID9248460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5425068 
End bp5427563 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682472 
Protein GI297563498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.848749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGGC CCCGACTGGA CACGAGCGGG CTGCACGTCC TGGGCGTGCG GCACCACGGC 
CCGGGGTCGG CCCGGGCCGT CCGGGCGGCG CTGGAGGAGA TCAAACCCGA CGCCGTGCTG
ATCGAGGGAC CGCCGGAGGC GGACGCCCTC ACCTCGCTGG TGGGCGAGCT GGAGCCGCCG
GTGGCGCTGC TGGCCTACCT CGCCGACACC CCCAAGGGGG ACGAGCCGAG AGCGGGTGAC
GGTAAGGAAA AGACTGCGCG CGGAGCGCGA CCGTTGAGGG TGGCGGAGGG CGACGGGTGG
GCGTTCTGGC CCTTCGCCAG CTTCTCCCCG GAGTGGGAGG CCCTGCGCTA CGCGGTGGAG
AACGACGTGC GGGTGCGCTT TTGCGACCTG CCCGCCGCGA ACACCCTGGC CGAGCGCGTC
GCCGAGGCCG AGGAGCGGCG CCGGGCCGAG GAGGAGAGCG CCCGCACGGA GGGGGACTCA
GAGGACCCGG GGGCCGAGGA GGCTGCCGGG AACCCCGAGG CGGCCGGAGC CGGTGCGGCT
GTCGCGGCAG AGGGAAGCGG CGGGACCGAC GGGGCCGGGA CCCCGGAGGC CGCTGACGGG
ACGGACGGCG GCGAGGTCGG CGAGCCGGAG CGGATCCGGC TGGACCCCCT GGGGGTGCTC
GCCGAGGCGG CGGGCTACGA CGACGCCGAG CGCTGGTGGG ACGACGTCAT CGAGCAGCGC
CGGGACGGGG AGCCCTCCCC GTTCCCGGCG ATCGCCGACG CCATGGCCGC CGTCCGCGCC
GAGTCCGGCC CCGAGACGGA GCGCGACGCG CGCCGGGAGG CGTACATGCG CCAGACCCTG
CGCGCCACCC TCAAGGAGGG CCACGAGCGG ATCGCCGTGG TCTGCGGCGC CTGGCACGCC
CCCGCCCTGC GCGACCTGGC CGACTACTCG GTCAAGGACG ACACCGCCCT GCTCAAGGGC
CTGCCCAAGG CGAAGGTCAC CGCCACCTGG GTGCCCTGGA CCCACGGGCG CCTGGCCGCC
TCCAGCGGTT ACGGCGCCGG GGTCGCCGCC CCCGGCTGGT ACCACCACCT GTTCACCGCG
CCCGACCTCC CGGTGCACCG GTGGCTGACC GACGCCGCCC GCATGCTCCG CGAGGAGGGG
CTGGCGGTGT CCTCGTCCCA CGTCATCGAG GGCGTGCGGC TGGCCGAGAG CCTCGCCGTG
CTGCGCGGGC GGCCGCTCGC CGGACTCGAC GAGGTGGCCG AGGCGCTCAC GGCCGTCCTG
TGCGAGGGCG AGCCCACCCG GGCCGCCCTC GTCCACCGCC GGATGGCCGT CGGCGAGCGG
ATGGGCTCGG TGCCCCCGTC CACGCCCATG GTTCCGCTCC AGCGCGACCT CATCGCGATC
CGCAAGCGCC TGCGGCTCAA GGCCGAGCCC TTCGACAGCG ACCTCGACCT GGACCTGCGC
AAGGACAGCC AGCGCGAACG CAGCGTGCTG CTGCACCGGC TGCGGCTGCT CGGCGTCGAG
TGGGGCGTTC CGCGCACCCC CGACGGAGGG CAGAAGGGCA CCTTCCGGGA GTCGTGGCGG
CTGCGCTGGG ACCCCGACAT GGACGTGGCG CTGATCGAGG CCAGCCGGTG GGGCACCACC
GTCGCCTCGG CGGCCACGGC CCGGGTGGCC GACCTCGCCG AGGACGCGGC CCTGCCCGCG
CTGACCTCCC TCACCGAGCA GTGCCTGTTC GCGGACCTGG GCGGGGCCCT GCCCCGCGTG
CTGTCCCTGC TCACCGACCG CGCGGCCACC GACAGCGACG TCACCCACCT GATGGAGGCG
CTGCCGCCGC TGGCCCGCTC CGCCCGCTAC GGGGACGTGC GCGGCACCGA CAGCGCCTAC
CTGAGCACGG TGGCCGAGCA GATCCTGCGG CGGGTGTGCG TCGGGCTGGC ACCGGCCGTG
CACGGCCTGG ACGACGACGC TGCGGAGCGG TTCGTCCGGC AGATCGACGC CACGCAGGGC
GCCGCCACCC TCCTCGGCGG GGAGGGCGCC CAGGCCTGGA CGGCGGCCCT GACCGCCCTG
GCCGTACGCG ACACCCTGCC CGGCCGCATC GCCGGCCGGG TCAACCGGAT CCTGTCCGAC
TCCGGGCTCG TGGACACCGA CGAACTGCGC CGCCGCCTCG GTCTGGCGAT GTCCCCGGGC
GTCGAACCCG CCTCGGCCGC GGCCTGGCTG GAGGGGTTCC TCCAGGGCAG CGGGCTGATC
CTGGTGCACG ACGACCGGCT GCTCGGGCTG ATCGACACCT GGCTGCTCTC CCTGCCCGAG
GAGCGCTTCA CCGCGGTGCT GCCGCTGCTG CGGCGCACCT TCGGGGCCTT CAACGGCCCC
GAGCGCCAGG AGATCGGCTC GGCCGCCCTG CGGCTCGGGA CGGCGCCCGC CGCCAAGCGG
GCCGCCCCGG CCCGCGTGGA CGTGGACACC CGGCGCGCGG CCCCGGCCCT GGCCACCGCG
CTGGCCATCC TCACCGACGG AAAGGTGCGC ACATGA
 
Protein sequence
MGRPRLDTSG LHVLGVRHHG PGSARAVRAA LEEIKPDAVL IEGPPEADAL TSLVGELEPP 
VALLAYLADT PKGDEPRAGD GKEKTARGAR PLRVAEGDGW AFWPFASFSP EWEALRYAVE
NDVRVRFCDL PAANTLAERV AEAEERRRAE EESARTEGDS EDPGAEEAAG NPEAAGAGAA
VAAEGSGGTD GAGTPEAADG TDGGEVGEPE RIRLDPLGVL AEAAGYDDAE RWWDDVIEQR
RDGEPSPFPA IADAMAAVRA ESGPETERDA RREAYMRQTL RATLKEGHER IAVVCGAWHA
PALRDLADYS VKDDTALLKG LPKAKVTATW VPWTHGRLAA SSGYGAGVAA PGWYHHLFTA
PDLPVHRWLT DAARMLREEG LAVSSSHVIE GVRLAESLAV LRGRPLAGLD EVAEALTAVL
CEGEPTRAAL VHRRMAVGER MGSVPPSTPM VPLQRDLIAI RKRLRLKAEP FDSDLDLDLR
KDSQRERSVL LHRLRLLGVE WGVPRTPDGG QKGTFRESWR LRWDPDMDVA LIEASRWGTT
VASAATARVA DLAEDAALPA LTSLTEQCLF ADLGGALPRV LSLLTDRAAT DSDVTHLMEA
LPPLARSARY GDVRGTDSAY LSTVAEQILR RVCVGLAPAV HGLDDDAAER FVRQIDATQG
AATLLGGEGA QAWTAALTAL AVRDTLPGRI AGRVNRILSD SGLVDTDELR RRLGLAMSPG
VEPASAAAWL EGFLQGSGLI LVHDDRLLGL IDTWLLSLPE ERFTAVLPLL RRTFGAFNGP
ERQEIGSAAL RLGTAPAAKR AAPARVDVDT RRAAPALATA LAILTDGKVR T