Gene Ndas_2989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2989 
Symbol 
ID9246842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3571686 
End bp3573644 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content79% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680905 
Protein GI297561931 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.276184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.189562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGAGC CCGACACGGT GACCTTCGCC CCCGAGCAGC ACCGGCCCCC CGGTGCCCCG 
ACGCGACCGG GGGGAACACC GTCCAGCGGT GTGGTGGTCG GTGTCGACCT GCGCGGCATC
CAGGCCTACG TGTACAGCGG CCGGCGCATC CTGGACGCGG TGGGGCGCGC CGCCCTCGTC
GCGGAGCTGA CCGACACATC CGACGCCGAG CACGGGATCG CCGACCTCGT ACCGCCGGAC
TGCGTGGTGC TGCGCGACGC CGGAGGGGCG CTCACCGCGG TCTTCCCCGA CGCGGCCTCG
GCGCGCGGGT TCACCGCCCT CTACACCCGC AGGCTGCGCG ACCGCGCCGC GGACCTGACC
CCCGTGGTGG CGCACGTGGC CTACGGTCCC GGGGCCGGGC ACGAGGCCGG GGGACCGGGC
GGCGCCGCCC CGGAGCACAC GGGCACGACC GGATCCCCGG GGCCCGACCG CGCGCCGGGC
GCCGACACGA CGGCGGCCGC CGGGGTGGAC GAGGCGTTGG CCCTGCTCCC GGCGCGGCTG
CGCGAGGCGC GGCGGCACAT GTCGGCCCTG CACACCCCCG CGCACGGGTA CGGGATCACC
GCCGTGTGCT CGGTCAGCGG AGGCCCCGCC GAGTCGGTGG ACAGCAGCCG CGTCCAGGAC
GACCACCCCG ACGTGCACGA GAGGGTGGCC GCCGACGTCG CCCGGGCCCG GGGGATCGGC
CGACGCTGGC ACCGCGCCCA CAGCTCCGAC TGGCTGGCCG GTGCCGTCAC CGCCGCCGGA
GCCCCCGCCC TGACGCTGCC GATGGAGGTG GACCGCCTGG GCCGCGACCA CGGCGGCCTC
AGCCGCGTCG CGGTCGTCCA CATCGACGTC AACGGCCTGG GCGCCCTCCT GGGCGAGTAC
CGCGAACGCG CGGGGGACCC GGCCGTGCCG GGTTCGGGCG CCTTCGCCCA GCGCCGCCTG
TCCACCCGCA TCGCCGGGCT CACCGAGGGG CTGGCCCGTG TCCTCGTGCG CGCGGTGGCG
GCCACCGTCC ACGCCGGACC CGGCCAGCGG CCCCTCATTC CGGGAACCGG CGCCGCCGCG
CCGATCACCC TGCACCATGA GCCGCCCGGT CCCGTGTCCC TGCCGGTCCG CCCCGTCGTG
GTCGCCGGGG ACGACCTGAC CGTGCTGTGC GACGCCCGGA TCGCCCTCAG CCTGGTGCGC
TACGCCCTCG ACTGGCTGGA CGCCGACCCC GAGCGCGTCG GCGACGACCG CGACCCCCGG
GTGGGCCTGC ACCGCGCGCT CGCCGAGGCC CACGGCGACC GCCCCGGCGG CGGGCGCGCC
GTCGTCGCGC CGGACGGCGC GGCGCACACC ACGTTCGTGC CCACGGTGGG GGTGGGCGTG
GCCGTGCAGC CGGTGGGCGC TCCGTTGTCG CTGGGCTACG ACCTGTGCGA GGCGATGTGC
CGCCGCGCCA AGGAGCACCG CCTCCAGACC GCCGAGGCCG ACCCCGGAGC GGGCGACGAG
CACGCCGTGG CCTGGACGAC GCGCCTCGAC GGCGTCGGAC GGGTGCTGCG CCGCCTGGAC
CGGGCCCGCC GGGCGCCGGT GCCGCGGACC GCCCTGCCCC TGACCGGAAC CGGGTTCGCC
CGCTTCCTGG ACCGGTACCT GTCCGCGACC GCGCCGGGCA GCCTGCTCGC CGACGGGGAC
ACCCGCCAGC GGGGCTGGCT GGTCTCCGCG CTGGTGCCCC TGTTGGAGTC GGGCGCCGAC
CCGGAGCCCG AACTCGCCCG ACGCGCCCAC GCGACGGGCG GGCCCGTCGA CCTGCCCCGA
GGCTGGACGC CCGGCGGCCT GCTCGACGCG GTCGAGGTGA TGGACCTGCA CCTGGACCCC
GGGCTGGCGG CGGCCCTCGA CCCCCACCGC CCCCAGCGGC GGGCGGGGAA CGGCGGTCCG
CGTACCGGGG CGCCCCGTCC GCGGGGCGCG TTCCGGTGA
 
Protein sequence
MEEPDTVTFA PEQHRPPGAP TRPGGTPSSG VVVGVDLRGI QAYVYSGRRI LDAVGRAALV 
AELTDTSDAE HGIADLVPPD CVVLRDAGGA LTAVFPDAAS ARGFTALYTR RLRDRAADLT
PVVAHVAYGP GAGHEAGGPG GAAPEHTGTT GSPGPDRAPG ADTTAAAGVD EALALLPARL
REARRHMSAL HTPAHGYGIT AVCSVSGGPA ESVDSSRVQD DHPDVHERVA ADVARARGIG
RRWHRAHSSD WLAGAVTAAG APALTLPMEV DRLGRDHGGL SRVAVVHIDV NGLGALLGEY
RERAGDPAVP GSGAFAQRRL STRIAGLTEG LARVLVRAVA ATVHAGPGQR PLIPGTGAAA
PITLHHEPPG PVSLPVRPVV VAGDDLTVLC DARIALSLVR YALDWLDADP ERVGDDRDPR
VGLHRALAEA HGDRPGGGRA VVAPDGAAHT TFVPTVGVGV AVQPVGAPLS LGYDLCEAMC
RRAKEHRLQT AEADPGAGDE HAVAWTTRLD GVGRVLRRLD RARRAPVPRT ALPLTGTGFA
RFLDRYLSAT APGSLLADGD TRQRGWLVSA LVPLLESGAD PEPELARRAH ATGGPVDLPR
GWTPGGLLDA VEVMDLHLDP GLAAALDPHR PQRRAGNGGP RTGAPRPRGA FR