Gene Ndas_2520 details


Gene Information

Locus tag: Ndas_2520
Symbol:
ID: 9246371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2987736
End bp: 2990033
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003680445
Protein GI: 297561471
COG category:
COG ID:
TIGRFAM ID:
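
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
start_bp, end_bp = 2987736, 2990033
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "2298 765"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.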


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.556317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00220915
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGACA CCACCCCCCC GCACGCCGAC CGGGGCTACG CCGCCGGACG GCTGGCCGAG 
GAACTCACCA CCGCCCTCAC CCACCAGGAC GCCCGCGTCC GCCAGGCCGC GCGCGCACGC
GTCGACTCCT GGGAACGGGT CGTCACCGGC ATGGGGGACG GCACCCTCAC CATCGGCTCC
CGCACCCCCG TCAAGGACCT GCCCGCCTGG GTCACCCCCG AGGTCGTCCA CGGCGGCTTC
GCCACCGGCC TGCCCGCCGC GGGCGGCCCC CTGCGCCCCC ACGAGCGCGA ACTCGCCCGC
CGCCGCGGCC TGCCCGAACA GCGCGCCGCC CTCCACGCCC ACCACCTCAC CGAGCGGGGC
CTGGCCGACC TCGACACCCT CCTCGACAGC GGCGAGTACG AACTGGACCT GCCAGAACAG
GCCGTACCGC TCACCGCCGC CTGGCTGCTG CGCAACGGCG ACACCGAGGC CGCCCTGCAC
CTGCTCGCCA CCGTCGAACC CCTCGCCGAC ACCCTGTGCC TCACCCCCCG CCCGGCGCCC
CGCCAGGACC TGCCCGCCCG CACGGTCTTC CGCCAGAGCG TCGGCGACGC GCGCCAGGCC
CTGGCCGAAC GCGCCGCCCG CGACCCGGCC ACCAGCCGCC CCCGGGCCCA GCAGGAGGCC
CTGGCCGTGT GGAACCCCTT CGCCGACCGG GTCCTCATCC ACTGGCTGGA GACCGTCCGC
GACGGCCGCG CCGACGCCCA CCGCCCCACG GGCTGGACCC GCCGCGGCGC CGCCCTGCTC
GCCGAGTACG AACGCCTGGC CGCCGAGCAC ACCCTGTGCA CCAAGCACCG CAAGCCCAAG
GAGAACCTCG CGATCCTGCT CGCCGCGCTG CGCGAGGCGG TCGCCGAGCC CGGCGCCGAA
CTCACCCCCC GCCGCCGCGG CCTGCTCCGC CACGCCGTGG ACTCGATGGT GGCCAAGCGC
GGCCTGCCCG GCTCCGAGCG GCACACCGCC CTGCGCGCCG GACAGGCCGA GCACGCCGCC
CGCCCCACCC ACGACGTGCT CGCCGCGCTG CTGGCCGACC GGCTCGCCCC GCTCCCCCAG
GCCATCGGCA CCCCGCACAC GGCGAAGCTG CTCACCCCCG TCTCCGCCGA GGAGGCACGC
GAGCGCACCG TGCCCGAGGG CTGGCCCATC CCCGGGGCCC TGGGCGACGT CGTCCGCAGG
GCCACCGCCG CTCCCCTGGA CGACCTGGTG GAACTCGGCG TGATCCCCTC CGCCGAGGTG
ATGGCCGAGG TCGTGCCCGC CCTCACCGCG GAGGCCGAGG CCGCCTCCGC CGCGGACCCC
GCCCTGGCCC GGCTGCTCGC CGCCCACCAC CGGGCGTTCA GCCGCAGGCG CTCCCTGCTC
CTGCTCAACC TGCAGAGCCA GGTCCGCGCG GACGAACTGC CCTGGACACG GGCGCTGCTG
GCCCACCGCT CCGGCCCCGA GGCGCGCCGT GAGACGGTGG CCGAACTGCT GTCCGCACTG
GGCTCCTGCG TGCTCACCCA CTTCCCCGGC ACGATCGTGC CCAACCCGAT GGTGAGCCAG
CTGTCCGCCC TGTCCCGCTC CGCGGACCTG CGTCTGCCGC TCGTGGAGGA ACTGGCCGCC
GACATCTTCA CGGGCGGGTT CTCCCCCAAG TTCACGCGCG CCGCCAAGGC CGCCGCCGCC
CTGCTGGGCC AGGGCCTCTA CAACCGCTAC TACGGCATCG ACCCCGACCG GGTCCTGGCC
CTGGAGGAGG AACCGACCCG GACCGGCGGC GGGGCCACGG ACTTCGCCCG GCTGTGCCGC
GAACGCGCGG GCGACCCGCG CCGAAGCGTC GTGGGCAACG GGATGGTCAT CGAGCAGGCG
CAGATCCTCA CCACGCACAA CCTGGCCGTC CTGGTCGGGG CCGGGGCGCG GCCCGCGTGC
GGGTGGGCGG AGCTGGCCCG GCGCGCCCAC ACCCTGACCG TGCGGACCGT GGAGAGCCTG
CCCCGGGTGT TCCCGCCCCT GCCCCACGTC AAGAACGCGG CCTTCGCCTG GCGGCAGACG
GTGTTCTTCC TGTCCCTGTG CACCGAGGCA GAGCAGCGCG AGGTGACGGC GTGGATGGTG
GGGCTGGCCC CCGACCAGCC CCACCACACG CGCGAACGCC TGGAGCCGGT GCTGGCCGGA
CTGCGCCGGG TGGTGGAGGG AGAGCCCCTG GAGGAGACCG GGGCCCCCAC GGGAAGGCGC
CGCTTCCTGG GCTGGGCCGG GCGGCACTGG ATGCTCCAGA TCGCCCCGCG GGCCCTGCGC
GGCGCGGACC GTCCCTGA
 
Protein sequence
MDDTTPPHAD RGYAAGRLAE ELTTALTHQD ARVRQAARAR VDSWERVVTG MGDGTLTIGS 
RTPVKDLPAW VTPEVVHGGF ATGLPAAGGP LRPHERELAR RRGLPEQRAA LHAHHLTERG
LADLDTLLDS GEYELDLPEQ AVPLTAAWLL RNGDTEAALH LLATVEPLAD TLCLTPRPAP
RQDLPARTVF RQSVGDARQA LAERAARDPA TSRPRAQQEA LAVWNPFADR VLIHWLETVR
DGRADAHRPT GWTRRGAALL AEYERLAAEH TLCTKHRKPK ENLAILLAAL REAVAEPGAE
LTPRRRGLLR HAVDSMVAKR GLPGSERHTA LRAGQAEHAA RPTHDVLAAL LADRLAPLPQ
AIGTPHTAKL LTPVSAEEAR ERTVPEGWPI PGALGDVVRR ATAAPLDDLV ELGVIPSAEV
MAEVVPALTA EAEAASAADP ALARLLAAHH RAFSRRRSLL LLNLQSQVRA DELPWTRALL
AHRSGPEARR ETVAELLSAL GSCVLTHFPG TIVPNPMVSQ LSALSRSADL RLPLVEELAA
DIFTGGFSPK FTRAAKAAAA LLGQGLYNRY YGIDPDRVLA LEEEPTRTGG GATDFARLCR
ERAGDPRRSV VGNGMVIEQA QILTTHNLAV LVGAGARPAC GWAELARRAH TLTVRTVESL
PRVFPPLPHV KNAAFAWRQT VFFLSLCTEA EQREVTAWMV GLAPDQPHHT RERLEPVLAG
LRRVVEGEPL EETGAPTGRR RFLGWAGRHW MLQIAPRALR GADRP