Gene Ndas_4622 details

Gene Information

Locus tag: Ndas_4622
Symbol:
ID: 9248503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5488246
End bp: 5491320
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003682514
Protein GI: 297563540
COG category:
COG ID:
TIGRFAM ID:
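
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue. All variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 5488246
end_bp = 5491320
reported_gene_length = 3075      # bp
reported_protein_length = 1024   # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after whole codons:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.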


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.119976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.339061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACCT TGTCCGACGA AGGCCTGATC GACCCCAGCG CGTTCCCCAT CCCCGACACC 
CTCACCTACC TCCTGGACTC GGCGGCGGCC AAGCTCAAGG CCGACGGGAC CGACCTGACC
GACACCGCGG GTGACATCAC CGGTGCGTGG GCGGGCCTGG ACGGCATCTA CTCCGCACCC
GAGTCCGCGA CGCTGTTCAA CGTCCTGAAC CCGCTCACCG GGGACGCCGA CGAGGTGTCG
TCGGCGCTCT CCAGCGCCTC CGACGCGCTG ATCGACTTCG CCGAGAAGGC GCGCGACATC
AAGGACCGCT GGTACACGCT GCGCAGCGAC TCCTACGCCT TCCTCCAGAG CATCGACTAC
GGGCGTGACG AGGACTGGGA CGAGGGCAGC GGCATGCTCT GGTGGAAGGA GGAGAGCCCC
AAGGTCGCCG AGCACAACGC CCTGCTCGAT CGGGCCGCCG CGCTCAAGCA CGAGTTCGAG
GAGGCCGAGC GCACGTGCGC GAACGCCATC ACCGCGCTCT TCGGCGGCAC CACCTTCATC
GCCCAGAGGG CCGACGGCAG CGTCACCCCG GGCGCGGGCG AGTTCGTCTA CGGTTTCGAC
GCGCCCCTCG AAGGCGTCGA GATGGAGTGG GGAGCCCCCC AGACCAGCGA CTACGCCTGG
TACACCGACG CCACGGACGC CGTCGGGGAC TACGTGGTCG GCATGGCCGA GGACCTCGGC
GGCATGGTCG GGGCGCACGG CCCCGAGGGC TGGTTCTCCG GTTCGTGGGG CGACAACCTC
TGGGACTACT GGGGCGGCAC CGTGGAGGGC CTCGGCTCCC TGGTGGGCGC GTCCCGGGAC
GAGAACGGCA ACTGGGGCTG GTCCCTGGAG ACCGCCGGGA ACGCCTGGAA GGAGGCGGCC
CACTCGGTGG TCCCGTGGCG GGAGTGGGGC GACCGGCCCT GGTACGTCAT CGGGACCGCG
GCGCTCAACA TCGGCGCCAC GGTGGGCGGC GCCCTGCTGA CCGCCACCGG CGTGGGCGCG
GTCGTCGGCG TCCCGCTCCT GGCCTGGCGC GGGTCGCGGA TCCTGGACGG GGTCAACGGC
GGCCGTGCGG ACGCCGACCT GCCCGACGGC GGCGGGGTGG ACCTCCAGAC CCTCATGTCG
CGCGTGCCCA GCTTCGGCGA CGGGTCCATC CAGCCGGTGG ACCTCAGCAG GCTCGCCGAC
CTGGATCTGG ACCAGGGCGA GTTCGGCCGG ATGACCGACG CCCTGCAACG GCTCAACGAC
CTCGACGGAG GCGACGGCGC CTCCCCCGAC CGGCCCGGCA ACAGCGACAA CCGCACGGTC
CCGGTCGACG ACGCCGACGG GGTCGGCGGC GACAACGGGT CCGCGCCGCG CGCGAACACC
GCGCCCGGGG ACGAGCCCGA GGCCGCGGAG CGGCGCGGGT CCTCGGACCG GGACCGCGAC
GACGCGGAGG GCGACGGCGC CGACCAGGAC GCCGAGGAGC CGACCTATCC GACCACCGAG
CTGCTCGACT CCAGCCAGGA CTTCCTCGAC GGCGTCGACC CCGAGTCCGT CCGGGGCCTG
CGCGAGGGGA TGGACGGCCA GGAGAACGAC TGGGTGACCT CGCAGGTACC CGACGACGTC
TCCTCGGTCA ACGACACCCC GGTCCAGCGC TACGAGTCCG AGCCCTCCCG CGTGGAGGCC
GGGAACGAGC GCCCCGAGGA ACTCGTCCTC GCGGGCCGGG GAGAGCACGA GATCGCGGAG
GGGGCGGACG GCGAGCGCGT CGACGCCCGC CACGACACCA CGGTCGACAA CAGTGCGGGT
GGAACGACGT CCCTCGACAG CCCGCGCGGA GGCACGACGG TCATCGAGAC CGACTCCGGC
CGCGGCGGTT CCGGCGGCGG CACGGGCGGT GGCGGCACCG GCGACGGACC GCCGCACTCG
CCCGACCCCG GCGACGGGAA CGGTTCCGGA CCGGACGATC CCCTCGACGG CGACAGCGGT
GACGGCGGCG GTGACGAAGA CCTGCCGAGC AACGTCCAGG ACCTGGCCAA CCACCGCTGG
GGTGACACCC CCGAGGGCAG GAAACGCTTC TACCAGCACT TCCAGGACCT GCTGAACGAC
CGCTCCAACG GGGTGTTCGA CCGGTTCTAC CAGGACAACG GCTACCGCAG GAGCCGGTTC
ACGAAGGTGG GTCCCGAGGA GATCCCCCTC CCCAAGCTGA CCTGGGACAA GCACAACGGC
CGGTGGATCG TCCAGGAGAC CCTTCCCGCC GCGGACCCGC CGGACTACAA GGGAGACGTG
AACAAGCGCG ACGCGCTCGT CTCACGGGAT CCCGCGAACG ACGGCTACAC CTACCTCGAC
GAGCTCGCGG AGAGGCGGCG CAGCGCCATC ACCGCGGACG GCACCGCGGG CACCAGGCTC
CAGGCGCTGG AGGCGCGGTA CAAGGAGCAC CTCGACGCGG GCGGCAGCGT TCCCCGGGAC
CTCGCCGCGG CCAGGGAGGT CTACGCCGAG CGGCACACCG ACATGCTCAG GGCCTCGGAG
GGCTTCGGCG AGGCGACGGC CGACATCGCG ATCCTCGACC AGTTCGACGG CACACACCCG
GCCCTCGACT CGGCGGGCAG GCCGGTCCTG GACGAGAACG GCGACCCCGT CAACAAGCCC
AGGGTCACCG AGCAGATCAC CCTGCCCGAC ACCTCCCCCC GCAACGGGAA CCACCAGTTC
GACCAGGTCT GGCGCACGGA GGACGGCGGG CTCGTCGTCG TCGAGGCCAA GAGCAGTACG
GACACCCAGC TCGGCACGCG CTTGGCCGAG GTTCCGAACA GCACGCCGGT CAGGGTGTCC
CAGGGAACGA GGGCGTATCT GGAGTCCATT CTGGAGAGCA TGCGGGAACG CGGGGAAAGC
GACGCCAGAA ACGCGTTCAC CGAGGAGGAC CTGGCCGACG AGATCCAGGC CGCTCTGGAG
AACGGCAAGC TCCACTACGT CGAGGTCAAG GGCAATCCCA TCACCAATTA CGATGACCAG
GGCCAAATCC TGGATTCCCG GAGCGAAGGT TACTCCTTCC GAGAATTCGA CCTGGACCGG
AGGGCAGGAC GTTGA
 
Protein sequence
MTTLSDEGLI DPSAFPIPDT LTYLLDSAAA KLKADGTDLT DTAGDITGAW AGLDGIYSAP 
ESATLFNVLN PLTGDADEVS SALSSASDAL IDFAEKARDI KDRWYTLRSD SYAFLQSIDY
GRDEDWDEGS GMLWWKEESP KVAEHNALLD RAAALKHEFE EAERTCANAI TALFGGTTFI
AQRADGSVTP GAGEFVYGFD APLEGVEMEW GAPQTSDYAW YTDATDAVGD YVVGMAEDLG
GMVGAHGPEG WFSGSWGDNL WDYWGGTVEG LGSLVGASRD ENGNWGWSLE TAGNAWKEAA
HSVVPWREWG DRPWYVIGTA ALNIGATVGG ALLTATGVGA VVGVPLLAWR GSRILDGVNG
GRADADLPDG GGVDLQTLMS RVPSFGDGSI QPVDLSRLAD LDLDQGEFGR MTDALQRLND
LDGGDGASPD RPGNSDNRTV PVDDADGVGG DNGSAPRANT APGDEPEAAE RRGSSDRDRD
DAEGDGADQD AEEPTYPTTE LLDSSQDFLD GVDPESVRGL REGMDGQEND WVTSQVPDDV
SSVNDTPVQR YESEPSRVEA GNERPEELVL AGRGEHEIAE GADGERVDAR HDTTVDNSAG
GTTSLDSPRG GTTVIETDSG RGGSGGGTGG GGTGDGPPHS PDPGDGNGSG PDDPLDGDSG
DGGGDEDLPS NVQDLANHRW GDTPEGRKRF YQHFQDLLND RSNGVFDRFY QDNGYRRSRF
TKVGPEEIPL PKLTWDKHNG RWIVQETLPA ADPPDYKGDV NKRDALVSRD PANDGYTYLD
ELAERRRSAI TADGTAGTRL QALEARYKEH LDAGGSVPRD LAAAREVYAE RHTDMLRASE
GFGEATADIA ILDQFDGTHP ALDSAGRPVL DENGDPVNKP RVTEQITLPD TSPRNGNHQF
DQVWRTEDGG LVVVEAKSST DTQLGTRLAE VPNSTPVRVS QGTRAYLESI LESMRERGES
DARNAFTEED LADEIQAALE NGKLHYVEVK GNPITNYDDQ GQILDSRSEG YSFREFDLDR
RAGR
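
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start position. The sketch below is a minimal illustration, not the tool that produced this record. It builds the codon table shared by NCBI tables 1 and 11 and translates the first and last blocks of the gene sequence. The function name `translate` and its `initiator` parameter are hypothetical.

```python
from itertools import product

# Codon table shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, initiator=True):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so sequence blocks can be pasted as shown.
    If initiator is True, a recognized start codon at position 1 is
    translated as M.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_block = ("GTGACGACCT TGTCCGACGA AGGCCTGATC "
               "GACCCCAGCG CGTTCCCCAT CCCCGACACC")
last_block = "AGGGCAGGAC GTTGA"

print(translate(first_block))                   # MTTLSDEGLIDPSAFPIPDT
print(translate(last_block, initiator=False))   # RAGR
```

The two outputs correspond to the first 20 residues and the last 4 residues of the protein sequence listed above. The final TGA codon is a stop codon and yields no residue.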