Gene Ndas_5048 details

Gene Information

Locus tag: Ndas_5048
Symbol:
ID: 9248937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 186980
End bp: 190039
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003682935
Protein GI: 297563962
COG category:
COG ID:
TIGRFAM ID:
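
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 186980
end_bp = 190039

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0
codons = gene_length // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```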


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.114133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.746585
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAGA TGTGCACGCC CACGGGCGGC CTGATCAACC CCGCGGCCTT CCCCATTCCG 
ACCGCCAGTC CCGCCACCCT GGAGAGACAG GCGCGGGACC TGCGCCGGGA GGGGGCCAAC
GTGGCAGGAA TCGGCGGCGA CATCAAGAGC GCCTGGGCGG GGCTGTCCAC GTGCTACTCC
GCGCCCGAGG CCGAGACCCT GTACGCGGTG GTGGACCCGG TCGCGACCGA CGGCGACAAG
GTGGAGACCT CCTTCGACAA GGCCGCGAGC GCGCTGGAGA CCTTCGCCGA GGCCGTCCGG
GACATCAAGG GCAAGTGGTC CACGCTCAAG AGCGACTCCT ACACCTTCCT CAACTCCATC
GAGGGCGACG ACGACTGGCG CGAGGCCGAC GGGTTCGTCG ACAGCCTCCT GGGGCGCGAG
AGCGAGAAGG TCGGGGAGCA CCAGGCGCTG CTCGACCGCG CGGACGCGCT GCGGCGCGAG
TACGAGGAGG CCGAGCGGGC CTGCGCCAAC GCGATCAACG CCGACATCCC CGACCGCACC
AACTTCGTCG CGGGCGACGG CGACGGGGAG TCCGCGCCGG GCGAGTTCGA GCACGGCTAC
GACGGGTACC TCGGCGACGT GGCCATGGCC TGGGGCGGTC CCATGGAGAC CGACCACGGC
TGGTGGGTGG ACGCGGGTGC GGCGGTGGGC GACTTCTTCG TGGGGATCGC GGAGGACGTC
GGCGGCATCA CCGGGATGTA CAGCTCCGAG GGCTGGTTCG AGATGTCCTG GGGCGACGCC
ATGTGGGAGT ACCACGAGGG CAACCTCCAG TCCCTGGCGT CACTGGCGGG CATGTACGAC
TCCGAGAGCG ACAGCTGGGG CTGGTCGGGC TGGGACACCG TCGGCAACGC CTGGAAGGAC
GCCGCGCACG CGGTGGTGCC CTGGGAGGAG TGGAGCGAGC GGCCCGGCTA CGTCATCGGG
ACCGCCCTGC TCAACATCGG CGCGACCGTG GGCGGCGCCG CCCTGACCGC GACCGGCGTG
GGCGCGGTCG TGGGCGTGCC GCTGATGGCC TGGCGCGGTT CGGCCATGCT CAACAGGATG
GGCGGGGACG GGCCGAGCGT CCCCGACGTG GACGTGCCGG ACATGAGCCG GATCAACCTG
AGCCTGCCCC GGTTCGGCAA CGTCTCCCTG GCGGATCTCA GGATCGACCT GGGCCAGCTC
CGGGAGGGCG CCTTCAGCAC CTCCCGGCTG TCGGAGATGC AGAACCTGCT GAGCAGGTTC
ACCGGCGGGG GCTTCTTCGG CGGCGGGGAC GACCCGGGCG GCAACGGTTC CGGTGGGGAC
GGGAACGGCT CCGAGGGCTC CGACAGCCAG CGCGTCGCCA ACCACCACGG CGACGAGGAG
GGCGGCCCGG CCGACACCCG GCCCGCCGTG AACCCGACCA CGCGTGAACT CGACGACAGC
CGGGAGCTCC TGGAGCTGCT GACTTCGCAT CCGGACTCCG CCGAGCTGGG CGACCTCGGG
CGCAGGGTCC TCGGTGGGGA GGGAGGCGAC AGCGACGCCC CCCGGCCGAA CCGTTCCGGG
CAGCCCGACC TGGAGGAGCT GGGCCTGGAC CCGTCCCTGG AGGCGGTGTT CAGCGACATG
AACGCGCACG CCTCGGACTA CCCGGCGTGG GAGGCCAACC AGTCCCCGGA CGGCCCGGAG
TCGGGGAGGG TTCCCGCCCT CGTCGGCGGC GGGAACAACG ACACGTTCGA GGGCAGCCAG
GACGTCTCCA ACCCGCCGTC GTACGACCGG GTGGACCTGA CCGGCACCGG CGGCGACGGT
CCCCAGCGCT TCCCGGACGA CCACCGCGAC GCGGACTTCG GCGACGACGG CCCGCAGATG
CGGGACCGCA CGCCCCTGGT CACCAACAGC ACCGGCGGCA ACGGCAGTGA CACGCTGGAC
ACCGGGGGGA ACTCCCGTGG CGGCACGGAC GTCCTGGACG GGTCCGGCGA CGGCGGTGGC
AACACGCACA GCGCGGGCGG CAACGGACGG GGCCCCGGCG ACACGCGGCC GGGCAGCGCC
GGAGGCGACA TCCTCGGCGG CCGGGGCGGC CAGAACGACG GCCCGGAGAA CACCCCGATC
AGGAACCAGC CCGACCTGCG ACCGGAGAGC ACCGACCCCG GGGTCCGGGA ACTCCTCCCC
AGGGACGGCC TGCGCTTCGG CGAGATCAGG GACGACAACG GCAATGTCAG CAACCGCATT
CTGGAGCCGA ACAGCAGGTA CAGGCTCTAC GAACCGGGCA GCGACCTCCA CACGGACTAC
GTCACCGACG CCGATGGGAA CATCAGGGAG ATCCGGACCG AGTCCAAGGG CTGGAACTCC
GAGCATCCGG AGTACCTCAA CCCCAGGCCG GACATGACCT TCAACGTGGA CGGCTACACC
TACAGGACCG ACGAGTACGG CCGGACGGTC TCCGTCGAGG GAACGCTGCA CAAGGAACCG
AACGTCCGCA ACGAAAACGA ACAGTCCAAG GTCAACAGCC AGGGCTCGGA CTACTACGAA
CAGCTGAACC AGAAGATCCG CGATGACTTC GAAACGGCGA ACGGCCGTCC GCCGGAAGCG
GGTGAGGTCC CCCAGTACCA GGACATCCAG TGGGACGGCG GCCACCTGAT CGGGTACGCG
GAGTTCTTCG GTATCGGCGA GCGCCTGAAC ATGGTCCCGA TGCGGTTCGA CGTGAACCAG
AACAGGACCG AGACCGCTCT GGATGACATC CCCGAAGAAG TACGGGGCGG CATCGAGGGA
AGTTACCGGA ACATCGAGCG CTCCTGGCGT GGAATCATGC GCGACAAGGG CTCCTGGCAT
GGATTCACCA ACCCCAAATT CAATGACGGG AGCTGGGACG CCGCCCTCGC ACTGAATCCG
AATAACCCCA AGATCGACGT TAAGATCACT AACATCTACG ACCCCAGCCT GCCACCGGTG
TACGACAAGT CCGGAAACCG TCACTTGCCA CCGCCGAGCA GGATTGAGGT CGAATGGGTC
CTCAACGGGG TTAGAATGGA AAACCGAGAG TACAACAACG TACCACCCCT CGTGGACTAG
 
Protein sequence
MTEMCTPTGG LINPAAFPIP TASPATLERQ ARDLRREGAN VAGIGGDIKS AWAGLSTCYS 
APEAETLYAV VDPVATDGDK VETSFDKAAS ALETFAEAVR DIKGKWSTLK SDSYTFLNSI
EGDDDWREAD GFVDSLLGRE SEKVGEHQAL LDRADALRRE YEEAERACAN AINADIPDRT
NFVAGDGDGE SAPGEFEHGY DGYLGDVAMA WGGPMETDHG WWVDAGAAVG DFFVGIAEDV
GGITGMYSSE GWFEMSWGDA MWEYHEGNLQ SLASLAGMYD SESDSWGWSG WDTVGNAWKD
AAHAVVPWEE WSERPGYVIG TALLNIGATV GGAALTATGV GAVVGVPLMA WRGSAMLNRM
GGDGPSVPDV DVPDMSRINL SLPRFGNVSL ADLRIDLGQL REGAFSTSRL SEMQNLLSRF
TGGGFFGGGD DPGGNGSGGD GNGSEGSDSQ RVANHHGDEE GGPADTRPAV NPTTRELDDS
RELLELLTSH PDSAELGDLG RRVLGGEGGD SDAPRPNRSG QPDLEELGLD PSLEAVFSDM
NAHASDYPAW EANQSPDGPE SGRVPALVGG GNNDTFEGSQ DVSNPPSYDR VDLTGTGGDG
PQRFPDDHRD ADFGDDGPQM RDRTPLVTNS TGGNGSDTLD TGGNSRGGTD VLDGSGDGGG
NTHSAGGNGR GPGDTRPGSA GGDILGGRGG QNDGPENTPI RNQPDLRPES TDPGVRELLP
RDGLRFGEIR DDNGNVSNRI LEPNSRYRLY EPGSDLHTDY VTDADGNIRE IRTESKGWNS
EHPEYLNPRP DMTFNVDGYT YRTDEYGRTV SVEGTLHKEP NVRNENEQSK VNSQGSDYYE
QLNQKIRDDF ETANGRPPEA GEVPQYQDIQ WDGGHLIGYA EFFGIGERLN MVPMRFDVNQ
NRTETALDDI PEEVRGGIEG SYRNIERSWR GIMRDKGSWH GFTNPKFNDG SWDAALALNP
NNPKIDVKIT NIYDPSLPPV YDKSGNRHLP PPSRIEVEWV LNGVRMENRE YNNVPPLVD
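
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a minimal illustration, not the tool that produced this record. The names `CODON_TABLE` and `translate` are hypothetical. The codon assignments follow the NCBI ordering for table 11, and the first codon (here GTG) is assumed to be read as methionine because it acts as the initiator.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
# Amino acid assignments for translation table 11 (same as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        # Assumption: the first codon is an initiator and is read as M.
        protein[0] = "M"
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = (
    "GTGACCGAGA TGTGCACGCC CACGGGCGGC CTGATCAACC CCGCGGCCTT CCCCATTCCG"
)
print(translate(first_line))
```

Applied to the first 60 bases, this yields the first 20 residues of the protein sequence listed above. A stop codon is returned as `*`, so translating the full gene sequence would end with a `*` for the final TAG.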