Gene Ndas_3079 details


Gene Information

Locus tag: Ndas_3079
Symbol:
ID: 9246935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3681640
End bp: 3683781
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003680994
Protein GI: 297562020
COG category:
COG ID:
TIGRFAM ID:
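
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(3681640, 3683781)
print(length)                  # 2142
print(protein_length(length))  # 713
```

Both values agree with the Gene Length and Protein Length fields of this record.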


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.411817
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGTGG GCTCCCCCCT ATTGGTGACG CTCTCCGAGA TCGCCGAGTA CGCCCAGGTG 
CGCCGACCCA GCGTGAGCAA CTGGCGTCGA CGCCATACCG ACTTTCCTCG TCCTGTGAGC
GCTTCCTCGA ACGTACCGCT GTTCGACTCC GACGAGGTAG CCGCATGGCT GGACCGGCGT
CCAGTCCACA AGGCCGCTTC TCCTGTGCAC ACAACCGAAG ACGAGGAGAA CTCCCCGGCG
ACCTATGGCG AGGTGTATCG GACGGGCATC CTACTCTCCG CGGTCACCTC GCACCGTGAA
CTACCTCCCG AAGAGCTCCT GATCGCGGCC CTCCGCGTTG TCTGCGCCTA CCGTTCCGCA
TCCGAGGCGA CTCCGGTCAC CGTTCAGGAA TTCCCACCCG ATCCAGATCT GCACGCCTCG
GTCCATGCCC TGGTGGATGT ACTCGGGCGA GCGGCGGCCA CAGAGCGCCT CATGGACCTG
GCCTCACGGT TGGAGCTCTC CTGGGCACCC GAGCAGGCAC CCCCGGCCGT GTGCACGCTC
GTCAGCAGGC TCCACCAAGT CCTCGCCGAC GGAGCGGGGG ACGCCTCCAT CGTTGATCTG
TCCGCCGGAG CGGGTGCCGG GCTCCTGGCG TTACTCAGCA CTGGAGCACC GCGTTCGGCC
ACCGCGGTGG TGAACGAGGT ATCGCTCGGC GAAATCCTCT CCCTCCGTCT CCGAGCCCAT
GGAATACCGG CCGTGGATAT TCAGGTCAGC GCACCGATCA CCCACGACTC CGACGTGGTG
CTCGCCTATC CGCCCTTCGT TCCCGGTGAG CGGGCCGACC ACGCTGACCA CCCACTCCTG
TGGGCGGAAC AGGCCGTGAG CATGCTTCAG ACCGACGGAC TCGCTTACGT CGTCGTCCCT
GACTGGACAC TCACTCACAC AGGACGGGGC AGCTCCACTC CTCCTGTGGC CGCCTCCCGG
GAACGGCTGC TCCGCAACCG GTGTATCCGA GCGGTGGTCC AACTGCCCCG CCGTATCCAC
CCCAGCCGCC CCGGCGCGGA GCTGGTGCTC CTGGTCCTGA CCCCGCGAGG CGAGGGGGGC
AGCACCGTGA CGCTGTGCGA CGCCGATCGA ATCGCGCGAA CGCAGGGACG CCATACGTCC
GCGAGGACCA ACGGTCAGGG GTGGATTGCC CCATGGGCGG AGGAAACCGT ACTCTCGATC
GCCGAAGCCC ACCGACGCCC AGGCTCCGAG GTATGCCGTT CCTTCACACC AGCCGACCTC
ATGGATCGCC ACCGTGTCCT CCCACTGCTC CCTTCCCAGC GGCTCACACC CTCCTCCCAA
CCACAGGAAC ACATCACTGA GGCCGGAGAA AGCCGACGGG ACGCGACTGT TGCCCTCGCG
GGGACATCAG GGCCGACGCT GGACTGGCTC AACCGGCTCA GGACTCCGCC CCGGCGAACT
CCGACGCGGT ACGAGCGGCT GGGTTCCCTA CTCACCGGTG GGCAGCTACG TCTGGTCCAA
GGCCACCGGA TCAGGACGGA CGACCTCGGT GACGAGGGCC AGACCGTCTA TGGGCGTGAG
GAGATGCTCG GTGAGATCCC GGTCGGGCAG CGCCGGATCA GCCCGCTCGT CCTCGCCGAG
TACCCGTCCG CGCTGGTCAC GGAGCCCGGA GATGTGATCC TACTCTTCGA CGAACGGCTG
CGCACCGTCG TCGACCAGGC AGGGGGGAGC GTCCTGTTGT TCCCCGTACA AGCCTTGCGG
ATCAGGGCCT ACGGAGACCT GCGCAAACCC ATGTCGATGG AACTCGGGCG AGCGTCCTCG
GTGCGGATTT GGCCCCACCA ACTGGCGGCC GTGCTCTCCG CCGGGCGCAA CGCCCGGCGC
GGTCGTGGCT CCCTCGTGCG CAGAGCCGAC CTCGAAGGCG TGGAGATCCC CGTTATGTCA
CCGACGGAGG CCAAGCTCTT CGATGACGCC ATGCGTGAGC ACGCGGCCGA GGTCGAGCGC
CTCCGCCGCC AGCTCACAGC AATGGAGGAC CTCGGCGCCG TGCTGGCCTC CGGAGTCGCG
GACGGTGCCC TGTCCGTCCA GCTCCACCCC CGTGCGCGTC GACCCGACTC GACCAACGTG
CCTTTCGAGG GCACCGCCTC AGACTTCGAC GACGACCACT AG
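
The record does not state how its GC content figure was derived. A common definition is the percentage of G and C bases in the sequence, and the sketch below assumes that definition. The helper name gc_content is hypothetical; it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence above.
print(gc_content("GTGAAGGTGG"))  # 60.0
```

Passing the full gene sequence instead of a single block gives the gene-wide value under this definition.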
 
Protein sequence
MKVGSPLLVT LSEIAEYAQV RRPSVSNWRR RHTDFPRPVS ASSNVPLFDS DEVAAWLDRR 
PVHKAASPVH TTEDEENSPA TYGEVYRTGI LLSAVTSHRE LPPEELLIAA LRVVCAYRSA
SEATPVTVQE FPPDPDLHAS VHALVDVLGR AAATERLMDL ASRLELSWAP EQAPPAVCTL
VSRLHQVLAD GAGDASIVDL SAGAGAGLLA LLSTGAPRSA TAVVNEVSLG EILSLRLRAH
GIPAVDIQVS APITHDSDVV LAYPPFVPGE RADHADHPLL WAEQAVSMLQ TDGLAYVVVP
DWTLTHTGRG SSTPPVAASR ERLLRNRCIR AVVQLPRRIH PSRPGAELVL LVLTPRGEGG
STVTLCDADR IARTQGRHTS ARTNGQGWIA PWAEETVLSI AEAHRRPGSE VCRSFTPADL
MDRHRVLPLL PSQRLTPSSQ PQEHITEAGE SRRDATVALA GTSGPTLDWL NRLRTPPRRT
PTRYERLGSL LTGGQLRLVQ GHRIRTDDLG DEGQTVYGRE EMLGEIPVGQ RRISPLVLAE
YPSALVTEPG DVILLFDERL RTVVDQAGGS VLLFPVQALR IRAYGDLRKP MSMELGRASS
VRIWPHQLAA VLSAGRNARR GRGSLVRRAD LEGVEIPVMS PTEAKLFDDA MREHAAEVER
LRRQLTAMED LGAVLASGVA DGALSVQLHP RARRPDSTNV PFEGTASDFD DDH
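
The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG can act as a start codon and is then read as methionine. The following is a minimal sketch of that translation, not the tool that produced this record; translate_cds and the constant names are hypothetical, and the codon assignments are those of NCBI genetic code table 11.

```python
BASES = "TCAG"
# Amino acids for table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))
# Codons that table 11 allows as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; whitespace is ignored and a stop codon ends translation."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAAGGTGG GCTCCCCCCT ATTGGTGACG"))  # MKVGSPLLVT
```

The output matches the first ten residues of the protein sequence above.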