Gene Ndas_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0020 
Symbol 
ID9243847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp24238 
End bp25896 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content68% 
IMG OID 
ProductCatalase 
Protein accessionYP_003677978 
Protein GI297559004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0935072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG TATCCAGCCA GGGATCCGCG CCGGGAGACG ACCGCGAGGT GCTCACGAAC 
CGGCAGGGAC ACCCGGTCTA CGACAACCAG AACCAGCGCA CGGTCGGCGA GCGGGGGCCC
GCGACGCTGG AGAACTACCA GTTCCTGGAG AAGATCAGCC ACTTCGACCG GGAGCGCATC
CCGGAGCGGG TGGTGCACGC GCGCGGGGTG ACCGCGTTCG GCTACTTCGA GTCCTACGGC
GCGTGGGGCG ACGAGCCGAT CAGCCGCTAC ACGCGGGCCA AGCTCTTCCA GGGCAGGGGC
AAGCGGACCG ACATCGCGCT GCGCTTCTCG ACCGTCATCG GCGGCAGGGA CTCCTCGGAG
TGCGCGCGCG ACCCGCGCGG GTTCGCGATC AAGTTCTACA CCGAGGACGG CAACTGGGAC
CTGGTGGGCA ACAACCTCGC GGTGTTCTTC ATCCGCGACG CCATCAAGTT CCCCGACGTG
ATCCACGCCC TCAAGCCGGA CCCGGTGACC TTCCGCCAGG AGCCCAACCG CATCTTCGAC
TTCATGTCGC AGACCCCCGA GTGCATGCAC ATGCTGGTCA ACCTGTTCAG CCCGCGCGGC
ATCCCGGCGG ACTACCGGCA CCAGCAGGGC TTCGGCGTCA ACACCTACAA GTGGGTCAAC
GACGTGGGCG AGACCGTCCT GGTCAAGTAC ACCTGGATGC CCAAGCAGGG CGTGCGCAGC
ATGACCGAGG CCGACGCCGC CAACCTCCAG GCGGACGAGA CCGGGCACGC GACCAAGGAC
CTGCACGAGG CCATCGACCG CGGCGATTAC CCGGAGTGGG AGCTGCTCGT GCAGATGATG
AGCGACGAGG AGCACCCCGA GCTCGACTTC GACCCGCTGG ACGACACCAA GACCTGGCCG
GAGCAGGACT TCCCGCCCAA GGCGGTGGGG CGGATCGTGC TCGACCGGAA CGTGTCGGAC
AACTTCGCGG AGAACGAGCA GATCTCCTTC GGCACCGGCG TGCTCGTGGA CGGCCTGGAC
TTCTCCGACG ACAAGATGCT CGTCGGGCGC ACCTTCTCCT ACAGCGACAC GCAGCGCTAC
CGGGTGGGGC CCAACTACCT CCAGCTGCCG GTGAACCAGG CCAAGAACGC CGACGTGCGC
ACCAACCAGC GCGACGGCCT GATGGCCTAC CACCAGGACT CCGGGGGCGA GAACCCGCAC
GTCAACTACG AGCCGTCCAT CAACGGCGGC CTGCGCGAGG GGCAGTACCC CACGCACGAC
GAGCAGGGGC CGGAGATCCG GGGGCGGATG ACGCGCAAGC GCATCTCCCG CACCAACGAC
TACCAGCAGG CGGGGCAGCG GTACACGCTG ATGGAGGAGT GGGAGCGCGA CGACCTGGTG
CGCAACTTCA TCGGACAGCT CTCCCAGTGC GACCGGCCGA TCCAGGAGAG GATGGTCTGG
CACTTCCTCA TGGTCGACGA CGACCTGGGG CTGCGCGTCG GCGAGGGGCT GGGCATCGGC
CCGGGCGACG TGGCGCACCT GGAGCCGCTG CGGAGCCAGA CCCTGGACGA GGGGGAGCGC
CAGCGCATGG CCAACCTGGG CAAGAACGGC CCCCGGGACG TGTCGGGGCT GACGATGACC
CACTGCGTGC CCAACCAGCG GCACGTGGTG GAGCGCTGA
 
Protein sequence
MTDVSSQGSA PGDDREVLTN RQGHPVYDNQ NQRTVGERGP ATLENYQFLE KISHFDRERI 
PERVVHARGV TAFGYFESYG AWGDEPISRY TRAKLFQGRG KRTDIALRFS TVIGGRDSSE
CARDPRGFAI KFYTEDGNWD LVGNNLAVFF IRDAIKFPDV IHALKPDPVT FRQEPNRIFD
FMSQTPECMH MLVNLFSPRG IPADYRHQQG FGVNTYKWVN DVGETVLVKY TWMPKQGVRS
MTEADAANLQ ADETGHATKD LHEAIDRGDY PEWELLVQMM SDEEHPELDF DPLDDTKTWP
EQDFPPKAVG RIVLDRNVSD NFAENEQISF GTGVLVDGLD FSDDKMLVGR TFSYSDTQRY
RVGPNYLQLP VNQAKNADVR TNQRDGLMAY HQDSGGENPH VNYEPSINGG LREGQYPTHD
EQGPEIRGRM TRKRISRTND YQQAGQRYTL MEEWERDDLV RNFIGQLSQC DRPIQERMVW
HFLMVDDDLG LRVGEGLGIG PGDVAHLEPL RSQTLDEGER QRMANLGKNG PRDVSGLTMT
HCVPNQRHVV ER