Gene Ndas_0376 details

Gene Information

Locus tag: Ndas_0376
Symbol:
ID: 9244211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 458162
End bp: 459886
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: type III restriction protein res subunit
Protein accession: YP_003678330
Protein GI: 297559356
COG category:
COG ID:
TIGRFAM ID:
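
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 458162
end_bp = 459886

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1725 574
```

Under these assumptions the computed values agree with the listed gene length (1725 bp) and protein length (574 aa).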


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGA CGCAGACCAC GACGGAGGAC CGGCTCAGAG GCCTCCGCGC ATGGCAGCGG 
GAGGCGTTCG AGGAGTACTT CCGCCGGGAA CCGCGGGACT TCCTCGCCGT GGCCACCCCC
GGTGCGGGCA AGACCACCTT CGCCCTCACC CTGGCCAGCG AACTCCTCCA ACGCCACACG
GTGCGCGCCA TCACCATCGT GTGCCCCACC GACCACCTCA AGAAGCAGTG GGCCGAGGCG
GCCGCCCGGT TCGGCATCGC CATCGACCCC GAGTTCCGCA ACGGCCAGGG CGCCCTGGGC
CGCCAGTACG TCGGCGTGGC CGTCACCTAC GCCCAGGTCG CCGCCCACCC GATGCTGCAC
CGCAACCGGA CCGAGGCGCG CAAGACCCTC GTCATCTTCG ACGAGGTCCA CCACGCCGGG
GACGCCCTGT CCTGGGGCGA CGCGGCCCGC GAGGCCTTCG ACCCGGCGGC GCGCCGCCTC
TCCCTGACCG GGACCCCCTT CCGGTCCGAC ATCAACCCCA TCCCCTTCGT CGACTACGTC
CAGGACAGCG CCGGGGTGCG CCGCTGCTCC TGGGACTACA GCTACGGGTA CGGGCCCGCC
CTGGCCGACG GGGTCGTGCG CCCCGTCATC TTCATGGCCT ACTCCGGCGA GATGCGCTGG
CGCACCCGCG CGGGCGACGA GCTCGCCGCC AGGCTGGGCG AACCCCTCAC CCAGGACGCG
CTCTCCCAGG CGTGGCGGGC CGCGCTGGAC CCCAAGGGCG ACTGGATCAA GCGCGTACTC
CAGGCCGCCG ACCGCCGCCT GACCGAGGTC CGCAAGACCC ACCCCGACGC GGGGGCCCTG
GTCATCGCCA GCGACCACGA GAACGCCCGC GCCTACTCGC GCATCCTGCG CCAGATCACC
GGCAAGGGCG CCACGGTCAT CCTGTCCGAC GACCCCGGGG CCTCCAAGAA GATCTCCCGG
TTCGCCGCGG GCGACGACCG CTGGATGGTC GCGGTGCGCA TGGTCTCCGA GGGGGTGGAC
GTGCCCCGGC TGATGGTGGG CGTGTACGCC ACCTCCACCA GCACCGCGCT GTTCTTCGCC
CAGGCCATCG GCCGCTTCGT GCGCGTGCGC CAGCGCGGCG AGGTCGCCTC GGTCTTCCTG
CCCTCCGTGC CCACCCTGCT GGAGTACGCG GGCGAGATGG AGCGCGAGCG CGACCACGTG
CTCGACCGGA CCCCTGGGGA GGGCGACGAG TACCCGGAGG AGGACCTGCT CCGGGAGGCC
AACAAGAAGC GGGACACCCC CGACGCCGGG GAGGAACTGC CCTTCGAGAC CATGGAGTCG
GCGGCGGAGT TCGACCGCGC CCTCTACGAC GGGGCCGAGT ACGGCGGCGT GCCGGGCTCC
ACGGAGGAGG AGGACTTCCT GGGCCTGCCC GGTCTGCTCG ACCCGCAGCA GGTCTCCCAG
CTCCTGCGCA AGCGCAAGGC CGACCTCAAG GCGAGCGAGG TCAAGGCCCG CAAGGTGGAG
GAGCCCGCCG AGGAGGACGG GCCCACCCAC CAGGTCCTGG CCGACCTGCG CCGCGAGCTG
AGCGGTCTCG TGGGCGCCTG GCACCACCGC ACCGGCAAGC CGCACGGAGT GATCCACAAC
GAGCTGCGCC GCGCCTGCGG CGGGCCGCCC GTCGCACAGG CCACCCCGAC GCAGATCCGC
GAACGGATCG CCAAGATCCG CGTCTGGGCC GTCGGCGGGC GATAG
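
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks must be removed before it can be measured or analysed. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGACGGTGA CGCAGACCAC GACGGAGGAC CGGCTCAGAG GCCTCCGCGC ATGGCAGCGG"

seq = clean_sequence(first_line)
print(len(seq))            # prints: 60
print(gc_fraction(seq))    # GC fraction of this first line only
```

Passing the whole sequence block to `clean_sequence` in the same way should give a string whose length can be compared with the listed gene length, and whose GC fraction can be compared with the listed GC content.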
 
Protein sequence
MTVTQTTTED RLRGLRAWQR EAFEEYFRRE PRDFLAVATP GAGKTTFALT LASELLQRHT 
VRAITIVCPT DHLKKQWAEA AARFGIAIDP EFRNGQGALG RQYVGVAVTY AQVAAHPMLH
RNRTEARKTL VIFDEVHHAG DALSWGDAAR EAFDPAARRL SLTGTPFRSD INPIPFVDYV
QDSAGVRRCS WDYSYGYGPA LADGVVRPVI FMAYSGEMRW RTRAGDELAA RLGEPLTQDA
LSQAWRAALD PKGDWIKRVL QAADRRLTEV RKTHPDAGAL VIASDHENAR AYSRILRQIT
GKGATVILSD DPGASKKISR FAAGDDRWMV AVRMVSEGVD VPRLMVGVYA TSTSTALFFA
QAIGRFVRVR QRGEVASVFL PSVPTLLEYA GEMERERDHV LDRTPGEGDE YPEEDLLREA
NKKRDTPDAG EELPFETMES AAEFDRALYD GAEYGGVPGS TEEEDFLGLP GLLDPQQVSQ
LLRKRKADLK ASEVKARKVE EPAEEDGPTH QVLADLRREL SGLVGAWHHR TGKPHGVIHN
ELRRACGGPP VAQATPTQIR ERIAKIRVWA VGGR
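
The protein sequence corresponds to the gene sequence read codon by codon. The Python sketch below illustrates this on the first and last stretches of the gene. It builds a codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in its amino acid assignments). The names `CODON_TABLE` and `translate` are hypothetical, and the sketch reads from the first base on the given strand without checking the start codon.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases and last 45 bases of the gene sequence above.
print(translate("ATGACGGTGA CGCAGACCAC GACGGAGGAC"))
# prints: MTVTQTTTED
print(translate("GAACGGATCG CCAAGATCCG CGTCTGGGCC GTCGGCGGGC GATAG"))
# prints: ERIAKIRVWAVGGR*
```

Both outputs match the start (MTVTQTTTED) and end (ERIAKIRVWA VGGR) of the listed protein sequence, with the final TAG codon read as a stop.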