Gene Noca_3166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3166 
Symbol 
ID4600151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3365367 
End bp3366950 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content70% 
IMG OID639777772 
Productgriselysin 
Protein accessionYP_924355 
Protein GI119717390 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGAA TCTCGGGCGT CGCCACGGCG GCGGTTCTCG CTCTCACCGC CGTCGGCATC 
CAGTCCGGCG CCAGCCAGGC CGTGACCGAG CGGCCGAACG CGGTCCAGGC CCTGCTCGCC
CATCCGGGCG CGGCGCTGGC CAGCAACGGA ACGGCGTTCA CGGTCACCCA CACCGTGACC
GACGCCGACG GCAGCACCCA CGTCCGCATG GACCGCACCT ACCGTGGCCT GCCCGTGCTC
GGTGGGGACC TGGTCGTCCA CCGCGGTACC CAGGGCGGGT GGCGCGGGGT GAGCCAGACC
CTCGAGAACG AGGTGCACGT CTCGACGACG CCGGCGGTCG GGAAGGCCGC CGCGGCCGTC
CGGGCGCTCG CGCCGGCGAA GGCGACGCGC GGCACCACCG GCGCGAAGAC GCAGTCGACC
CGCCTGGTCG TCGACGCGAC CACCGGCACT GCTCGGCTCG CGTGGGAGGT CATCACCGGC
GGCACCCAGC AGGACGGCAC GCCGAGCCGG CTGGCGACGT ACGTCGACGC CCGGACCGGC
GCGGTGATCC GCCGCGAGCA GCAGATCCAG ACCGCGGACG GCTCGGGCCA GTCGCTCTAC
TCCGGGACCG TGCCGCTGCA GCTGACGCTG TCGGGCTCGA CGTACCAGCT CAAGGACCCG
ACCCGGGGCA ACACCTACAC GACCGACATG GGCAACGCGA GCGACTCCCT CGGCTGCCAG
TACTTCGGCT TCAACTGCAA GACCGGCACC CTGTTCACCA GCCCGGACAA CCTGTTCGGC
AACGGCGCGA CGAGCAGCCG GGAGTCGGCC GCCGTGGACG CGCAGTACGG CACGAACATG
ACGTGGGACT TCTACAAGTC GACCTACGGG CGCAACGGGA TCTTCGGGAC CGGCGCCGGC
TCCTACAACC GGGTCCACTA CGGCAAGAAC TACGTCAACG CGTTCTGGGA CGGCACCAAG
ATGACGTACG GCGACGGCGA CGGGACCAAC TACGGACCGT TGGTCTCGCT GGACGTGGCC
GGTCACGAGA TGTCGCACGG CGTCACCGAG AACACCGCCG GACTGGCCTA CTCGGGCGAG
TCCGGTGGTC TCAACGAGGC GACCTCGGAC ATCTTCGGCA CGATGGTGGA GTTCTTCGCC
GCCAACGCCA ACGACCCGGG CGACTACCTG ATCGGCGAGG AGTTCGACCT CAAGAACCAC
CTCGGCTTCC GGCGGATGGA CAACCCGGCC TCGGACGGCA GCTCCTTCAA CTGCTGGTCG
TCGACCGTCG GGAGCGCCGA CGTCCACTAC TCCTCGGGCG TCGGGAACCA CTTCTTCTAC
CTGCTCGCCG AGGGCTCGGG CGCCAAGACC ATCGGCGGCG TCGCCCACAA CAGCCCGACC
TGCAACGGCT CGACGGTGAC CGGCATCGGC CGGGACGCCG CGAGCGCGAT CTGGTACCGC
GCGCTCACGG TCTACATGAC CTCCAGCACC AGCTACGCCG GCGCCCGCAC CGCCACGTTG
AACGCGGCGC GGGACCTGTA CGGCGCGGGC AGCGCGCAGC AGAACGCCGT GGCCGCCGCG
TGGAGCGCGG TCAGCGTCAA CTGA
 
Protein sequence
MRRISGVATA AVLALTAVGI QSGASQAVTE RPNAVQALLA HPGAALASNG TAFTVTHTVT 
DADGSTHVRM DRTYRGLPVL GGDLVVHRGT QGGWRGVSQT LENEVHVSTT PAVGKAAAAV
RALAPAKATR GTTGAKTQST RLVVDATTGT ARLAWEVITG GTQQDGTPSR LATYVDARTG
AVIRREQQIQ TADGSGQSLY SGTVPLQLTL SGSTYQLKDP TRGNTYTTDM GNASDSLGCQ
YFGFNCKTGT LFTSPDNLFG NGATSSRESA AVDAQYGTNM TWDFYKSTYG RNGIFGTGAG
SYNRVHYGKN YVNAFWDGTK MTYGDGDGTN YGPLVSLDVA GHEMSHGVTE NTAGLAYSGE
SGGLNEATSD IFGTMVEFFA ANANDPGDYL IGEEFDLKNH LGFRRMDNPA SDGSSFNCWS
STVGSADVHY SSGVGNHFFY LLAEGSGAKT IGGVAHNSPT CNGSTVTGIG RDAASAIWYR
ALTVYMTSST SYAGARTATL NAARDLYGAG SAQQNAVAAA WSAVSVN