Gene Noca_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3657 
Symbol 
ID4595769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3882932 
End bp3883978 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content74% 
IMG OID639778265 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_924844 
Protein GI119717879 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAGCG AACCCCTGGT CCTCGGCATC GAGACCTCGT GCGACGAGAC CGGTGTCGGC 
ATCGTCCGTG GGCACACCCT GCTCGCGGAC GCGGTGGCGA GCAGCGTCGA CGAGCACGCC
CGCTTCGGCG GGGTGGTGCC CGAGGTCGCG AGTCGCGCCC ACCTCGAGGC GATGGTGCCG
ACCATCGAAC GGGCCTGCGA GACGGCCGGC ATCCGCCTGT ACGACGTCGA CGCGATCGCG
GTCACCAGCG GACCGGGGTT GGCCGGGGCG TTGATGGTGG GGGTAGCCGC CGCGAAGGCG
CTCGCGGTCG GCCTCGGCAA GCCGATCTAC GGCGTGAACC ACCTCGCGGC GCACGTTGCC
GTCGACCAGC TCGAGCACGG CCCGCTGCCC GAGCCCTGCC TCGCGCTGCT GGTCAGCGGC
GGCCACTCCA GCCTGCTGCG GGTCGAGGAC GTCACCTCCG GGGTGGACCC GATGGGGGCG
ACCATCGACG ACGCCGCCGG CGAGGCCTTC GACAAGGTGG CCCGGCTGCT CGGCCTGCCG
TTCCCCGGTG GCCCCTACAT CGACCGTGCG GCCCGCGAGG GCAGCACCGT GTACGTCGAC
TTCCCGCGCG GCCTGACCAG CCGCCGCGAC CTCGAGCGGC ACCGCTTCGA CTTCTCGTTC
TCGGGCCTCA AGACCGCGGT CGCGCGGTGG GTCGAGGCAC GGGAGCGGTC CGGCGAGCCG
GTGCCGGTGG CCGACGTGGC GGCGAGCTTC CAGGAGGCGG TCTGCGACGT GCTGACCCGC
AAGGCGATCG ACGCGGCGTC CAGTGCGGGC ATCGAGGACC TCCTCATCGG CGGTGGGGTC
GCCGCGAACT CCCGGCTGCG CGTGCTGGCG GAGGAGCGCG CCGCGGCGCG GGGGATCCGG
GTCCGGGTGC CCCGTCCCGG CCTGTGCACC GACAACGGCG CCATGGTCGC CGCTCTGGGC
GCCGAGATGG TCGCCCGCGG CCGCACCCCG TCCCCCCTGG ACCTCCCCGC CGACTCCTCG
CTCCCCGTGA CCGAGGTTCT CGTCTGA
 
Protein sequence
MSSEPLVLGI ETSCDETGVG IVRGHTLLAD AVASSVDEHA RFGGVVPEVA SRAHLEAMVP 
TIERACETAG IRLYDVDAIA VTSGPGLAGA LMVGVAAAKA LAVGLGKPIY GVNHLAAHVA
VDQLEHGPLP EPCLALLVSG GHSSLLRVED VTSGVDPMGA TIDDAAGEAF DKVARLLGLP
FPGGPYIDRA AREGSTVYVD FPRGLTSRRD LERHRFDFSF SGLKTAVARW VEARERSGEP
VPVADVAASF QEAVCDVLTR KAIDAASSAG IEDLLIGGGV AANSRLRVLA EERAAARGIR
VRVPRPGLCT DNGAMVAALG AEMVARGRTP SPLDLPADSS LPVTEVLV