Gene Noca_0444 details


Gene Information

Locus tag: Noca_0444
Symbol:
ID: 4597343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 475120
End bp: 477165
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 69%
IMG OID: 639775058
Product: Mername-AA223 peptidase
Protein accession: YP_921673
Protein GI: 119714708
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
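
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(475120, 477165)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values of this record, the sketch reproduces the listed Gene Length and Protein Length.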


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCGCA TATTCAAGGG TCCCTGGCTG TGGATCGTCC TCGCCGTGGT CGGCGTCCTG 
CTCGCCCTGC AGTACCTCGC GCCCAACCGC GGGGGCGAAG AGGTGGACGC CTCGAAGATG
CAGGACCTCA TCAGCTCCGG TGAGATCAAG GAGCTGACCT TCGTCGACGG CGGCGAGCAG
CAGATCAAGG CCACCCTCGA CAACGGCGAC AAGGTCACCG CCTTCTGGCT CGACGGGACC
CAGGCGGAGC TGGACTCCCA GGTCCAGGAC CAGGTCGATG CGGGCAAGAT CGACTCCTAC
ACGGTCGAGG TGCCCAAGCC GAGCCTGCTC GGCTCGATCC TGGCGACGCT GCTGCCGTTC
GCGCTGATCA TCCTGCTCTT CCTGTTCCTC ATGAACCAGG TCCAGGGCGG CGGCGGTCGC
GGCGTCATGC AGTTCGCGAA GTCGCGCGCG AAGCTGATCT CCAAGGACAT GCCGAAGACC
ACGTTCGGCG ACGTCGCCGG TTGCGAGGAG GCGATCGAGG AGCTCGGGGA GATCAAGGAG
TTCCTCCAGG AGCCCGCCAA GTTCCAGGCG GTCGGCGCCA AGATCCCCAA GGGCGTGCTG
CTCTACGGCC CGCCCGGCAC CGGCAAGACC CTGCTGGCCC GGGCGGTCGC CGGTGAGGCG
GGCGTCCCGT TCTACTCGAT CTCCGGGTCC GACTTCGTCG AGATGTTCGT CGGCGTCGGC
GCCTCCCGGG TCCGTGACCT GTTCGAGCAG GCCAAGGAGA ACGCGCCCGC GATCGTGTTC
ATCGACGAGA TCGACGCCGT CGGTCGCCAC CGCGGCGCCG GGATGGGCGG TGGCCACGAC
GAGCGCGAGC AGACCTTGAA CCAGCTGCTG GTCGAGATGG ACGGCTTCGA CGTCCGCGGC
GGCGTGATCC TGATCGCCGC CACCAACCGG CCCGACGTCT TGGACCCGGC GCTGCTGCGC
CCCGGTCGCT TCGACCGCCA GATCCAGGTC GACGCCCCGG ACCTCAACGG CCGGCACATG
ATCCTCAAGG TCCACTCGCG CGGGAAGCCG ATGTCCCAGG ACATCGACCT GCTCTCCGTG
GCCCGTCGGA CACCCGGCTT CACCGGTGCC GACCTGGCCA ACGTGCTCAA CGAGGCGGCG
CTGCTGACCG CGCGCAGCAA CCAGAAGCTG ATCACCAACG CCAACCTCGA CGAGGCCATC
GACCGGGTGA TCGCGGGCCC GCAACGGCGT ACCCGCCTGA TGAGCGAGAA GGAGAAGCTG
ATCACGGCCT ACCACGAGGG CGGCCACGCC CTGGTGGCTG CGGCGCTGCC CGGCACCGAC
CCGGTGCACA AGATCACGAT CCTGCCCCGC GGCCGGGCCC TCGGCTACAC GATGGTGCTG
CCCGACGAGG ACAAGTACTC CCAGACCCGG TCGCAGATGC TCGACTCGCT GGCCTACATG
CTCGGCGGCC GTGCGGCCGA GGAGATGGTG TTCCACGACC CCACCACCGG TGCCGGCAAC
GACATCGAGA AGGCCACCAA CCTGGCCCGC GCGATGGTCA CCCAGTACGG CATGACCGAG
CGGCTCGGCG CGATCAAGCT GGGGGAGTCG AACTCCGAGC CGTTCTTGGG ACGCGACCTG
GGCCACGCCC GCAACTACTC CGAGGACGTC GCCGCGATCG TCGACGAGGA GACCAAGAAG
CTGCTCGCGA ACGCCCACCA GGAGGCCTTC GAGATCCTCG AGGAGAACCG CGACGTCCTC
GACGCGCTGG TGCTCGAGCT AGTCGAGAAG GAGACGCTGG ACAAGCAGCA GGTCGCGGAG
GTCTTCGCGC CCCTGCGCCG CCGGTCCGAG CGGCCCGCCT GGACGGGCTC GCCCGAGCGC
AACCCGTCCG CGATCCCGCC GGTGGAGATC CCGCAGTGGA TCCGGGACCG GGCGGCCGCC
AACGGTCACT CCAAGGAGGA GGGTGAGGCC GGGCCGGTGC TCACCCCGCC GGGATCCGGC
GGCGACGTGC ACGGTGACCC GGGGGTCGGC GGCGCCGAGA CGCCTCCGGG CCTGCCGCCG
CACTGA
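
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as plain text with the space-separated 10-base blocks shown above; `gc_fraction` is a hypothetical helper name. Only the first row of the gene sequence is used here, so the result is not expected to equal the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases, ignoring whitespace used for display blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above (60 bases), as an illustration only.
first_row = "GTGAAGCGCA TATTCAAGGG TCCCTGGCTG TGGATCGTCC TCGCCGTGGT CGGCGTCCTG"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the complete gene sequence instead of `first_row` gives the per-gene value.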
 
Protein sequence
MKRIFKGPWL WIVLAVVGVL LALQYLAPNR GGEEVDASKM QDLISSGEIK ELTFVDGGEQ 
QIKATLDNGD KVTAFWLDGT QAELDSQVQD QVDAGKIDSY TVEVPKPSLL GSILATLLPF
ALIILLFLFL MNQVQGGGGR GVMQFAKSRA KLISKDMPKT TFGDVAGCEE AIEELGEIKE
FLQEPAKFQA VGAKIPKGVL LYGPPGTGKT LLARAVAGEA GVPFYSISGS DFVEMFVGVG
ASRVRDLFEQ AKENAPAIVF IDEIDAVGRH RGAGMGGGHD EREQTLNQLL VEMDGFDVRG
GVILIAATNR PDVLDPALLR PGRFDRQIQV DAPDLNGRHM ILKVHSRGKP MSQDIDLLSV
ARRTPGFTGA DLANVLNEAA LLTARSNQKL ITNANLDEAI DRVIAGPQRR TRLMSEKEKL
ITAYHEGGHA LVAAALPGTD PVHKITILPR GRALGYTMVL PDEDKYSQTR SQMLDSLAYM
LGGRAAEEMV FHDPTTGAGN DIEKATNLAR AMVTQYGMTE RLGAIKLGES NSEPFLGRDL
GHARNYSEDV AAIVDEETKK LLANAHQEAF EILEENRDVL DALVLELVEK ETLDKQQVAE
VFAPLRRRSE RPAWTGSPER NPSAIPPVEI PQWIRDRAAA NGHSKEEGEA GPVLTPPGSG
GDVHGDPGVG GAETPPGLPP H
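
The protein sequence begins with M although the gene sequence begins with GTG. Translation table 11 uses the same amino acid assignments as the standard code but allows alternative initiation codons, GTG among them, and an initiation codon is read as methionine. The sketch below illustrates this on the first row of the gene sequence; the names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical, and the code is a simplified illustration rather than the method used to produce this record.

```python
from itertools import product

# Standard amino acid assignments in NCBI base order (T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


first_row = "GTGAAGCGCA TATTCAAGGG TCCCTGGCTG TGGATCGTCC TCGCCGTGGT CGGCGTCCTG"
print(translate(first_row))
```

The output for this row corresponds to the first two 10-residue blocks of the protein sequence above. The sketch does not handle ambiguous bases such as N.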