Gene Noca_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3988 
Symbol 
ID4598123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4207766 
End bp4210075 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content74% 
IMG OID639778593 
ProductATP-dependent protease La 
Protein accessionYP_925172 
Protein GI119718207 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTCA AGCTCCCCGT CCTCTTCGTC CCGGACGTCG TCCTCCTCCC CGGCATGGTC 
GTTCCCCTCG AGCTCGACGA GTCCTCCCAG GCCGCCATCG ACGCCGCCCG CGCGGGCAGC
GACAGCCAGG TGCTGGTCGC GCCGCGGCTG GACGACCGCT ACGCGTCGTA CGGCGTCATC
GCGACCATCG AGCGCGTCGG CAAGTTCTCG GGCGGCTCCC CGGCCGCGGT CCTCAAGGCC
GGTCCCCGGG CCGCCATCGG CAGCGGCGTC ACCGGGCCCG GGGCGGCCCT GTGGGTCGAG
GTCGAGCCGG CCGAGGACGT CGTGACCCCG CGCGCCCGCG AGCTCGCCGA GGAGTACAAG
CGGCTGGTCG TCGCCGTGCT CCAGCGCCGC GAGGCCTGGC AGATCGTCGA CCAGGTGCAC
CAGATGACCG ACCCGAGCGC GATCGCCGAC ACCGCGGGCT ACGCGCCGTA CCTCTCCACC
GAGCGCAAGC GCGAGCTGCT CGAGGACCCC GACGTGGAGT CCCGGCTGCT GCGCGTCATC
GGCTGGACCC GCGACTACCT CGCCGAGGCC GAGCTGACCG ACAAGATCGG TGAGGAGGTC
CGCGACGGGA TGGAGAAGCA GCAGCGCGAG TACCTCCTGC GTCAGCAGCT CGCCGCGATC
CGCAAGGAGC TCGGCGAGGG CGAGCCGGAA GGGGCCGACG ACTACCGGGC TCGCGTCGAG
GCCGCCGACG TGCCCGACGC GGTGCGCGAG GCGCTGCTGC GCGAGGTCGG CAAGCTCGAG
CGCTCCAGCG ACCAGAACCC CGAGGCCGCG TGGATCCGGA CCTGGCTCGA CACGGTGCTC
GAGCTGCCGT GGAGCACCAC CACCGAGGAC GCGAACGACG TCGCCGGCGC CCGCGCCGTC
CTGGACGCCG ACCACCACGG CCTGGACGAG GTCAAGGAGC GGATCGTGGA GTACCTCGCG
GTCCGGGCCC GTCGCGCGGA GCGCGGCCTG CAGGTCGTCG GAGGCCGCGG CTCGGGCGCC
GTGATCCTGC TGGCCGGGCC TCCCGGTGTC GGCAAGACCT CGCTGGGCGA GTCCGTGGCC
CGGGCGCTCG GCCGCAGGTT CGTCCGGGTC GCCCTGGGCG GCGTGCGCGA CGAGGCCGAG
ATCCGCGGGC ACCGCCGTAC CTATGTCGGC GCGCTGCCCG GCCGGATCGT GCGCGCGATC
AAGGAGGCCG GCTCGATGAA CCCGGTCGTG CTCCTCGACG AGGTCGACAA GGTCGGCTCC
GACTACCGGG GCGACCCCGC GGCCGCGCTG CTCGAGGTCC TCGACCCGGC GCAGAACCAC
ACCTTCCGCG ACCACTACCT GGAGCTGGAC CTGGACCTCT CCGACGTGCT GTTCATCGCG
ACCGCGAACG TCGTGGAGCA GATCCCGTCG GCGCTGCTGG ACCGGATGGA GCTGGTGACG
CTCGACGGGT ACACCGAGGA CGACAAGGTC GGCATCGCCC GCGACTTCCT GCTGCCCCGC
CAGCTCGAGC GCGCGGCGCT GACCTCCGAC GAGGTGCGGG TGACCGACGC CGCGCTGCGC
GAGATGGCCG CCAACTACAC CCGCGAGGCC GGCGTGCGCC AGATGGAGCG GCTGCTCGCC
AAGGCGCTGC GCAAGGCGGC GACCCGGCTC GCCACGGGCG CGGTCCAGGT CGACATCGAC
GTGGCGGACC TCAAGGACCT GGTCGGCCGG CCGCGGTTCA CCCCCGAGGC ACCGGAGCGT
ACGGCGGTCC CGGGCGTGGC GACCGGCCTG GCGGTGACCG GGCTCGGCGG CGACGTGCTG
TTCATCGAGG CGTCGGCTGC GGAGGGGACC GCCGGCCTCA CGCTGACCGG CCAGCTCGGC
GACGTGATGA AGGAGTCGGC GCAGATCGCG CTGTCCTTCG TCCGCGCGCA CGCGGCCGAG
CTCGGGGTGG AGCCGTCGTT CTTCGAGAAG GCGATCCACG TGCACGTGCC GGCCGGCGCC
ACCCCCAAGG ACGGCCCGTC CGCCGGCATC ACCATGGTCA CCGCGCTCAC CTCGCTGGCC
ACCGGCCGCC CCGTGCGCTC GGAGGTCGGC ATGACCGGCG AGGTCTCGCT GACCGGCCGG
GTGCTGCCGA TCGGCGGGCT CAAGCAGAAG CTGCTCGCGG CCCAGCGCCA CGGCCTCACC
GAGGTGTTCG TGCCGCTGCG CAACGAGCCG GACCTGGACG ACGTACCGGC CGACGTCCTC
GGGAGCGTCA CGGTGCACCC GGTCAGCGAC GTCCTGGACG TGGTCCGCGG CGCGCTGGCC
AGCACCGAGG CGGCCACGGT CGCAGCGTGA
 
Protein sequence
MALKLPVLFV PDVVLLPGMV VPLELDESSQ AAIDAARAGS DSQVLVAPRL DDRYASYGVI 
ATIERVGKFS GGSPAAVLKA GPRAAIGSGV TGPGAALWVE VEPAEDVVTP RARELAEEYK
RLVVAVLQRR EAWQIVDQVH QMTDPSAIAD TAGYAPYLST ERKRELLEDP DVESRLLRVI
GWTRDYLAEA ELTDKIGEEV RDGMEKQQRE YLLRQQLAAI RKELGEGEPE GADDYRARVE
AADVPDAVRE ALLREVGKLE RSSDQNPEAA WIRTWLDTVL ELPWSTTTED ANDVAGARAV
LDADHHGLDE VKERIVEYLA VRARRAERGL QVVGGRGSGA VILLAGPPGV GKTSLGESVA
RALGRRFVRV ALGGVRDEAE IRGHRRTYVG ALPGRIVRAI KEAGSMNPVV LLDEVDKVGS
DYRGDPAAAL LEVLDPAQNH TFRDHYLELD LDLSDVLFIA TANVVEQIPS ALLDRMELVT
LDGYTEDDKV GIARDFLLPR QLERAALTSD EVRVTDAALR EMAANYTREA GVRQMERLLA
KALRKAATRL ATGAVQVDID VADLKDLVGR PRFTPEAPER TAVPGVATGL AVTGLGGDVL
FIEASAAEGT AGLTLTGQLG DVMKESAQIA LSFVRAHAAE LGVEPSFFEK AIHVHVPAGA
TPKDGPSAGI TMVTALTSLA TGRPVRSEVG MTGEVSLTGR VLPIGGLKQK LLAAQRHGLT
EVFVPLRNEP DLDDVPADVL GSVTVHPVSD VLDVVRGALA STEAATVAA