Gene Noca_3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3338 
Symbol 
ID4600232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3543251 
End bp3545620 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content74% 
IMG OID639777946 
Productpeptidase S45, penicillin amidase 
Protein accessionYP_924527 
Protein GI119717562 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.378065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGCCGA AGACCACTCG CGCCGGGGCC GCCCTGGCCA CCCTGCTCCT GCTCGCACCG 
CTCTCCGCGG CCGCGACCTC CGCCGAGGCC GCCCCCGTCG CGGCCACACG GGCGGCGAAG
GCGCCGGCGT ACGACGTGAC GATCCGGATC ACCGAGCACG GCATCCCGCA CATCCGGTCG
AACACCCTGC CGGGCATGGC GTACGGCGCC GGCTGGATCA CCACGGCCGA GGCGACGTGC
AACCTGATGG ACACGCTGCT CACCGCGCGC GGGCAGCGCT CCCTCTACGA GGGCCCGGAC
GCCACCTACA AGGACAACGT CGGCGGCTCG GCCACGAACC TCGAGTGGGA CACGCTGGTC
ACCGACCTGC GCGACCGCGA GGTCGTGGAG CAGCTCCTCG CCGACCCGGT CGCCGGCCCC
AGTGGTCGGG CGAAGGCGAT GGTGGCCGCC GAGACCGCCG GCATCAACGC CTGGCTGGCC
GAGCACGAGA TCACCGACCC GGCCTGTGCG GACGAGGCGT GGATCGCGCC CGACGTCACG
ACCACCGACG TCTGGTACGC CTTCTACCTC GCGCAGCTGA TCTCCTCCAC GACCCGCCTG
CTCCCGCAGA TCGCCGACGC GGCGCCGCCG GCCGCCGGCG TGAAGCCCAC CGCGAGGCCG
ACGCGGGCGG AGCTGCGTGA GGCGCTCGAG CCCGACGCCG AGTTCGGGTC GAACGCCACC
GCGGTCGGCG GCGAGGACAC CTCCACCCGC CGCGGGATGA TCCTCGGCAA CCCGCACTTC
CCCTGGCTCG GCCGGTACCG CTTCACCCAG ATGCACCTGA CCGTCCCGGG CAAGCTGGAC
GTCGCCGGCG GCGCGCTGAC CGGCTTCCCG GCGATCAACA TCGGCTTCAA CAAGGACGTG
GCGTGGAGCC ACACCGTGTC GACCGGCTTC CGGTTCACGC CGTACCAGTA CACGACCGCC
GACACCCCCA CCAGCTACCG GACCGCCGAC GGCGGCACCG CCGAGCTCGA GCGGCGCGAC
GTCGACGTGC AGGTCCGGAC CGACGCCGGG GTCGAGACCG TGACCCGCAC CCTCTGGCGG
ACGCCGCAGG GCTACGTCCT CAACGCACCG AGCCTGTTCA TGGGCTGGAC CAAGGGCAGC
TTCTGGGCGT TCCGCGACGC GAACGCCGAG CACCTGCGCA CCTTCGACAC CTTCCTGGCG
ATGAACATGG CGAGCTCGGT GCGCAACCTG CTGCGCCGCC AGGACCGGGG CGGCGGCATG
CCGTGGGTCA ACACGGTCGC CGCCGACCGC GCCGGCGACG TCCTGTACGC CGACCACGCG
GTCACCCCGC ACGTCACCGA CGCCCTCGCG CGCCGGTGCA TGACCCCGGC CGGCCGGCTG
ATCTTCAGCG TCGCCGGGCT CCCCGGGCTC GACGGCACCC GGGCCAGCAC GGGCTGTGCG
TGGGGCACCG ACGAGGATGC CCAGCGCCCC GGCATCCTCG GCCCCTCGCA CCTGCCCGAG
GTCGTCCGCC GCGACTGGGT GATGAACGCC AACGACTCCT ACTGGCTGCC CAACGACGAG
GTGCGGCTGA CCGGCTACCC GCGGATCATC GGCTGCGAGG ACTGTGAGCG GACCATGCGC
ACGAAGGTCG TGATGGCCTA CGTGCGCGAC CGGTTGGAGG TCGGGAAGGA GACGCCGCGC
ACGCTGGCGG CGCACGAGCA CGCCAACCGG GTGTACGCCG CCGAGGTGGC CCGGGTCGGC
GGTCGGCTCG ACCGGGTCTG CGACGCCACC GGCCTGGTCG CCGCCTGCCG GCTGCTGCAC
GACTGGGACG GCCGCTCGGA CGCGACCTCC TCGGCCGGCG CCGCGATCTT CCAGGAGTTC
GTCCGGCTCG CGGACCAGCG CGACGTCGGC CTCTGGGAGG TGCCGTTCTC GCCCGCCCAC
CCGCTGACGA CGCCGCGCAC GCTGTCGACC GCGCAGCCCG TCGTCGACGC GCTGGCCGCG
GCGATCGCGC TGGTCACCGA CCGCGACGCC GGCCTGACCA AGACGTACGG CGAGCTCCAT
CGCTCGGGCG ACCGGGGCTC CGCCGGCTGG CCGCTCGGCG GCGGCCTCGG CGACCTGTCC
GGCGACGCGA ACGCGGTCAG CAGCACGCTC GGCGACCCGG TCCTCGACCC GGTCACCCGC
GGCTCGTCGT ACCTCCAGGC GGTCGCCTTC CGGGGCCGGA CCGGCGTCGA CGCGCGCACG
ATCCTCACCT ACAGCCAGTA CGAGGACCCG ACGTCCCGCT GGTCGGACGA CCAGACCGAG
ATGTTCTCGA ACGAGCAGTG GGTGCGCTTC CCGTGGACCG CCGCGCAGAT CCGCGACCAG
CTCGTGGAGG TCGTGCACCT GGCGCCCTGA
 
Protein sequence
MQPKTTRAGA ALATLLLLAP LSAAATSAEA APVAATRAAK APAYDVTIRI TEHGIPHIRS 
NTLPGMAYGA GWITTAEATC NLMDTLLTAR GQRSLYEGPD ATYKDNVGGS ATNLEWDTLV
TDLRDREVVE QLLADPVAGP SGRAKAMVAA ETAGINAWLA EHEITDPACA DEAWIAPDVT
TTDVWYAFYL AQLISSTTRL LPQIADAAPP AAGVKPTARP TRAELREALE PDAEFGSNAT
AVGGEDTSTR RGMILGNPHF PWLGRYRFTQ MHLTVPGKLD VAGGALTGFP AINIGFNKDV
AWSHTVSTGF RFTPYQYTTA DTPTSYRTAD GGTAELERRD VDVQVRTDAG VETVTRTLWR
TPQGYVLNAP SLFMGWTKGS FWAFRDANAE HLRTFDTFLA MNMASSVRNL LRRQDRGGGM
PWVNTVAADR AGDVLYADHA VTPHVTDALA RRCMTPAGRL IFSVAGLPGL DGTRASTGCA
WGTDEDAQRP GILGPSHLPE VVRRDWVMNA NDSYWLPNDE VRLTGYPRII GCEDCERTMR
TKVVMAYVRD RLEVGKETPR TLAAHEHANR VYAAEVARVG GRLDRVCDAT GLVAACRLLH
DWDGRSDATS SAGAAIFQEF VRLADQRDVG LWEVPFSPAH PLTTPRTLST AQPVVDALAA
AIALVTDRDA GLTKTYGELH RSGDRGSAGW PLGGGLGDLS GDANAVSSTL GDPVLDPVTR
GSSYLQAVAF RGRTGVDART ILTYSQYEDP TSRWSDDQTE MFSNEQWVRF PWTAAQIRDQ
LVEVVHLAP