Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4613 |
Symbol | |
ID | 4596069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4887395 |
End bp | 4889320 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639779222 |
Product | thimet oligopeptidase |
Protein accession | YP_925795 |
Protein GI | 119718830 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.910564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCG CCCCTCTCGC GCTCCCGTTG CCCGACGACG CCCAGGAGTG GGTCGAGTCC CTGACCCGCG ACGGACTCGC GACCGCCCGC GAGCGCGCCG AGCGGCTGCG CACGTCCCCG CCGGCCGACC CGCTGGACCT GCTCCGCGAG TGGGACGAGG TGGCCCGCGC GCTGTCCGGC GTCGCGGCGG CCGCCTCGCT GCTCGCGAAC GTGCACCCGC TGGAGCCGGT GCGGACCGCG TGCGAGGCGG CCGACCGGGA GGTGGACCGG CTGCTCACCG AGCTGCGCCA GGACCGCGGG CTGTACGACG TCTTCGCGGC CGCCGACCCG GCCGGCCTGG ACCCCGCGGC AGCACGCCTG CTCGACAAGA CCCTCGAGGA CTTCCGGCGC GCCGGCGTCG ACCTCGACGA CGCGACCCGG GCCCGGCTCG CCGAGATCAA CGAGCGGCTG ACCCAGGTCG GTCAGGAGTT CAGTCGCACC ATCCGCGACG ACGTGCGCAC CGTCCGGGTC CCGGCCGAGC GCCTCGCCGG CCTGCCGCAG GACTGGCTCG ACGCCCATCC CGTCGATGCC GACGGGCTGG TCACGGTCAC CACCGACTAC CCGGACGCGG TGCCGGTCCG GATGTTCGTC CACGACGCGG GCGTCCGGCG CCAGGTCACG GTCGCGTTCC TGGAGCGTGG CTGGCCGCAG AACGAGCCGC TGCTGCGCGA GATGTTCGCA CTGCGGCACG AGCTCGCCAC CCTGGTCGGC TACGCCGACT GGGCGTCGTA CGACGCCGGC GTGAAGATGA TCGGCGACGG ACCGGCGATC CCGGCGTTCA TCGACCGGAT CGCGGCCGCC GCGGACGCGC CCATGCGGCG CGACCTCGAC CAGCTGCTCG AGCGCTACCG GCGCGATGTC CCGGACGCGA CCGCGATCGA CACCGCGGAC TCGCTGTACT ACGAGGAGCT GGTCCGCCAG GAGCGGCACG ACGTCGACGC GCAGCGGGTC CGGGCGTACT TCGACTTCAC CAAGGTCCGC CGTGGCCTGC TCGAGGTCAC CGGCCGGCTG TTCGGGCTGC GCTACGAGCC GGTCCCCGAC GCGACGGTCT GGCACGAGGA CGTCGCGGCG TACGACGTGC TTCGCGACGC CCCCGACGGC CCGGTCCCCG TCGGGCGGAT CTACCTGGAC CTGCACCCGC GCGAGGGCAA GTACAAGCAC GCGGCGCAGT TCACCCTCGT CGACGGGCTC GCCGGGCGGC AGCTGCCCGA GGGCGTGCTG GTGTGCAACT TCTCGCGCGG CCTGATGGAG CACGACCATG TGGTCACGCT GTTCCACGAG TTCGGGCATC TGGTCCACCA CGTGCTCGGC GGCCACGGCG GGTGGACCCG CTTCTCCGGG GTCGCGACCG AGTGGGACTT CGTCGAGGCG CCCAGCCAGC TGCTCGAGGA GTGGGCGTGG GACCCCGAGG TGCTGCGCAC CTTCGCCGCC GACGCCGACG GCGAGCCGAT CCCGGAGGAC CTGGTGGAGC GGATGCGGGC CGCCGACGAG TTCGGCAAGG GCTACCACGC GCGCACCCAG ATGTTCTACG CGGCGATGTC GTACTGGTTC CACACGTCGC GTCCCGACGA CCTGACCGCG GCGATGCGCG AGCTGCAGGA GCGCTACTCG CCGTTCCCCT ACATCGACGG CACCCACATG TTCGCGAGCT TCGGCCACCT CGGCGGCTAC TCCTCGGCGT ACTACACCTA CATGTGGTCG CTGGTGATCG CGAAGGACCT GTTCTCCGCC TTCGACCCCG CCGATCTCTT CGACCCGGTC GTCGCCGGTC GCTACCGCGA CCGCGTGCTC GCCCTCGGCG GCTCCCGGGA CGCCGCCGAC CTGGTCACCG ACTTCCTCGG CCGCCCCTAC ACCTTCGACG CGTACGCCGC CTGGCTCGCT CGCTAG
|
Protein sequence | MSLAPLALPL PDDAQEWVES LTRDGLATAR ERAERLRTSP PADPLDLLRE WDEVARALSG VAAAASLLAN VHPLEPVRTA CEAADREVDR LLTELRQDRG LYDVFAAADP AGLDPAAARL LDKTLEDFRR AGVDLDDATR ARLAEINERL TQVGQEFSRT IRDDVRTVRV PAERLAGLPQ DWLDAHPVDA DGLVTVTTDY PDAVPVRMFV HDAGVRRQVT VAFLERGWPQ NEPLLREMFA LRHELATLVG YADWASYDAG VKMIGDGPAI PAFIDRIAAA ADAPMRRDLD QLLERYRRDV PDATAIDTAD SLYYEELVRQ ERHDVDAQRV RAYFDFTKVR RGLLEVTGRL FGLRYEPVPD ATVWHEDVAA YDVLRDAPDG PVPVGRIYLD LHPREGKYKH AAQFTLVDGL AGRQLPEGVL VCNFSRGLME HDHVVTLFHE FGHLVHHVLG GHGGWTRFSG VATEWDFVEA PSQLLEEWAW DPEVLRTFAA DADGEPIPED LVERMRAADE FGKGYHARTQ MFYAAMSYWF HTSRPDDLTA AMRELQERYS PFPYIDGTHM FASFGHLGGY SSAYYTYMWS LVIAKDLFSA FDPADLFDPV VAGRYRDRVL ALGGSRDAAD LVTDFLGRPY TFDAYAAWLA R
|
| |