Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3230 |
Symbol | |
ID | 4444011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3637291 |
End bp | 3639510 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691054 |
Product | oligopeptidase B |
Protein accession | YP_832706 |
Protein GI | 116671773 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.128986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGA CTCCAGTGCA GCACCCAGCC GACAACACCC CTGCGGCCGC CGCCCCCGTA GCCAGGAAGG TCCCGTTCGA ACGGACCCAC CACGGCGACA CTTTCGTGGA CAACTACGAA TGGCTGCGGG CCAAGGACTC CGCGGATGTG GTGGAGCACC TCAAGGCGGA GAACGCCTAC CAGGAGGCGG TCACCGCCCA CCAGGAACCG CTGCGCGAAG CCATCTTCCA GGAAATCAAG GGACGCACCC AGGAGACAGA CTTGTCTGTT CCGAACCGCA AGGACGGCTG GTGGTACTAC ACCCGTTCGG TCGAAGGCAA GGAATACGGC ATCCAGTGCC GCGTCCAGGC CCAAAATACC GGGGATCCGG TGGCCGACTG GACGCCCCCG GCGGTGGAGG CCGGCGTCGA ACTTTCCGGT GAAGAAGTCC TGCTGGACTG CAACGTCGAA GCCGAAGGCA AGCCGTTCTT CGCGGTCGGC GGCACCGCCG TGACCGTGGA CGGCAACCTC TACGCCTACG CCGTGGACAA CGCCGGCGAC GAACGCTTCA CGCTGCGCTT CAAGGACCTG CGCACCGGCG AGATGCTGCC GGACGTCATC GAGAACATCT TCTACGGCGT CTCCTTCTCC CCCGACGGCA CACGCCTGTT CTACACCGTG GTGGACGACG CCTGGCGCCC CTACCAGGTG AAGTCGCACG TGCTGGGCAC GCCGGTCACC GACGATGAGG TGATTTACCA GGAGGACGAC GTCGCCATGT GGCTGGGCTT CGACCTCTCC TCCGACCGGC GGCACCTCGT GCTGAGCATC GGCTGCTCCG AGTACAGCGA GACGCGGCTG CTCCGCTTTG ACGATTACGA CGCCGGACTC AGCACCGTGA TCTCCCGCGA CGAACGCGTC CTCTACGAGG CCGAGCCGTT CCTGCTGGAT GGCGCAGAGA AGATCCTGGT CACCCACAAC CGGAACGCCA TCAACTCCAT GGTGTCGCTT GTGGACGCCT CCGAGCTCGC CAAGCCGCTG GCCGAGCAGC AGTGGACCAC CGTCGTCGAA CATTCCGACC AGGTGCGCGT CAACGGCGCG GGCGTCACGT CCACGCACGT GATTGTGTCC GTCCGCAAGG ACACCATCGA GCGCGTCCAG GTGCTGGCCC TGGCCGGACT GGGCACGCCC GCGCAGGGCG ATCCGGTGGA GCCTGCATTC GACGAGGAGC TGTACACCGC CGGCGTCGCA GGCTCCGATT ACGAGGCCCC CGTGATCCGG ATGGGCTACA CCTCCTACTT CACGCCGTCG CGCGTGTACG ACTTCGTGCT TCCCACCCCC GAGCAGCCGG CGGGCGAGCT GCTGCTCCGC AAGGAGAGCC CGGTGCTGGG CGGCTACTCC CCGTCCGACT ACGTGGCCAC CCGGGAATGG GCGACCGCGG CCGACGGCAC CCGCATTCCG CTCTCGGTGC TGCGGCACGC GTCAGTGTCC CGCGATTCCT CCGCGGCCGG GCTCGTGTAC GGATACGGCT CCTACGAGCT GAGCATGGAC CCGGGCTTCG GCATCCCGCG GCTGTCCCTG CTGGACCGCG GGATCGTGTT CGTGATCGCG CACATCCGTG GCGGCGGCGA GCTGGGCCGG CACTGGTACG AGGACGGCAA GAAGCTCCAC AAGAAGAACA CGTTCACGGA CTTCATCGCG GCCACGGACT GGCTCGCTTC GTCCGGGTGG GTTGATCCCG CGCGGATTGC GGCGATGGGC GGTTCCGCGG GCGGGCTGCT GATGGGCGCC GTGGCCAACC TGGCGCCGGA AAAGTATGCG GCCATTGTGG CGGCCGTGCC GTTCGTGGAC GCGCTCACCA CCATCCTGGA CCCCGAGCTG CCGCTGTCCG CCCTGGAATG GGAGGAATGG GGCAACCCGA TCACGGACCC CGAGGTGTAC GCGTACATGA AGTCCTACAC TCCCTACGAG AACGTTGGGC CGCTGCCGTA TCCGAAGATC GCCGCCGTGA CCTCGTTCAA CGACACCCGC GTGCTGTACG TGGAACCGGC CAAGTGGGTG CAGGCCCTGC GCTCTGAAAC CACCGGGGCG GAGCCGATCG TGATGAAGAT CGAGATGGAC GGCGGCCACG GCGGAGCGTC CGGCAGGTAC GTCCAGTGGC GCGAACGGGC CTGGGACTAT GCCTTCGTGG CCGACTCCGT AGGCGCGACG GAACTGCTGC CGGGGGCCGG GATTAAGTAG
|
Protein sequence | MTQTPVQHPA DNTPAAAAPV ARKVPFERTH HGDTFVDNYE WLRAKDSADV VEHLKAENAY QEAVTAHQEP LREAIFQEIK GRTQETDLSV PNRKDGWWYY TRSVEGKEYG IQCRVQAQNT GDPVADWTPP AVEAGVELSG EEVLLDCNVE AEGKPFFAVG GTAVTVDGNL YAYAVDNAGD ERFTLRFKDL RTGEMLPDVI ENIFYGVSFS PDGTRLFYTV VDDAWRPYQV KSHVLGTPVT DDEVIYQEDD VAMWLGFDLS SDRRHLVLSI GCSEYSETRL LRFDDYDAGL STVISRDERV LYEAEPFLLD GAEKILVTHN RNAINSMVSL VDASELAKPL AEQQWTTVVE HSDQVRVNGA GVTSTHVIVS VRKDTIERVQ VLALAGLGTP AQGDPVEPAF DEELYTAGVA GSDYEAPVIR MGYTSYFTPS RVYDFVLPTP EQPAGELLLR KESPVLGGYS PSDYVATREW ATAADGTRIP LSVLRHASVS RDSSAAGLVY GYGSYELSMD PGFGIPRLSL LDRGIVFVIA HIRGGGELGR HWYEDGKKLH KKNTFTDFIA ATDWLASSGW VDPARIAAMG GSAGGLLMGA VANLAPEKYA AIVAAVPFVD ALTTILDPEL PLSALEWEEW GNPITDPEVY AYMKSYTPYE NVGPLPYPKI AAVTSFNDTR VLYVEPAKWV QALRSETTGA EPIVMKIEMD GGHGGASGRY VQWRERAWDY AFVADSVGAT ELLPGAGIK
|
| |