Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2788 |
Symbol | |
ID | 4444534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3139834 |
End bp | 3141426 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639690610 |
Product | Xaa-Pro aminopeptidase |
Protein accession | YP_832267 |
Protein GI | 116671334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.364809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGATG CAGACAACAC CTACAACGGT GAAAATACAG CAAGCCAGCC CCTCGAAGAG CGCGTCAACA ACCGCTCGCA GCGGCCCAGT TCCGACGCCT TCAAGGCCTT CATGGCCAGC AACTGGGCGC CCTCGGCACA AGAGCTCCCC GATCGGGACG CCGTGGCCGA TCACGCCGCC GCCCGGCGTC GCACCATTTC CGGCCTGTTC AAGGGCGAAC GCCTGGTGGT CCCCGCCGGC CCGCTGAAGG TCCGCTCCAA CGACTGCGAC TACCGCTTCC GCCCCCACTC CGGTTTCGCC CACCTGACGG GCCTGGGCCT CGACCACGAG CCCGACGCCG TGCTCATTTT CGAACCGGTT GAAGAAGGAA AGGGCGACGA CGGCGGGAAC CACCGCGCCA CCCTTTACTT CCGGCCCCTC GCCGGCCGGG ACACCGAACA GTTCTATGCA GACTCCCGCT CCGGCGAATT CTGGATCGGT GCCCGCCCCA CGCTGGCAGA ATTCGAACGC AGGCTGGGCC TCGCCACTGC CCACATCGAC GAGCTCGAAC TGGCAATCAC CAAGAATGTG GGCGCCCCCG AAATCGGCGG TATCTCCATC CGGCTGGTGC GCAAGGTGGA CGAGAACATC GACGCCCTGG TGGACACGGC CCGCTACAAC ACTGCCAAGG ACCCGGACAA CCTGGACCTG GGCGTGCTGG ATGCCCTTGA TGAGAAGCTC ACCGAGGCCC TCTCCGAGCT CCGCCTGGTC AAGGATGCGT GGGAAATCGA GCAGATGAAG ACCGCCGTGG CCGCGACCGT GGAAGGGTTC ACCGAGGTCG TCAAGGCCCT CCCCCGGGCC CTGACCCACC GGCGCGGCGA GCGCGTCGTC GAGGGAGCCT TCTTTGCCCG TGCCCGGGAA GAGGGCAATG AGCTGGGCTA CGACACCATC GCGGCCTCGG GCAACAACGC CACCGTGCTG CACTGGACGC GGAACACCGG AACGGTCAAC GCCGGCGAGC TCCTGCTGCT GGATGCCGGC GTTGAGGCCG ATTCCCTCTA TACGGCTGAC ATCACCCGTA CCCTGCCCGC CAACGGCACG TTTACCGAGG TCCAGCGCAA GGTCTACGAG GCTGTCCTGG ACGCAGCGGA CGCCGGCTTC GCCGCCGCGC AGCCCGGCAC CAAGTTCCGC GACATCCACA CGGCCGCCAC CACTGTCCTC GCTGAGCGCC TGGCGGAATG GGGCCTGCTG CCCGTGTCCG TCGAGGAAGC CATCAGCCCC GAGGGCCAGC AGCACCGCCG CTGGATGCCG CACGGCACCA GCCACCACCT TGGCCTCGAT GTGCACGACT GCGCCCAGGC CAAGCGTGAG CTCTACCTGG ACGGCGTCCT GACCCCGGGA ATGGTGTTCA CGATCGAGCC GGGCCTGTAC TTCAAGAACG AGGATCTCGC GATTCCGGCG GAATACCGCG GCATTGGCGT CCGGATCGAG GACGACATCC TCATGACTGC CGACGGTCCG GTCAACCTCA GCGCCGCACT CCCCCGCAAG GCCGACGACG TCGAGTCCTG GATGGCGGGC ATCTACCAGG AAGCAGAGCA CGCACAGCCG TAA
|
Protein sequence | MNDADNTYNG ENTASQPLEE RVNNRSQRPS SDAFKAFMAS NWAPSAQELP DRDAVADHAA ARRRTISGLF KGERLVVPAG PLKVRSNDCD YRFRPHSGFA HLTGLGLDHE PDAVLIFEPV EEGKGDDGGN HRATLYFRPL AGRDTEQFYA DSRSGEFWIG ARPTLAEFER RLGLATAHID ELELAITKNV GAPEIGGISI RLVRKVDENI DALVDTARYN TAKDPDNLDL GVLDALDEKL TEALSELRLV KDAWEIEQMK TAVAATVEGF TEVVKALPRA LTHRRGERVV EGAFFARARE EGNELGYDTI AASGNNATVL HWTRNTGTVN AGELLLLDAG VEADSLYTAD ITRTLPANGT FTEVQRKVYE AVLDAADAGF AAAQPGTKFR DIHTAATTVL AERLAEWGLL PVSVEEAISP EGQQHRRWMP HGTSHHLGLD VHDCAQAKRE LYLDGVLTPG MVFTIEPGLY FKNEDLAIPA EYRGIGVRIE DDILMTADGP VNLSAALPRK ADDVESWMAG IYQEAEHAQP
|
| |