Gene Information |
Locus tag | Acid345_2940 |
Symbol | |
ID | 4070864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3483561 |
End bp | 3485219 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984959 |
Product | peptidase M28 |
Protein accession | YP_592015 |
Protein GI | 94969967 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
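The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon while Protein Length does not; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # the stop codon is not translated
    return gene_length, protein_length


# Values taken from the record above (Acid345_2940).
print(cds_lengths(3483561, 3485219))  # (1659, 552)
```

With the values in this record, 3485219 - 3483561 + 1 = 1659 bp, and 1659 / 3 - 1 = 552 aa, matching the Gene Length and Protein Length rows.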
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.529928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCATTCAC ACGTACCTCG TGTCCCCTAT CGCATCTCCC TATTTGTGTT TCTCGCATTG TTTTCCTGCT CTGCATACGC TCAACAGCGC GGCGAGACCG TGGACCTGGG CATGGTCACG CAGATTCGGC AGGAAGGATT CCGGAACTCG CAAGTCATGC AAACCGCCAG CGCAATCGTC GACGGTATCG GCGAACGGCT AAGTGGGTCG CCGAATGTCA AGAAAGCGAA CGAATGGACC CGGGATCAAT TCACCAAGTG GGGGTTGCAA AACGCTCACC TTGAGGGCTA CAAGTTCGGG CGCGGGTGGC AGAACGAATT CACGTCGGTC CGAATGGTCT CGCCCGACTT CATGGAACTT ATTGCCTACC CGAAGGCCTG GACGCCGGGA ACGAGTGGCG CGATTAAGGC GCCGGCCGTG CGCGTAGTCG CGAAGTCACC TGCCGACTTC GAAAAGTATC GCGGCAAGCT CTCCGGCAAA ATCGTTCTCT ATGGCGATAT GCCGGAAGTG AAGCCGCAAG CGGAAGCCGC CATGAGCCGT TATGACGAGA AGAAACTCGC CGACATCGGC CATTACGAGA TCCCAAGCGA GAAGCCGCGA TTCAGTCCTG AGGAATTTAA GAATCGTCTA GCGCTACGGA AAGCGGCGGA CGAGTTCTTC GTCAAGGAGA ATGTTGTCGC TGTGATTGAC GCCAGCCGCG GCGATGGCGG TACCGTTTTC GTACAGAGCG CCGGTTCGTA TAAAGAAGGT GAGCCGGAGG TCGCGCCGTC GCTCTCGATG GCGGTGGAAC ATTTCGGCCG CATCGCGCGA CTGCTGGAGC GCGGAGGAAA TGTCGAACTT GAAGTAAATG TTCAGAACAA GTTCTACACA GACGACCCGA CGGCTTACGA CACCGTCGCC GAAATCCCCG GCTCTGACAA GAAAGATGAG CTGGTAATGG TCGGAGCGCA CCTCGATTCC TGGCATGCCG GCACCGGTGC CACGGACAAT GCTGCCGGAT GCGCGGTGAC GATGGAAGCA GTGCGCATTT TGCAGGCACT TGGCGTGAAG CCGCGGCGGA CTATCCGTAT CGCACTATGG ACAGGCGAGG AAGAGGGATT GCTCGGCTCG CGCGCGTATG TCGAGCAACA CTTTGGGTCG CGACCGGAGA GCACTGATCC GAAGGAAAAA GACCTGCCGT CGTTCCTGCG CAAGCCGGGA ACTCCACTCA CCCTCAAGCC GGAGCAGAAG CAGGTTTCGG CATACTTCAA CATCGACAAT GGAAGCGGTA AGGTCCGCGG CATTTACCTG CAGGAGAACG CCGCCGTTGC ACCGATCTTC ACCGAGTGGC TAAAGCCATT CCATGATCTC GGCGCCGATA CGATCACGTA TCGCAACACG GGCGGCACGG ACCACCTCTC GTTCGATGCG GTAGGTATTC CGGGTTTCCA GTTCATCCAG GACCCGATTG AGTATGAGAC TCGCACTCAC CACTCGAATA TGGATGTGTA CGAACGCCTG CAACGTGACG ATCTGATGCA AGCTTCGGTA GTACTCGCAT CGTTCATCTA TAACGCAGCA ATGCGCGACG ACATGATGCC GCGCAAGCCG CTACCGAAGG ATGCGGTGCT CCCGCCCGCA TCCGCGGCTC CGGCAAAGAC CCCTGCAAAG AAAAAGTAG
|
Protein sequence | MHSHVPRVPY RISLFVFLAL FSCSAYAQQR GETVDLGMVT QIRQEGFRNS QVMQTASAIV DGIGERLSGS PNVKKANEWT RDQFTKWGLQ NAHLEGYKFG RGWQNEFTSV RMVSPDFMEL IAYPKAWTPG TSGAIKAPAV RVVAKSPADF EKYRGKLSGK IVLYGDMPEV KPQAEAAMSR YDEKKLADIG HYEIPSEKPR FSPEEFKNRL ALRKAADEFF VKENVVAVID ASRGDGGTVF VQSAGSYKEG EPEVAPSLSM AVEHFGRIAR LLERGGNVEL EVNVQNKFYT DDPTAYDTVA EIPGSDKKDE LVMVGAHLDS WHAGTGATDN AAGCAVTMEA VRILQALGVK PRRTIRIALW TGEEEGLLGS RAYVEQHFGS RPESTDPKEK DLPSFLRKPG TPLTLKPEQK QVSAYFNIDN GSGKVRGIYL QENAAVAPIF TEWLKPFHDL GADTITYRNT GGTDHLSFDA VGIPGFQFIQ DPIEYETRTH HSNMDVYERL QRDDLMQASV VLASFIYNAA MRDDMMPRKP LPKDAVLPPA SAAPAKTPAK KK
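The gene sequence starts with TTG, yet the protein sequence starts with M. This follows from translation table 11 (bacterial, archaeal and plant plastid code), where TTG is one of the accepted alternative start codons and is read as methionine at the initiation position. The sketch below is a minimal, dependency-free illustration of that rule; `translate_cds` and the constants are hypothetical names, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

# Table 11 uses the standard codon-to-amino-acid assignments,
# written here in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11; all are read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, accepting the space-grouped layout used in this record."""
    seq = "".join(seq.split()).upper()        # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):       # trailing partial codon is ignored
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                          # alternative start is read as Met
        if aa == "*":
            break                             # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate_cds("TTGCATTCAC ACGTACCTCG"))  # MHSHVP
```

The same function applied to the last nine bases of the gene sequence (AAAAAGTAG) gives KK, matching the end of the protein sequence, with TAG acting as the stop codon.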