Gene Information |
Locus tag | Acid345_2040 |
Symbol | |
ID | 4073209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2443875 |
End bp | 2445611 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984054 |
Product | peptidase M28 |
Protein accession | YP_591115 |
Protein GI | 94969067 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
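The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2443875
END_BP = 2445611


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1737, matching "Gene Length"
print(protein_length(length_bp))  # 578, matching "Protein Length"
```

Because the gene lies on the minus strand, the start and end values still describe the span on the replicon; strand only affects which end holds the start codon, not the length arithmetic.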
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.194253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGT ACGCATGCCT CGCCACATGT CTCCTTGCAC TTTCATTCCC GCTCTGCGCC CAAAAACCCG AAGCCCTTGA CTACAACATG TACCAACGCA TCCGCTCCGA GGGATTCGAC CACTCGCACA TCATGGAATA CGCCTCCGCC CTCATGGACG GTATCGGCCC GCGTCTCACC GGCTCACCGA ATCTCAAGCA CGCCAATGAA TGGACGCGCG ACCAGTTCAC CTCCATGGGC TGCAGCAACG CTCACCTCGA AGACTGGGGC GAGTTCGGAC TGGGCTGGCG CCAGATCAAT ACCTGGGTGC GCATGTCCGC CCCTGACAAC GCCGTCTTCA TCGCGCAAGC GCTCCCGTGG TCTCCCGCCA CCAGCGGCCC CATCAACGGC CAGGCCATCT GGATCGAAGC CAAAGACGAA AAAGATCTCG AGAAGTACAA AGGTAAGCTG ACCGGCAAGA TCATCTTCTT CGGCCCCATG CGCGACGTGA AGCCCGTCGA GAAGCCGCTC ACCAAACGCA ACGAAGATGC CGACCTCAAG AAGATCGAAG ACTTCCCTGT CCGAGTCGGT GAACAGCACG AAGACTTCCT CGCCGGCTTC ATTAAGGAAC TCACCTTCCG CGAAAAGGCC GGCAAGTTCT TCGCGGATGA GCACGCCGCC GCCATCGTCG TCCCTTCCCG CGATGGCCGC GACAACGGCG GCTCTGGCGG CACCATCTTC GACGACGGTG GCACCGGCAT GGGCTGGTTT ACCTACCAGC GTGAGCACGC CGAGAAGCTT CCCATCGTCG TCACCGCCAT CGAAAACTAC GGCCGCGTCT ATCGCCTCTT GAAAGCCAAC GTCCCCGTCT CCATCGAAAT GGACGTTCGC ACCGAGTTCA CCGGCGACCA CGAACACGGC TTCGATACCA TCGCTGAAAT CCCCGGCACC GATCCCGCTC TTAAAGATCA AGTCGTGATG GTCGGCGGCC ACCTCGACTC CTGGGCCTCC GGCACCGGCG CCACCGACAA CGGCGCAGGC ACCGTCGTCG CTATGGAGGT CATGCGAATC CTCAACGCGC TCCACGTGCA GCCTCGCCGC ACCATCCGCG TCGCTCTCTG GACTGGTGAG GAAGAAGGCG AGTTCGGCTC CTACGGCTAC GTCAAAAACC ATTTCGGATT CGCGCCGCTC TCCACCGCCC CCGACCAGCT CGCGCTTCCT GAATTCGTGC GCAAGCCCGG TGGTCCCATC CAGATCAAGC CCGAGCATGC CAAAATTTCC GGCTACTTCA ACGTAGATAA CGGTTCCGGC AAAATCCGCG GCATCTACCT CCAGGGCAAC GCGCAACTGG CGCCGCTCTT CAAGGAGTGG ATCGCGCCTC TCTCTGATCT CGGAGTCAAC ACCATCTCCG TTCGCAACAC CGGCGGCACC GACCACGAAG CCTTTGACTC GGTCGGCATC CCCGGCTTCC AGTTCATCCA AGACCCGCTC GACTACAGCT CCCGCACCCA CCACAGCAAC ATGGATCTCT ACGAGCGTCT CCAACCCGCC GACCTCGCGC AAGCCGCCGT TGTCGAAGCC ATCTTCGTCT ACAACACCGC CATGCGCGAC CAGATGCTCC CGCGCAAACC GCTCCCCCAC CCCGAGCTAG ACGAGCCCGG CAAAGCCCCG CTCAAGAACG TAATGCCCGG TGTAGTCGCC GCCGCCGAAG AACAAAAGAA AGACGCGACA CCAGAAAAGA CGCCCGAGAA AAAGTAG
|
Protein sequence | MKKYACLATC LLALSFPLCA QKPEALDYNM YQRIRSEGFD HSHIMEYASA LMDGIGPRLT GSPNLKHANE WTRDQFTSMG CSNAHLEDWG EFGLGWRQIN TWVRMSAPDN AVFIAQALPW SPATSGPING QAIWIEAKDE KDLEKYKGKL TGKIIFFGPM RDVKPVEKPL TKRNEDADLK KIEDFPVRVG EQHEDFLAGF IKELTFREKA GKFFADEHAA AIVVPSRDGR DNGGSGGTIF DDGGTGMGWF TYQREHAEKL PIVVTAIENY GRVYRLLKAN VPVSIEMDVR TEFTGDHEHG FDTIAEIPGT DPALKDQVVM VGGHLDSWAS GTGATDNGAG TVVAMEVMRI LNALHVQPRR TIRVALWTGE EEGEFGSYGY VKNHFGFAPL STAPDQLALP EFVRKPGGPI QIKPEHAKIS GYFNVDNGSG KIRGIYLQGN AQLAPLFKEW IAPLSDLGVN TISVRNTGGT DHEAFDSVGI PGFQFIQDPL DYSSRTHHSN MDLYERLQPA DLAQAAVVEA IFVYNTAMRD QMLPRKPLPH PELDEPGKAP LKNVMPGVVA AAEEQKKDAT PEKTPEKK
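The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute GC content; the helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record. Applied to the complete gene sequence, the result should be comparable to the listed "GC content" field, which is presumably rounded; that comparison is not verified here.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGAAGAAGT ACGCATGCCT CGCCACATGT"
print(len(clean_sequence(first_blocks)))  # 30
print(gc_percent(first_blocks))           # 50.0
```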