Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0801 |
Symbol | |
ID | 4068582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 992157 |
End bp | 993383 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637982808 |
Product | proteinase inhibitor I4, serpin |
Protein accession | YP_589880 |
Protein GI | 94967832 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGT ATAGATTCGC AGGCTTCGCT CTCTCACTCA TCGTCGCCAC ATTTGCGAGT GCGCAAGGTA AACCCGGCGA GGGCTTGCCG ACGCTCATTC GCAGCAACGA GCAACTCGGC ACCAGGCTGT TGAACTCCAT GCATATGGCC TCGCCTGACC AGAACGTTGC GATCGCTCCT CCACCCATAA CGCTGATGTT CGCTCTGTTC CGAGCGGACG CGCGCGAAGA GACCGACAAA ATCCTCGGAT GGGATTCAGC GCTCGATCTT CGCTATTCGT CCCGAATGTT TATTGGTGCA TTGGATTCAG GCCCCTCCGA TCCGCCGAAA CCAGAAAAAC TTCTTGGTAA TCAATCCCTG ATGCCCGCGG GCGAAGATTC TATTTGGATG GCGAATGCGA TTGTGTATCG CACTCCCAAG GTGAAGGAGG TATTTGCGCC GTACTTCGTA AGAAACTCGC AACGCTTCTA TCACTTAGAT TTTGTGAATA TAGGCGATCG CAAACCCCAG TCGGAAGATT TGCGCCAGGC CACCCGCGGA CTTGCCCTTG CAACGCTTCC GGCGTTCGAC ATCGGGAATG ACTTCGCCTA CGTAGGTTCC CTTCACCTTA GTTGTTCATG GCAGGGCAAT CTCTTCGTTC TTAACCCTGA GATCGACGGT GACTTCGCGG TCCGTTCGGG AGGCACGAAG CCTGTCCACA TGATGGATGG CGAAAAGGAC TTCTTCGATC ACGCGCAAAC CGACGAGTTC GAGGCGATCA CTTTGCCGTG TAGCATCGGC TCTATGACCG TAGTCATGCC TGCGCAAGAG AAGTCCATCG AGGATCTCTC GCAAAAGATA GTCTCCAACC CGTCGCTTGT ATCTGCAGCA CTGCATCGCG AATTCGGCGC TGTCGTCATG CCTTCCTTTG ACTTCGCATT TCATTCCGAA TTGCGGCCGG TTTTGGAACG ATATGGCTTC AAGTCGCTCT TCGGTGTAAT GCGACAGATT ATTCTGGTGC CGGAATCGCA TCTGACCGAA GTGAATCAGA GCGGCCGAAT CAGGGCGGAT CGCGCGGGAG TCTACGCCGA AGCCAGCACT CTCGGCGGTG GAATTCTCGG CGGAATCATG GGCGGCCCGA CGCCGTTCCA CATGGTGGTA AATCGTCCGT TCCTGTTCTT CGTTCACGAC GACGCAGCCA ATATTTTGCT CTTCGCTGGC GTAGTCATGG ATCCTACCCA AAACTGA
|
Protein sequence | MTTYRFAGFA LSLIVATFAS AQGKPGEGLP TLIRSNEQLG TRLLNSMHMA SPDQNVAIAP PPITLMFALF RADAREETDK ILGWDSALDL RYSSRMFIGA LDSGPSDPPK PEKLLGNQSL MPAGEDSIWM ANAIVYRTPK VKEVFAPYFV RNSQRFYHLD FVNIGDRKPQ SEDLRQATRG LALATLPAFD IGNDFAYVGS LHLSCSWQGN LFVLNPEIDG DFAVRSGGTK PVHMMDGEKD FFDHAQTDEF EAITLPCSIG SMTVVMPAQE KSIEDLSQKI VSNPSLVSAA LHREFGAVVM PSFDFAFHSE LRPVLERYGF KSLFGVMRQI ILVPESHLTE VNQSGRIRAD RAGVYAEAST LGGGILGGIM GGPTPFHMVV NRPFLFFVHD DAANILLFAG VVMDPTQN
|
| |