Gene Information |
Locus tag | Acid345_3935 |
Symbol | |
ID | 4071318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4652503 |
End bp | 4654035 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985961 |
Product | IgA peptidase. metallo peptidase. MEROPS family M64 |
Protein accession | YP_593009 |
Protein GI | 94970961 |
COG category | |
COG ID | |
TIGRFAM ID | |
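The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4652503, 4654035)
print(length, protein_length_aa(length))  # prints "1533 510"
```

Both values agree with the Gene Length and Protein Length fields listed above.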
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.819698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.416212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTAGTTG GCTTCTGCCG CTCCGATTGC TCGTGTCGGA ACGCCGAAAC CGGTTCCACC ACTGAGACAA ACTGTCTCGC CAGATACAAT CCCCGCATGA AGCCGACCGG CAGAATTCTT TTGTGCTCCT TCCTCTTTCT TCTTCTTCTT TCTCTGGGCC GCTCGCCGCA CGTGGTCGCG CGCCAATCTA GCGACGCGTC CGTTTCGCGC ACTATGCGGG TGGACTACTA CCACAACGGC AACGCCAAGG AACAATGGTT CAGCCTCGAT CGCATTGTGC TTGAACCACT TCCGTGGCCC GGCAATCCTC GCAAGGCGAT AGATGACACC CAGATGGGCA ACTACCGCTT CGAAGTCCGC GAGCACGCGG GCGGCAGGCT CGTCTATTCA CGGGGTTTCA ACTCTATTTA CGGCGAGTGG CGCGAAACCG ACGAGGCCAA GAACGCCAAT CGCTCGTTCT CCGAGTCATT GCGCTTCCCT ACGCCATCCG TTCCGGTCGA CATCTCGCTG CAGGAACGGA AAGACGAGCG CTATGTCGAA ATCTGGAAGA CTTCCGTCGA TCCTGCCGAC AAGTTCATCG ACACATCGCG TCCCGCCTCG CCGGGTGCAC TCATTGAGTT GCAGAAGTCC GGTCCGTCCT CCGACAAAGT GGACCTCCTC GTGATGGGCG ATGGCTACAC CGCCGACGAG CGCGGTAAGT TCGAAACCGA TGCGAAGAAG TTCATCGAGA CGCTCTTCGC GACCTCGCCA TTCAAAGAAC ATCGTCAGGA CTTCAATGTC TGGGGTCTCT GCCCGCCGGC TGCAGAATCC GGAATCTCGC GTCCGTCTAC CGGCATTCAT CATCGTTCGC CGCTGGGCAC AACCTACGAC ACTTTCGACA GCGAGCGCTA TATCCTGACC ACCGAAAATC GCGCGATGCG TGATGCCGCT TCCTTCGCGC CCTATGAGTT CGTCGAAATA CTCGTTAACG GCAAGACCTA CGGTGGCGGC GGAATCTTCA ATCTCTACGG CACGGTTGCG ATTGATAACG CCTGGGCCAA CTACGTCGGC GTCCACGAGT TCGGCCACCA CTTCGCTGGC CTCGCGGATG AGTACTACAC CTCCGACGTC GCCTACAACT CCGAGACGAA GCGCAAAGAA CCGTGGGAGC CAAACGTGAC AGCGCTCCTC GATCCCGCCA ATCTCAAGTG GAAAGACCTG GTCGAAAACG GAACGCCCTT ACCCACGCCC TGGAAGAAAA CCGAATTCGA GGAATTTGAA AAAGGCATCC AGGCCGAGCG CCGCAAGCTG CGCGCCGATC GCCGTCCCGA AACAGAGATG GAAGAACTCT TCCGCCGCGA GCGCGAGAAA GAAGAAGCGC TCTTCCGCGA CGATCAGTAC CCGAGCAAAG TCGGCGCCTT CGAAGGCGCG AACTACGAAG CCAAGGGCTA CTACCGACCG GAAGAGAACT GCATCATGTT CACTCGCCAC ACGAAGTTCT GCCGCGTCTG TAGCCGGGCA ATCGAACGCA TCATCTCGAT GTATTCAAAC TAA
|
Protein sequence | MLVGFCRSDC SCRNAETGST TETNCLARYN PRMKPTGRIL LCSFLFLLLL SLGRSPHVVA RQSSDASVSR TMRVDYYHNG NAKEQWFSLD RIVLEPLPWP GNPRKAIDDT QMGNYRFEVR EHAGGRLVYS RGFNSIYGEW RETDEAKNAN RSFSESLRFP TPSVPVDISL QERKDERYVE IWKTSVDPAD KFIDTSRPAS PGALIELQKS GPSSDKVDLL VMGDGYTADE RGKFETDAKK FIETLFATSP FKEHRQDFNV WGLCPPAAES GISRPSTGIH HRSPLGTTYD TFDSERYILT TENRAMRDAA SFAPYEFVEI LVNGKTYGGG GIFNLYGTVA IDNAWANYVG VHEFGHHFAG LADEYYTSDV AYNSETKRKE PWEPNVTALL DPANLKWKDL VENGTPLPTP WKKTEFEEFE KGIQAERRKL RADRRPETEM EELFRREREK EEALFRDDQY PSKVGAFEGA NYEAKGYYRP EENCIMFTRH TKFCRVCSRA IERIISMYSN
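The sequences above are displayed in space-separated blocks of 10 characters, so they need light cleaning before analysis. Note also that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch, with hypothetical helper names, for stripping the block spacing, computing a GC fraction, and checking the start codon. The start codon set is limited to the three most common ones and is an assumption of this sketch, since table 11 permits a few more.

```python
# Most common bacterial start codons; table 11 allows additional rare ones.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_common_start(seq: str) -> bool:
    """True if the sequence begins with one of the common start codons."""
    return seq[:3] in COMMON_START_CODONS


# Only the first two blocks of the gene sequence, for illustration.
fragment = clean_sequence("GTGCTAGTTG GCTTCTGCCG")
print(len(fragment), has_common_start(fragment), gc_fraction(fragment))
```

Running the same helpers on the full gene sequence would allow a comparison against the GC content field of the record.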