Gene Information |
Locus tag | Acid345_1576 |
Symbol | |
ID | 4069014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1924798 |
End bp | 1926438 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983585 |
Product | type II secretion system protein E |
Protein accession | YP_590652 |
Protein GI | 94968604 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
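The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1924798, 1926438)
print(length, protein_length(length))  # 1641 546
```

Both values agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields listed above.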
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.22471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0615675 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGAGA ACACAGCAAT TTTCGTCAAC AACGCGACCA CCGCAGCCAA GGAAGAAGAG CGCGGACGCG ATCTCGCGCG GCGCTATCGC TGCGAGTTCG TGGACCTGAA GTCGTTTCGG ATGCACCAGG ACCTGTTCCG CAAGATTCCA GTGGAACTGA TGTTTCGCTA TAACTTCATC CCTCTGGAAG AACTGCCGGA CGGCCGCCTC GAAATCGCGA TCGACGATCC CAGCCGGCTG ATGATGATTG ACGAAGTCGG TTTGCTGTTG CGGCGCGAGA TCGTGACCAA GGTTTCGACG CTCTCCCAGA TCACCGACAT CCTGAAGAAG ACGGAGCAGT CGCAGCGCGT TCTGGAAGAG GCGAGCGAAG ACTTCGCCGT CCACGTTATT CGCGACGACG ACGAGTCCGA CGAAACCATC TCGATCGAGA AGTTGACGGC GGAAGGGGAC ATGAGCCCCA TCATCCGCCT GGTGGACACG ACCATCTTTA CCGCGTTGCA ACGGCGTGCG TCCGATATTC ATATCGAGAC GCAAGATGAA TCCGTGATCA TTAAGTACCG TATTGACGGC GTGCTACAGA AGGCGATGCA ACCGATCGCG AAGGAACACC ACTCGACGAT CATCTCGCGT ATCAAGGTCA TGAGCGAGTT GGATATAGCC GAGCGTCGTG TACCGCAGGA CGGGCGCTTC CGCGTACGCT ACCTGGGCCG CCAGATTGAT TTCCGCGTTT CCATCATGCC GTCTATCCAC GGCGAAGACG CGGTGCTCCG TGTGCTCGAC AAAGAGAGCA TGAGCGAGAA GTTCCACAAG CTGACGCTTG ATGTGGTCGG GTTCAGCGAG GACCACATCA AGACGTTTCG CCGGTACATC AACGAGCCGT ACGGCATGGT GCTGGTGACG GGGCCTACCG GTTCCGGCAA GACGACGACC CTTTACGCTG CGCTGAATGA AATCAAGACG GAAGAAGACA AGCTGATCAC GATTGAAGAT CCGGTCGAAT ACCAGATCCG CGGCGTTACG CAGATTCCGG TGAACGAGAA AAAGGGTCTG ACTTTCGCTC GCGGCCTGCG TTCGATTCTG CGTCACGATC CGGACAAGAT CATGGTCGGC GAAATCCGCG ACACCGAGAC GGCACAAATC GCGATTCAGT CCGCGCTGAC CGGTCACCTT GTGTTCACGA CAGTCCACGC GAACAACGTG GTGGACGTAC TGGGGCGGTT CCTGAACATG GGCGTGGAGG CCTACAACTT TGTGTCGGCA CTGAATTGCA TCCTGGCGCA GCGGTTGGTG CGCGTCATCT GCGACCACTG CAAGCGCAAG GTGCGCTACG ACCTCGAAAC CCTGGAGAAC AGCGGTCTCA ACCCGGCAGA GTGGGGAGAC TTTGAATTCA GCGAGGGCCC GGGCTGTATC GAGTGCGCCG GCACCGGGTT CCGCGGCCGA ACGGCGATCC ATGAACTACT TGAACTGACC GATCGGATTC GCGAAATGAT TCTCGACAAG AAGCCGAGCT CGGAGATCCG CAAGGCGGCG CGCGAGGACG GCATGATTTT CCTGCGCGAG TCGGCGCTGG CGAAGCTGCG CGATGGGATC ACGACGCTAC GCGAAATCAA TAAGGTCACG TTCATCGAGG CCTCGAGATA A
Protein sequence | MAENTAIFVN NATTAAKEEE RGRDLARRYR CEFVDLKSFR MHQDLFRKIP VELMFRYNFI PLEELPDGRL EIAIDDPSRL MMIDEVGLLL RREIVTKVST LSQITDILKK TEQSQRVLEE ASEDFAVHVI RDDDESDETI SIEKLTAEGD MSPIIRLVDT TIFTALQRRA SDIHIETQDE SVIIKYRIDG VLQKAMQPIA KEHHSTIISR IKVMSELDIA ERRVPQDGRF RVRYLGRQID FRVSIMPSIH GEDAVLRVLD KESMSEKFHK LTLDVVGFSE DHIKTFRRYI NEPYGMVLVT GPTGSGKTTT LYAALNEIKT EEDKLITIED PVEYQIRGVT QIPVNEKKGL TFARGLRSIL RHDPDKIMVG EIRDTETAQI AIQSALTGHL VFTTVHANNV VDVLGRFLNM GVEAYNFVSA LNCILAQRLV RVICDHCKRK VRYDLETLEN SGLNPAEWGD FEFSEGPGCI ECAGTGFRGR TAIHELLELT DRIREMILDK KPSSEIRKAA REDGMIFLRE SALAKLRDGI TTLREINKVT FIEASR
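The sequences in this record are displayed in blocks of ten characters, so the spaces have to be stripped before any analysis. The sketch below shows one way to do that, then compute GC content and translate the gene. It is a minimal illustration with hypothetical helper names, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it does not treat alternative start codons specially, which is harmless here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(b in "GC" for b in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence above.
prefix = "ATGGCAGAGA ACACAGCAAT TTTCGTCAAC"
print(translate(prefix))  # MAENTAIFVN
print(gc_percent(prefix))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison against the GC content and protein sequence fields reported in this record.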