Gene Information |
Locus tag | Acid345_1392 |
Symbol | |
ID | 4068927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1689374 |
End bp | 1691089 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983401 |
Product | type II secretion system protein E |
Protein accession | YP_590468 |
Protein GI | 94968420 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E [TIGR02538] type IV-A pilus assembly ATPase PilB |
| |
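The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon, which is reasonable for an unspliced CDS but is not stated in the record. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates and lengths of an unspliced CDS record.

    Assumes 1-based inclusive coordinates and a protein length that
    does not count the terminating stop codon.
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,  # minus stop codon
    }


# Values copied from the Gene Information table for Acid345_1392.
result = check_gene_record(1689374, 1691089, 1716, 571)
print(result)
```

With the values in this record, the span 1691089 - 1689374 + 1 equals 1716 bp, and 1716 / 3 - 1 equals 571 aa, so all three checks pass.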
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.025124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAGA GGTTGGGCGA TCTTCTTGTT CGCGAGAAGG TCATCACCGC CGAGCAGTTG GAACAAGCAC TCAGGGAACA AGGCTCCAGC GGCACGCGTC TTGGTGCGGC CCTGGTGAAG CTTGGTTTTC TCTCGGACGA TGACGTCACC AACTTCCTCT CCCGTCAGTA TGGCGTACCC GCCATCAACC TCAACTATTT CGAGATCGAT CCTTCCGTCG TAAAACTCAT TCCTTACGAC ACTGCGAAGC GTTACCAGAT CCTTCCCTTG AGCCGCGTCG GCGCCTCGCT GACCATCGCC ATGGTGGATC CCACCAACGT CTTCGCAATG GACGACATCA AGTTCATGAC CGGCTTCAAC ATTGAGCCGG TGGTTGCCAG CGAAAGCGCG ATCCTCGAAG GCATCGAGAA GGCCTACAAC ACCGCTCCGG AAGAAGATCT TGAGTCGGTG ATGGCCTCGA TGGGTGAGGG TGAGGCCTCC GATATTGAAG TCCAGGCGGA CATGGAAGAG GCTGACTCCG CCGACCTCGA GCGCGCCGCC GAAGAAGCTC CGATCGTCAA GCTGGTGAAC ATGATCCTCA CGGAAGCCGT GAAGAAGGGC GCCAGCGACA TCCACATGGA GCCCTACGAA AAGGAATATC GCGTACGCTT CCGGATTGAC GGCATTCTCC AGACGATGAT GAATCCGCCG ATGAAACTTC GCGACGCGAT CATCTCGCGC GTGAAGATCA TGGCAAAGCT CGACATCAGC GAAAAGCGCC TGCCGCAAGA CGGCCGCATC ATGCTGAAGA TGAACCTCCA GGGAAAGAAG AAAGTGCTCG ACTATCGCGT CAGCACCCTG CCTACCCTGT GGGGCGAAAA AGTCGTTCTC CGACTGCTCG ACAAAGAGAG CCTGCGTCTC GACATGACCA AGCTCGGCAT GGAGCAGGAA TCGCTCGACA AGTTCACCAA AGCTATCTTC AAGCCGTACG GGATGGTGCT GGTCACCGGT CCCACGGGAT CCGGTAAGAC GAACACGCTG TACTCCTCGA TTTCGCAGCT CAACAAGCCC GACACCAACA TCATGACCGC TGAAGATCCG GTCGAGTTCC AGTTGCACGG TGTGAACCAG GTGCAGATGA AGGAACAGAT CGGCTTGAAC TTCGCGGCGG CCTTGCGCTC CTTCCTGCGT CAGGACCCCA ACATCATTCT CGTCGGTGAG ATCCGCGACT TTGAAACCGC GGAAATTGCG ATCAAGGCCG CATTGACCGG CCACTTGGTT TTGTCGACGC TGCACACCAA CGGCGCGCCC GAAACCATCA GCCGCTTGAT GAACATGGGT ATCGAACCAT TTCTTGTCGC GACTTCAGTG CACCTGATTG CTGCGCAGCG CTTGATCCGC CGCATTTGCA GCAACTGCGC CGAAGTCCTC GACCTGCCGC CGCAAGCGTT GATCGAAGCC GGCTATTCGC CGGCCGAGTC CAAGACGGTG AAGATCAGCA AGGGCCGCGG TTGCAGCAAC TGCAACAACA CGGGATATAA GGGCCGTACC GGCCTTTATG AAGTAATGGA GATTGACGAC GAAATCCGGG AATTGATCCT GGTCGGCGCT TCGGCGCTGG AGTTGAAGAA GAAAGCGATC GAGAAAGGCA TGATCACGCT GCGTCGCAGC GGCTTGATCA AAGTTTCACT GGGGATCACG ACGTTGGAAG AAGTCGCACG TGAAACCGTG CACTAA
|
Protein sequence | MSQRLGDLLV REKVITAEQL EQALREQGSS GTRLGAALVK LGFLSDDDVT NFLSRQYGVP AINLNYFEID PSVVKLIPYD TAKRYQILPL SRVGASLTIA MVDPTNVFAM DDIKFMTGFN IEPVVASESA ILEGIEKAYN TAPEEDLESV MASMGEGEAS DIEVQADMEE ADSADLERAA EEAPIVKLVN MILTEAVKKG ASDIHMEPYE KEYRVRFRID GILQTMMNPP MKLRDAIISR VKIMAKLDIS EKRLPQDGRI MLKMNLQGKK KVLDYRVSTL PTLWGEKVVL RLLDKESLRL DMTKLGMEQE SLDKFTKAIF KPYGMVLVTG PTGSGKTNTL YSSISQLNKP DTNIMTAEDP VEFQLHGVNQ VQMKEQIGLN FAAALRSFLR QDPNIILVGE IRDFETAEIA IKAALTGHLV LSTLHTNGAP ETISRLMNMG IEPFLVATSV HLIAAQRLIR RICSNCAEVL DLPPQALIEA GYSPAESKTV KISKGRGCSN CNNTGYKGRT GLYEVMEIDD EIRELILVGA SALELKKKAI EKGMITLRRS GLIKVSLGIT TLEEVARETV H
|
| |
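The gene and protein sequences above can be compared by translating the nucleotide sequence, and the GC content field can be recomputed from the same string. The following is a minimal sketch under stated assumptions: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one, whose amino acid assignments are shared by translation table 11 (the tables differ in permitted start codons, not in codon meanings). The gene sequence as listed begins with ATG and ends with TAA, so it is treated here as already being in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(sequence)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(sequence)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_prefix = "ATGTCTCAGA GGTTGGGCGA TCTTCTTGTT"
protein_prefix = clean("MSQRLGDLLV")
print(translate(gene_prefix) == protein_prefix)
```

To check the whole record, pass the full Gene sequence string to `translate` and compare the result with `clean()` of the Protein sequence string; `gc_fraction` on the full gene sequence can be compared with the GC content field.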