Gene Information |
Locus tag | Acid345_1645 |
Symbol | |
ID | 4072532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1992152 |
End bp | 1993381 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983654 |
Product | flagellar basal body FlaE |
Protein accession | YP_590721 |
Protein GI | 94968673 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
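The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon; neither is stated in the record, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1992152
end_bp = 1993381
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons.
codons = span_bp // 3

# Assumption: the last codon is the stop codon and is not counted
# in the protein length.
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.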
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGT TTTCGATTCC GTTGTCGGGC CTCACCGCGA GTTCCACCGC ATTAGCCACG ATCGCCAACA ATCTTGCCAA TCAGAACACG ATTGGCTACA AGCAGACACG CGCACTGTTC CGCGACCTCT TTTATCAGCA GATCGGACAG ACCGGCAGTG GCGATCCTAT TCAGGTCGGC GCCGGAACCA TGATCGGCAC CATCGACACC AACTTCACCG ATGGCAGCGT GAGTCCGACC GGCGTGCCAA CCGATGTGGC GATCATGGGT GACGGTTTCT TCGTGGCCCA GCAAAACGGG AACGATATTT ACACTCGCGC CGGTAATTTC AAGGTTGGCG CCGATGGGAC CCTCAGGACG CAGGATGGCG CGGTGGTGCT TGGCTACCAG GCCGTGGACG GCAAGGTCAC AACAGGATCC GGGCTGGGTG CGCTGAATCT CGGCCAAGGC CAAGTTAGTT CGCCCTCAGC GACGACTTCC CTCCAGTTGA CGACGAACCT CAATGCCAGC GCGAAAGTTG GCGACAGCTA CAACACCTCG TTAAAGGTCT ATGACTCGCT GGGGGGCGTC CACGTAGTGA CTTTTACGTT CACGAAGACT GGAACCAACA CGTGGGACTA CGACGCTTCT CTGCCCACCG GAGAAGGCAC GGTTTCGCTG CCGTCCGGCA GCCACACGCT GACCTTCGAC AGCGACGGCA AACTTACGAC TCCGTCTTCG AACATCAATT TCGACCTGAC GGGCCTGAGT GATGGCGCAA GCGACATGAA AGGGGTGACG TGGAAGCTCT ATGACGCTAC CGGCGGTTCG TCGATGACCC AGATGGCAGC GGACAGCGCG ACCCCGGCGA CGGCACAGGA CGGCTACGGC AGCGGCATGT TGCAGAACTT CAACATCGGC GCGGATGGAA CCATCGAAGG AACCTTCAGC AACGGTAAGA CTTCGATCAT CGGGCAGATC GCGATTGCGA GCTTTCCAAA TGTGCAGGGA CTCAGGAAGG TAGGCCAGAA CGCATACGTT GGGACTCTCG CGTCGGGCCA GGCAGCGCTC GGCGCGCCGG GAAGCGGCGG ACGTGGCACC CTCGGGGGGG GAGCGCTGGA GCTTTCGAAT GTTGATATGG CGACGGAGTT CTCCAATCTC ATCGTGGCGC AGCGAGGTTA CCAAGCCAAT GCCAAGGTGA TCACAACCTT CGATGAGATC ACCCAGGACA CCATTAACCT CAAGCGCTAA
|
Protein sequence | MPMFSIPLSG LTASSTALAT IANNLANQNT IGYKQTRALF RDLFYQQIGQ TGSGDPIQVG AGTMIGTIDT NFTDGSVSPT GVPTDVAIMG DGFFVAQQNG NDIYTRAGNF KVGADGTLRT QDGAVVLGYQ AVDGKVTTGS GLGALNLGQG QVSSPSATTS LQLTTNLNAS AKVGDSYNTS LKVYDSLGGV HVVTFTFTKT GTNTWDYDAS LPTGEGTVSL PSGSHTLTFD SDGKLTTPSS NINFDLTGLS DGASDMKGVT WKLYDATGGS SMTQMAADSA TPATAQDGYG SGMLQNFNIG ADGTIEGTFS NGKTSIIGQI AIASFPNVQG LRKVGQNAYV GTLASGQAAL GAPGSGGRGT LGGGALELSN VDMATEFSNL IVAQRGYQAN AKVITTFDEI TQDTINLKR
|
| |
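The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch of that translation, not the database's own method. It builds the codon table from the standard NCBI base ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper name `translate` is hypothetical.

```python
# Codon table built from the NCBI base order T, C, A, G.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
gene_start = "ATGCCGATGT TTTCGATTCC GTTGTCGGGC"
# Last nine bases of the gene sequence (two codons plus the stop codon).
gene_end = "AAGCGCTAA"

print(translate(gene_start))  # MPMFSIPLSG
print(translate(gene_end))    # KR
```

The two fragments reproduce the first ten residues (MPMFSIPLSG) and the last two residues (KR) of the listed protein sequence. Passing the full gene sequence to `translate` is the natural next step for checking the whole record.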