Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4601 |
Symbol | |
ID | 4071546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5448410 |
End bp | 5449606 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637986641 |
Product | TPR repeat-containing protein |
Protein accession | YP_593675 |
Protein GI | 94971627 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.458402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTGA TCGGGGCTGT GTGCTTGTTG TCGTTGTGCG CGAGCGTTTT TGCGCAGGAC CTTTCGGACA AGGCGGAGTA CGACAAGCTG AAAGCGCACG CGACGGAGCT CTTCAATCAG AACAATTTCC TCGCAGCTTT GCCGGAGCTC CAAAAACTCG CAGACCAGAA CCCGAAAGAT TATGCAGTGC TGGAGGCGCT AGGTTTTGCG CTCGCCAGCA AAGCGCTTCT GGAAACCGAT GCCGACCAGC GTAAGGCCGA CCGCATTGCT GCGCGCAAGC ACCTGCTGGA GGCCAAAAAA CTCGGCGATA ACAGCGAGAT GATCAACTAC CTGCTAGAAA CGACCCCGGA AGACGGCACC CCGCGAAAGT TCTCCGACAA CAAAGAGATC GAACGGCTGA TGCAAACCGC CGAAGCGCAT TTTGCGAAGG GAGAACTCAA CGAGGCAAAG GCCGGATATC TCCAGGTGCT GCTGCTCGAT CCCGAGAATT ATGCAGCGGC GTTGTTCACT GGAGATGTGT ATTTCAAGGA TGGCAAGTAC TGCAGCTCCA TCCAGTGGTT CCAGAAAGCG ATTGAGATAG ACGCCAACAC CGAAACCGCC TACCGATACT GGGGCGATGC ACTCGACCAC CTGGGCCAGA AAGACGAAGC GCGACGAAAG TTTATGGAGG CGGTGATCGC CGACCCGTAC AACAATCGTC CATGGCAACA CTTGTACCAG TGGATGAAAA CGCAGGGCCA CGAACTGACG GTTCCCAAGA TACAACCGCA GGCCTCGGTG AACGTGGAAT CGGACAAGAA AATCAATATT ACGGTGAACT CAGGTAGCGT CGAGAAGCAC GATGGCAGCG CTGCGTGGAT GACATATGGA ATCGGCCGCG CGGCTTGGCA AGGTGAGAGG TTCAAGAAGG AATTTCCGAA CGAGCCGAAG TATCGCCACA CGCTGCGCGA GGAGAATCAT GCACTCTCGC TCGTCGTAAG CTCGGTGAAG AGTCAAAAAG ACATCAAACA GCTTGACCCG CAGCTCGCAA CACTGGTGAA GATATCCGAC GCCGGACTGC TCGAGCCGTA CATCCTGCTC AATGCGGCAG ACCAAGGCAT TGCCCAAGAC TACGCGCCGT ATCGCAAGGA ACACCGCGAT CTGCTCTACA AATATCTCGA TACGATTGTT GTCCCGCAGT TGAAGCCGGG GCTCTAG
|
Protein sequence | MRLIGAVCLL SLCASVFAQD LSDKAEYDKL KAHATELFNQ NNFLAALPEL QKLADQNPKD YAVLEALGFA LASKALLETD ADQRKADRIA ARKHLLEAKK LGDNSEMINY LLETTPEDGT PRKFSDNKEI ERLMQTAEAH FAKGELNEAK AGYLQVLLLD PENYAAALFT GDVYFKDGKY CSSIQWFQKA IEIDANTETA YRYWGDALDH LGQKDEARRK FMEAVIADPY NNRPWQHLYQ WMKTQGHELT VPKIQPQASV NVESDKKINI TVNSGSVEKH DGSAAWMTYG IGRAAWQGER FKKEFPNEPK YRHTLREENH ALSLVVSSVK SQKDIKQLDP QLATLVKISD AGLLEPYILL NAADQGIAQD YAPYRKEHRD LLYKYLDTIV VPQLKPGL
|
| |