Gene Information |
Locus tag | Acid345_2162 |
Symbol | |
ID | 4073104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2581408 |
End bp | 2582391 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984178 |
Product | curli production assembly/transport component CsgG |
Protein accession | YP_591237 |
Protein GI | 94969189 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1462] Uncharacterized protein involved in formation of curli polymers |
TIGRFAM ID | |
|
|
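The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2581408
end_bp = 2582391

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 984
print(protein_length)  # 327
```

Under those assumptions the results agree with the Gene Length (984 bp) and Protein Length (327 aa) fields.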
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0328897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGTT TCACTACTGC TGCGGTGATT GCGCTGTTGC TGCTCTCCGT GTCTGCCTTC CCTCAGGCGA CACGCAAGAA ACGCGTTGCG ATCATGAGCT TCGACTACGG CACTGTTCAT AGCAGCGTCG CCGCCATCTT CGGTTCCGAT CAAGATGTGG GGAAGGGTAT CTCCGACCTT CTTATTCAGA AGCTCGTCAA CGACGGAGAT TATTCCGTGA TCGAGCGTGC GCAGCTCGAC AAGATCATGG CCGAGCAGAA CTTTTCCAAT AGCGATCGCG CCGATCCGAA CAGTGCCGCG AAGATCGGCC GTCTGCTGGG CGTGGATGCC ATCATCACCG GCAGCATTAC CCAATTCGGT CGCGATGACC AGCACACCAA CGTTGGCGGA GGCGGCTACG GCGGGATTAC CGGCCGCTAC GGCATCGGCG GCGTAGGCAC GCACAGCGCA AAGGCGGTGG TCGGTATCAC CGCTCGCCTG GTTGATGTAA ACACCGCGGA GATCCTTGCG GCTTGTACCG GCACGGGAAC TTCGAAACGC AGTGGCGTTT CACTGCTCGG AGCAGGCGGC AGCGGGTGGA ACGGCGGCGG CGGATCGCTA GATATGGGGA GCTCAAACTT CGGCGAGACG ATCCTCGGCG AAGCGGTGCA TCAGGCCGTG GATTCCCTGG GAGCACAACT CGACGCCAAG GCTGGCGCAC TTCCGACGAA CAAGGTTGTG GTTAGCGGCG TGGTGGCTGA TGTCTCCGGA AACTCGATCA TCATCAACGT CGGCAGCCGG CAGGGCATCA AGGTCGGCGA CCAGCTCGAT GTCGAACATC CGACCCGCAC GGTGAAAGAT CCGACGACCG GCAAGGTCCT GCGAACTGTG TCTGACCACC TCGGCAGCGC GACTGTGACC GAAGTGGATG AAGGCTCAGC AACACTCAAC TTCAACGGCA GCGGCAAGCC CGCCGTGGGC GACACCGTGA AGTCTCCTCA GTAG
|
Protein sequence | MRRFTTAAVI ALLLLSVSAF PQATRKKRVA IMSFDYGTVH SSVAAIFGSD QDVGKGISDL LIQKLVNDGD YSVIERAQLD KIMAEQNFSN SDRADPNSAA KIGRLLGVDA IITGSITQFG RDDQHTNVGG GGYGGITGRY GIGGVGTHSA KAVVGITARL VDVNTAEILA ACTGTGTSKR SGVSLLGAGG SGWNGGGGSL DMGSSNFGET ILGEAVHQAV DSLGAQLDAK AGALPTNKVV VSGVVADVSG NSIIINVGSR QGIKVGDQLD VEHPTRTVKD PTTGKVLRTV SDHLGSATVT EVDEGSATLN FNGSGKPAVG DTVKSPQ
|
| |
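The sequences in this record are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC fraction calculation and a rough check that a sequence looks like a complete CDS. The function names are hypothetical, the start codon set is only a common subset of those allowed by translation table 11, and the toy input is not taken from this record.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Assumption: only the most common start codons are checked here;
# translation table 11 permits additional, rarer initiation codons.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Join the space-separated blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3, with a start and a stop codon."""
    seq = clean(seq)
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration only.
toy = "ATGAAA TAG"
print(clean(toy))                    # ATGAAATAG
print(looks_like_complete_cds(toy))  # True
print(gc_fraction("GGCC AATT"))      # 0.5
```

Pasting the Gene sequence field into `gc_fraction` would give a value to compare against the GC content field, which is reported here only to the nearest percent.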