Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3263 |
Symbol | |
ID | 4072675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3865468 |
End bp | 3866586 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985284 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_592338 |
Protein GI | 94970290 |
COG category | [S] Function unknown |
COG ID | [COG4641] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.364975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC GTTTCAAAAT CCGCTTCTTC GCACACTCCT GGGTCTCGGA CTGGAACCAC GGCAACGCGC ATTTCCTGCG CGGTCTCGCC CGCGAACTCG GCAAACTAGG GCACGAAGTC CGCTGCTTTG AAGAACTCGG GGCGTGGTCG CTCTCCAATC TTGTGAACAG CGAATGCGAT CGCGCCATTG AATCCATTGA CCAGTTCCGC GCCAAGTTCC CCATGCTCGA CGTGCGGTTC TACGATCGCA ACAACAAGTT CGATGAGTTT CTCGACGAGC ATCTAAGCGA CGCCGACTTC GTCATCATCC ACGAATGGAA CGATCCGGCG ATTGCGAGTG CGATCCTGTC GCGCAAGGAG AAATTCGGTT TTCGCGCGTT GTTTCACGAC ACTCACCATC GCGCCTATTC TCGTGCCGGA GAGATCCTTC GCTTTCCGCT CCACCTGTTC GACGGCGTTC TAGCATTCGG CGAGCCCATC ACCCGCATCT ATCGCGACGG CTTCGGTATT CCAAAGGTCT GGACCTTCCA CGAAGCCGCC GACATCGACA ACTTCCATCC CATCAGGTCC GACAAATCCA CCGACGTAGT GTGGGTCGGG AATTGGGGGG ATGAAGAGCG CACGAAGGAA CTTCTCGAGT TCCTAGTCCG TCCCGCAACC GAGTTACCTG ACAGAAAATT TAAGGTCCAC GGCGTTCGCT ATCCGGAGAT CGCGATCCAG ACCCTCGAGA CAGCCGGAAT CGATTATCAG GGCTATCTGC CGAATCTCTC CGTGCCGGCG GCATTTTCGC AGAGCTGCGT CGCATTGCAC GTCCCTCGTC GCGAGTACGC CAACGGACTG AGCGGCATCC CGACGATCCG CGTCTTCGAA GCGCTCGCCT GCGGCGTACC TCTGGTGTGT TCCCCATGGA GCGACGCCGA AAATCTCTTC CGTCCTGGAC ACGACTACCT CGTGGTGGAG AGCGGCGAAG CGATGACCGC GGAACTCGCT CACTTGTTGA GCGACGACGC TGCCCGCACC CAACTCAGCG CTAGCGGTAT GGAAACCGTG CGCGCCCGCC ACACCTGCGC CCATCGCGCG CAACAACTAC AGGAAATTTG CGAGGAGATG ACAAGGTGA
|
Protein sequence | MSDRFKIRFF AHSWVSDWNH GNAHFLRGLA RELGKLGHEV RCFEELGAWS LSNLVNSECD RAIESIDQFR AKFPMLDVRF YDRNNKFDEF LDEHLSDADF VIIHEWNDPA IASAILSRKE KFGFRALFHD THHRAYSRAG EILRFPLHLF DGVLAFGEPI TRIYRDGFGI PKVWTFHEAA DIDNFHPIRS DKSTDVVWVG NWGDEERTKE LLEFLVRPAT ELPDRKFKVH GVRYPEIAIQ TLETAGIDYQ GYLPNLSVPA AFSQSCVALH VPRREYANGL SGIPTIRVFE ALACGVPLVC SPWSDAENLF RPGHDYLVVE SGEAMTAELA HLLSDDAART QLSASGMETV RARHTCAHRA QQLQEICEEM TR
|
| |