Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1220 |
Symbol | |
ID | 4068560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1504889 |
End bp | 1506718 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983229 |
Product | TPR repeat-containing protein |
Protein accession | YP_590296 |
Protein GI | 94968248 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00113386 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00622178 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAACCACA AGCGTCAACG AACAGATTTC AACTTTTCTC ACGAATTCTG TCCGGCGAGC ACCTGCAAAC CCCATGGTAT CATCCGCCTC ATCCAGCAAA TGTCTTACGT ACCGGCCCCC GACAACCGCC AACCCGGCTT GCTAAGCGGG TCCCGCGCCA TTTTGGCACT CATTCTCGGA GTCACTGCTC TCGTCTACGC CGGTACCCTG CAATTTGGCT TTACCTACGA CGACACCCCA CAAATCGTAA CCAATCCACG CATTTCCTCA TGGAGTTACC TGCCCAAGTA CTTCACGGAG CACGTCTGGG CGCAGATTTC TGCCTCCGGA ACATACTACC GGCCCTTATT TTTGTTGTGG TTACGTCTGA ATCACGCGTT ATTCGGAGTC CAGAACCCGT TTCCGTGGCA CTTGACGACC GTGCTTTTGC ATCTTCTCGC GGTGACGTTG GTTTTCCGGC TCCTGATCAA GACATTCAGC CTCGAAGTCG CCGGAATCGC GACTTTCGTG TTCGCGCTGC ACCCCGGACA CGTTGAATCC GTCGCGTGGA TCTCCGGCTG TACTGAGCCG CTCATGACCT GTGCGCTCGT CGGCGCAATC CTCTGCTGGA CGAATCGCAG AAATTCTCGG GGTGCGCTTT GGCTCGCGGC CTCATGGCTG CTCTGTCTCG CCAGCCTGTT GATCAAAGAA ACCGCAGTGC TGTTGCCGGT CCTGATCTTT GTCTATTCAC TTTTCGAAGA CCGTGAATCG CCGCGCTTCG ACGCTTACAA GCGCGCAGTC CTGAACACGC TGCCCTTCGC GGTAATCACG GTCGCGGAAT TGATGGTTCG CGCTCGCGTG CTTCGCGGCG GCGTCGCCGA CGAACCGCAT CCGGCGATGC AAACGCTGCT GACAGTTCCA TCAGCGATTT CTTTTTACAT TCAACATCTG TTCTGGCCCG TGAAATTGAG CCCTTTCTAC GGCCTCGAGC TGATGCAAAA ATTCAGCGCC GTTGTCGTGA TTCCGGCGGC TCTCGTCGCT CTCTGCGCCG TCGCACTTCT TCTTGTCCTC GCATTTCGCT CGCGCACATT GCTGATCGCC GTCGCATGGC TTGTTTTACC TGTGCTGCCT GCGCTCATTG GCATCCGCCT CTTTGATAGC AACGATATCG TTCACGACCG CTATCTCTAT CTCTCGACGA TCGGACTAGG ACTTCTTCTT GGACTTGCGA TCTCCCGGCT TCCGGCGGCG GGCACAGAGA TTTTCCGCCT CCCACGCGCG CAATTTGCGT GCATCGCGCT TATCGCGATC GCAATTGCTG CTGGAACCGC GCTGGAAATT CGCCCCTGGA GCAACAATCT CGCCTTGTTC CTCCGTGGGG TCGACGTCGC ACCCAACAGC ACGCCGGCCT ACAGTCATCT CGCCTTCGAG GTTTACAAGC GCGGCGATGC CGCGGATGCC GAGCGCCTCT ACAAACACGC CGTGGCGCTT GGCCCCAACG ACTGGCCCGC GAATTTTGGC TTGGCTATGA TCGAAATGCG CATGTCGAAC TGGAACGAGG CCGACCGATT TTTCCAACGC GCAATCGAGA TCAGCCCTTC GGTGAGCAAT GGAAGCTATC TTCTGCAAGC ACGGGTCCGG GTCGAAATGC AGCATTACGA TGCCGCCGAA AAAAGCGTGC GGGAAGCGAT CGACAACTGG CCGAACATCG CGAGCCAGCA TTTATTATTG GCTCAGATCC TGACAAAGCA AGGGCGCATT GAGGAAGCCC GCTCTGAATA TCAGAAGGAG TTGACGCTTA ATCCCACCTC CACCGAAGCA CGGATGGGAC TCGCGGAAAT TGGGCAGTGA
|
Protein sequence | MNHKRQRTDF NFSHEFCPAS TCKPHGIIRL IQQMSYVPAP DNRQPGLLSG SRAILALILG VTALVYAGTL QFGFTYDDTP QIVTNPRISS WSYLPKYFTE HVWAQISASG TYYRPLFLLW LRLNHALFGV QNPFPWHLTT VLLHLLAVTL VFRLLIKTFS LEVAGIATFV FALHPGHVES VAWISGCTEP LMTCALVGAI LCWTNRRNSR GALWLAASWL LCLASLLIKE TAVLLPVLIF VYSLFEDRES PRFDAYKRAV LNTLPFAVIT VAELMVRARV LRGGVADEPH PAMQTLLTVP SAISFYIQHL FWPVKLSPFY GLELMQKFSA VVVIPAALVA LCAVALLLVL AFRSRTLLIA VAWLVLPVLP ALIGIRLFDS NDIVHDRYLY LSTIGLGLLL GLAISRLPAA GTEIFRLPRA QFACIALIAI AIAAGTALEI RPWSNNLALF LRGVDVAPNS TPAYSHLAFE VYKRGDAADA ERLYKHAVAL GPNDWPANFG LAMIEMRMSN WNEADRFFQR AIEISPSVSN GSYLLQARVR VEMQHYDAAE KSVREAIDNW PNIASQHLLL AQILTKQGRI EEARSEYQKE LTLNPTSTEA RMGLAEIGQ
|
| |