Gene Information |
Locus tag | Acid345_0917 |
Symbol | |
ID | 4070569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1155559 |
End bp | 1157259 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982924 |
Product | TPR repeat-containing protein |
Protein accession | YP_589994 |
Protein GI | 94967946 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
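The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; both assumptions are consistent with the values in the table but are not stated by the record. The function name check_cds is hypothetical.

```python
def check_cds(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check CDS coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and a protein length that
    does not count the terminal stop codon.
    """
    span_bp = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = span_bp // 3             # whole codons, including the stop codon
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "multiple_of_three": span_bp % 3 == 0,
        "expected_protein_aa": codons - 1,  # drop the stop codon
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for Acid345_0917.
result = check_cds(start_bp=1155559, end_bp=1157259,
                   gene_length_bp=1701, protein_length_aa=566)
print(result)
```

For this record the span is 1701 bp, which is 567 codons, or 566 amino acids plus one stop codon.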
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00563549 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGCTTGG CAACTCCCGC CGTATTCGCC CAACAGTCGG CGGCTGATTT TTATAAGCGG GGAGTTCAGG CCTACGGGCG GGGAGACGAC GCATCCGCGC TCTCGTCTTT TCAACAGGCA TCGAAACTCG ATCCCAATAA TCCCGAGTAT CAGAATGCCG TAGGCCAGGC GCTGTTCAAG CAGGGAAGAC CAGCCGAAGC AATTCCGTAT TTCCGTCATG CCCTCAAACT CCGCCCCGAT CTCGCAGTCA TTCATGCATA CCTGGGTCAA GCTCTTCTCG CCGATCACCA GGCTGATGCC GCCATTTCCG AATACCGTAT CGCTGTCAAA ATGGCTCCCA ACGAAGTCGA GGCCAATCGT GGATTGGGTC GCTCGCTCAG CACCAAAGGG GACCTCGACG GCGCCATCGC CGTCTATCGT TCCGCACTGG AGACCAATTC GCAAAGCGCG CCACTTCATG ACGATCTCGG CTCGTTGCTG GCCCAGAAAA AAGACTTCGT TGCCGCGCAA CAGCAATTCG AACAAGCCTT AAAACTCGAC CGCCAGTACG AGCCCGCACA TTTTCACCTT GGCGTCGCGC TACTTTCACA AGACAAAGAT CCTGAGGCAA TGCTTTCTTT ACAGGAAGCG GTGCGTCTCG CGCCGAACGA TGTTGCCGCC CACTTCTTTC TCGGTCGCGT TCTCGAGACA CTCGGCGACA ACGCGAATGC TCTACAGAAC TACAAAGACG CTGCCCAACG CTCTTCCGAA TTTCCCGGCC TCCAGGAGAG ACTTGGACTC ACAGCGCAAC GAGTAGGCGA AATGCCGACC GCGATCTCCG CTTTCCAGAA AGCCATCGCG CAATCCCCGC AGAACCCCGA TCTTCATAAC GACCTTGGCC TGGCATTCAT GCAGGCTGGA GATGGCGAGG GAGCTATTCG GGAATTTAAC CAGGCCCTCA ACCTGAAGCC GGAGGATGTC GGCTATCTCG GAAATCTCGG GGCCGCCTAC CTTCAGCTTT CCGAGTTCGA CAACGCCGTT GATAACTTCC GCAAAGCTCT CCAGATCGCG CCGGCCAACG CATCACTGCA CCATGATCTC GCGTTGACAT TGAAGTTGAA GGACGATCTC GCCGGAGCTG CAGCGGAGCT TCGCGAGGCC ATCCGGCTCG ATCCTAAACT CTACGACGCA CATTACACGC TGGGAGTCAC CCTTTGGCAG CAAGGCGAGT TTCCCGCCGC CGTTGAAGAA CTCGAAGCCG CCCTCGCCCA GAAGCCCGAC TATGCTGAGG CTTATTACAC CCTCGGCACC GTTTACAAGC AGATGAATAA ACCGCGTGAA TCCGCCGAAG CACTTCGCTC TGCATTGAAA ATTCAGCCCG ACTTCGCCGG CGCTCACACG ACTCTAGCCG CAGTCCTCCG TCAATTGGGC GACACCGCTG GTGCCTCCGA AGAAGCACGT ATCGGCGCGG AACTTGCAAA GAAGAAAACC GGCATGCAGG CCGCGGTGTT CGCAACCAAC TCTGGAATTC GTCTCCTAAA TGCAGGCGAT CTGGATGGGG CTGTTTCCCA ATTCCGACGG GCTACCGAGT CGGCACCCGA CTACGCCATG GGGCACTTTC AACTCGCAAC TGCACTCTCC CGCCAAGGCA AACGCGACGA GGCCGATGCC GAGTTCTCCA AGGCTGCAAC CCTTGATCCA CACCTGAAAA CCCAGAAGTA G
|
Protein sequence | MCLATPAVFA QQSAADFYKR GVQAYGRGDD ASALSSFQQA SKLDPNNPEY QNAVGQALFK QGRPAEAIPY FRHALKLRPD LAVIHAYLGQ ALLADHQADA AISEYRIAVK MAPNEVEANR GLGRSLSTKG DLDGAIAVYR SALETNSQSA PLHDDLGSLL AQKKDFVAAQ QQFEQALKLD RQYEPAHFHL GVALLSQDKD PEAMLSLQEA VRLAPNDVAA HFFLGRVLET LGDNANALQN YKDAAQRSSE FPGLQERLGL TAQRVGEMPT AISAFQKAIA QSPQNPDLHN DLGLAFMQAG DGEGAIREFN QALNLKPEDV GYLGNLGAAY LQLSEFDNAV DNFRKALQIA PANASLHHDL ALTLKLKDDL AGAAAELREA IRLDPKLYDA HYTLGVTLWQ QGEFPAAVEE LEAALAQKPD YAEAYYTLGT VYKQMNKPRE SAEALRSALK IQPDFAGAHT TLAAVLRQLG DTAGASEEAR IGAELAKKKT GMQAAVFATN SGIRLLNAGD LDGAVSQFRR ATESAPDYAM GHFQLATALS RQGKRDEADA EFSKAATLDP HLKTQK
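The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs from the standard code in which codons may act as start codons). The names CODON_TABLE and translate are hypothetical, and only the first three and last three 10-base blocks of the gene sequence are used as input.

```python
# Codon table in the conventional TCAG ordering used for genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":   # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
n_terminus = translate("ATGTGCTTGG CAACTCCCGC CGTATTCGCC")
# Last three blocks of the gene sequence (21 bases, ending in the TAG stop codon).
c_terminus = translate("CACCTGAAAA CCCAGAAGTA G")
print(n_terminus, c_terminus)
```

The two fragments translate to MCLATPAVFA and HLKTQK, matching the first block and the final residues of the protein sequence above.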