Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2980 |
Symbol | |
ID | 4068881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3528379 |
End bp | 3531759 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637984999 |
Product | integrin-like protein |
Protein accession | YP_592055 |
Protein GI | 94970007 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.16679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAAG TGGCTGCGGT TAGTTTCTTT TTAGTTTTCA CTGCGGTATT CAGCGCATAC TCTCAGGATC TCACTCAGGC AAAGCGCACG CCTGCGGCGT CTGGCTCGAC GCAACCCAAC GTATTTCTCA CACCCAAACA ATATCCCGCT GGTCCTTCCG GTGTCACTTC GATCGCTAAG GGCGACTTCA ATAACGATAG TTACATGGAT GTCGCGGTAA CTAACGTTTC TGGCACGATC ACGGTTCTCC TGGGCAAAGG CGATGGCACT TTCCAAGCCC CCGTATCGTA CCCAGCCTTA TCTTCCCCGG TCTCGATCGC CGCGGCAGAT TTGAATGGGG ACGGAAAATT AGACTTAGCG GTAGCAAACA GCGGTAGTGG GAGCATTAGC GTATTCCTCG GAAATGGCGA CGGAACCTTC CAATCGCACA CGGATGTTGC CGTCGGCACG AGCGTGCAAA TGTTGACCGT TGCAGACTTC AACGGCGACG GCAAACCCGA CCTTGCGGTT TTAGTCGATG GCATGGTGAG CGTCCTGATC GGGAAAGGCG ACGCCACCTT CAACGCGATA GGCGAGTACG CGAAACCATG CGCTACCTAT TTGGCCACAG GGGATTTCAA CGGAGACGGC AAGACCGACA TCGTGGCAGG ACGTCAGTGC GTCCTACTGG GCAATGGCGA CGGCACTTTT CAACCGCCTG TAGGTTCCCA GAAGATTGGG AACACGGTGA GTACTGCAGT AGGCGACATC AATGGTGACG GCAAGCTCGA CTTGATAGAG GGTGGTATCG GCGACTCGGA TGGAACTCCA AGGGCCCTGG TTGTCGTGCT CCTGGGCAAT GGAGACGGAA CGTTTCAACC GCCCCAAGGC TTTTTTGGTT ATGGGAGCGG TGTACAGGGA TTATTGCTTG CCGATGTGAA CGGCGATTCC CATCCAGACA TCGTGCTGTC CAGCTCTGAA AACGTGGAAG TTGTAAACGG AAAGGGGGAC GGCACCTTTG AACCGGGTGT GCTCTACCCC GTGGGGAACC GGCCGGTAGC AGGAGGGTTG GTGCTGTCCG ATTTTACCGG CAGCGGCAGG CTTGATCTCG CAGTTTTGAC CTCGTGCGCT AACCCGTCTA TTTGTGGAGA CGGAGCCGTG ACCCTATTGA GAGGCAAAGG GGACGGCACG TATGTTGCGC CAGCAAGTTA TTACATTTTT GAGGGTGACG AACGTTTCGA CGCTGTAGGG GGATGGGTCG CGGTGGGTGA CTTCGACGGC GACGGGAAAC TCGACGTACT CGAAGTATTC GACACTCGCG CCTTCATCTC TCTAGGAAAC GGGGATGGCA CCGTGCAAAC GGGCCAACGT TATTATCAAG TTGCATATCA GTCCGATGGG GCGGTAGTGG GAGATTTCAA CGGCGATGGC AAACTCGACG CTGCCATTCT ACACTCTTGC GACGTCTTTT ATAGCGACCC TGGCCCAGGC AATCCTCCCC CCTATTGCGT TAGCGCAGGT TCGGTAGGGG TGCTATTGGG AAATGGAAAC GGAACTTGGC AGCGGTGGCC GGAGAGCCTG TACTTTGGAG TTGGAGACAC GCCTACATCT ATTGCGACGG GTGACTTCAA TCAAGATGGC AAACTCGACC TTGTCGTTTC TGATGGAGCG AATGCTTATA TTCTCTTAGG AAACGGTGAT GGCACTTTCC CGGTCCACCA AGCCTACCCA ACCGGAGCTG CGGCTGATTA CCTTAATCCG TTTCTGCCCA ATGCGCAATC GGTCGTAGTT GGCGATTTCA ATGGCGACGG CGCTCCGGAC GTCGCGGTTT CAAATTCAGA CGGGGGTATC GCCGTTTTGC TCGGCAACGG GGACGGCACG CTCCGCGCCC CCCAACTCTT CCCAGCTATA AAGAGTTCGC AGTCGCTTGC GATCGGAGAT CTTAACCGAG ATGGCAAATT GGACATCGTA GCTTCCGACG GGAGTGGCAG CATCAGTATT TTCCTTGGTA ATGGCGATGG TACTTTCCAG ACCAGCAAAG TTTATGCGGC CGTTGGATCA CAAAGTGTAA CGGTCGGAGA TTTTAACGGC GACGGCATCC TTGACGTAGC GAGTGGGACG GGACACACCG TAAGCCTGCT TCTCGGAAAT GGCGATGGCA GTTTGCAGCC GCCGGTGAAT TACATTGTTG GACTCAGTGC AACCGGGCTG GCAGCTGGAG ACTTCAATGG GGACGGGGCT TTGGACCTCG TCACCGAAGA CTTCTCGATT CTATTGAATC GCCAGGGAAC GCAGCTTAAC GTGCAATCTT CCCGCAATCC ATCCAACGTA GGCCAACCTG TGACTTTCAC CGTGACCTCC GCTGCGAGTT TGCCGGAGAC AGGCTTGCCT TCGGGCACCA TCACGCTGCG CGATGGGAGC ACTGTGCTGG GAGAATCCGG GGTATCGGGA GTATTTGACG TGAAGGTCTC CGGATTGACT GCCGGAACGC ACCAAATCAC GGCCACATAC TCCGGCGATA ACAACTTTCA ACCACACACG ACCGCCATCC TGACAGAACA CGTCGGTGCT CCCGCCACGA TGATTAGCCC CGCTCTTGGA TCGATTCTAA CCAGCACTAC CGTAACCTTC GCGTGGAAAG CAGCCGCGGG TGCTTCGCAG TACAGCCTGT ACCTCGGCAC TAAACCCGGT CGGGACGACC TCGGGTACGT CAATGCGCAT TCGAGCACGT CCGCGACTGT GAAAAACCTC CCATCCACGG GATCCTCCCT ATACGTCACG CTCTTCTCGC TTGTCGGAGG GGTGTACTAT TCAAATTCTT ATACGTATAT CCTCCCTGGA ACACCTGCCA AGGCCAAGAT GACCTCGCCC CTGCCCGGCA CGATGTTAGT CGGCAAGGAT GCGACATTTA CGTGGAGCCA CGGAACGGGC GTCACCTACT ACAGTCTGTA TGTCGGGACA AAGGGTTACG GCACTCACGA TCTGGATTTC ATCAACGCCA CGACTACCAG TGCCAGCGTG TCGAACCTCC CTGCCGACGG GAGCACAATC TACGTTCAAG TAAATTCGTA TATCGACGGC GCGTGGACTA GCCAGAGCTA CACCTATATA AGTGGAAGTG GAACTCCCGC GCCCGCAACC ATGATCTCGC CCACCCCTGG AAGCAGTATC TCCGGCAACT CTGCGACTTT CACATGGACG AGCGGCGTTG GAGTCAGCGA ATTCAGTTTG TACGTAGGTA CGGGAGGGGT GGGTTCCCAT AACATTGCGT TCATCGAAAC CGGAACCACG AGTGCGACCG TCACTGGCCT TCCCGCTACC GGCGCAACGA TCTATGTGCG CTTGAATTCG TTTGTCAACG GCGCGTGGCA GTGGGTGGAC TATTCTTATC GGAACCCGTA A
|
Protein sequence | MRQVAAVSFF LVFTAVFSAY SQDLTQAKRT PAASGSTQPN VFLTPKQYPA GPSGVTSIAK GDFNNDSYMD VAVTNVSGTI TVLLGKGDGT FQAPVSYPAL SSPVSIAAAD LNGDGKLDLA VANSGSGSIS VFLGNGDGTF QSHTDVAVGT SVQMLTVADF NGDGKPDLAV LVDGMVSVLI GKGDATFNAI GEYAKPCATY LATGDFNGDG KTDIVAGRQC VLLGNGDGTF QPPVGSQKIG NTVSTAVGDI NGDGKLDLIE GGIGDSDGTP RALVVVLLGN GDGTFQPPQG FFGYGSGVQG LLLADVNGDS HPDIVLSSSE NVEVVNGKGD GTFEPGVLYP VGNRPVAGGL VLSDFTGSGR LDLAVLTSCA NPSICGDGAV TLLRGKGDGT YVAPASYYIF EGDERFDAVG GWVAVGDFDG DGKLDVLEVF DTRAFISLGN GDGTVQTGQR YYQVAYQSDG AVVGDFNGDG KLDAAILHSC DVFYSDPGPG NPPPYCVSAG SVGVLLGNGN GTWQRWPESL YFGVGDTPTS IATGDFNQDG KLDLVVSDGA NAYILLGNGD GTFPVHQAYP TGAAADYLNP FLPNAQSVVV GDFNGDGAPD VAVSNSDGGI AVLLGNGDGT LRAPQLFPAI KSSQSLAIGD LNRDGKLDIV ASDGSGSISI FLGNGDGTFQ TSKVYAAVGS QSVTVGDFNG DGILDVASGT GHTVSLLLGN GDGSLQPPVN YIVGLSATGL AAGDFNGDGA LDLVTEDFSI LLNRQGTQLN VQSSRNPSNV GQPVTFTVTS AASLPETGLP SGTITLRDGS TVLGESGVSG VFDVKVSGLT AGTHQITATY SGDNNFQPHT TAILTEHVGA PATMISPALG SILTSTTVTF AWKAAAGASQ YSLYLGTKPG RDDLGYVNAH SSTSATVKNL PSTGSSLYVT LFSLVGGVYY SNSYTYILPG TPAKAKMTSP LPGTMLVGKD ATFTWSHGTG VTYYSLYVGT KGYGTHDLDF INATTTSASV SNLPADGSTI YVQVNSYIDG AWTSQSYTYI SGSGTPAPAT MISPTPGSSI SGNSATFTWT SGVGVSEFSL YVGTGGVGSH NIAFIETGTT SATVTGLPAT GATIYVRLNS FVNGAWQWVD YSYRNP
|
| |