Gene Information |
Locus tag | Acid345_4747 |
Symbol | |
ID | 4070685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5610387 |
End bp | 5612792 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986791 |
Product | cell surface glycoprotein (s-layer protein) related protein-like |
Protein accession | YP_593820 |
Protein GI | 94971772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
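The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5610387
END_BP = 5612792


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2406, matching "Gene Length"
print(protein_length(length_bp))  # 801, matching "Protein Length"
```

The strand field ("-") does not affect this arithmetic, because the length is the same on either strand.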
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0992088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCC ACTCGTCGCT TCGCCTGCTG CTTGCCAGCT TGTGTCTCCT CGTGTCTTTC ACTTTGCCGG CCCTCGCCCA GGGCATTGAT CTGCCACTCA GCTTCGAGCC CAATCTTGGG CAATCCGATC CGGCGGTGCG CTTCTTGTCG CATGGAAAGG GATACGGAAT CTATTTGTCG CAGAACACGA CAGTTCTGCA GATGGGTTCC GACCGGCTCG CGTTACAAGT CGCGGGCGGG CAAGCTCCCA GCGCAATCCG TGGCGAAGAG CCATTGACGG GCAAAGTGAA TTACCTGCGT GGCGCCGATC GTTCGAGTTG GCTGCGAGGG GTTCCGACCT ATGCGCGGGT GAGGATGAGC TCGGTGTATC CGGGCGTGGA CCTGGTTTAC TACGGCAACC ATCGCCAGCT GGAATACGAC TTCGTGGTGC ACCCCGACGC GGATGCAAAG CAGATTGGGT TTGCGGCGAA GGACGCTGAG CTTCGGCTGA ACCGCGATGG TCAGCTCACG ATGACAGCGG GCGCTGCCGA AGTGCATTGG CATGCGCCGG TGGCTTACCA GGAGATTGAC GGCAAACGCC ACGCGGTAGA GGCGAAATAT GAAATTGCGG GTTCGATGAT CCGGTTTCAC GTCGGTGCGT ATGACCATTC GCACGATCTC GTGATTGACC CGGTGATGGT TTATTCAACG TACGTCGGCG GCAACGGTGG TGAGACGGGC GATGTTGGCA ACGCAATCGC GGTGGATGCG GCGGGGAACG CCTACATCGC AGGAGTGGCG TCATCTACCA ATTTTCCAGT GACGAGCGAG GCGATGCAGC CGTCGTCGCG TGGGAATGAC GATGCCTTCG TGGCGAAGAT CAATCCGCAG GGCACGGGCT ACGTGTATGC CACTTATCTT GGAGGCGGCG GACAGGACAT TGCCTGGGGG ATCGCGATTG ATGGCGCAGG CAACGCGTAC GTCACGGGGC AAACTGGTTC CGGACTGCAT GGACAGGCGG CGTTTCCGAC GACAGCGGGC GCTTATCAGC GCACGCAAAA TGCAAATGTG CTGAACAACA GTGTGTTCGT TGCAAAGCTC AGCGCGGATG GCACCGACCT GCTCTACTCG ACGTATCTGA CTGGCACGAA CGATTCTACG GCGTCGGGAA TTGCGGTGGA CGGCGGGGGA AATGCTTATG TGCTCACGAA CACCGCGGGC GGATTCCCGG TATCAGGCGC CGCATATCAG AAGACGGCAG GCACAGACCA GTGTCCGTAC GAACAGTTTG CTGACGGCCA GGCACAAGTG GTCACGAAAG TAAATGCGAC GGGATCGGCG CTGGTGTACT CGACGTATGT CGGCCACGGA TGCGATTACG GTGCGGGCAT CGCGGTGAAC ACCGCGGGCG AAGCTTACAT TGTGGGGCAT ACGCAGGACA GCGCTTATCC GGTAACAAGC GGTGAGGTGG GATCGACGTT CGGCGGTGTG GTAGATGGAT TCGTGACGCG CCTCAACGCG AGCGGAAGTG GGATCGTGTA TTCCACGTTC CTTGGCGGTT CTCTAGCTGA TTTTGCGAAC GCGGTCGCGC TGGATTCTTC GGGATATGCG TACATCGCGG GTGGCACGGA TGGCGACTTT CCCACGACTT CGAGCGCGTA TCAGACAACG GCGAGCAACA ACGGCTACCG CAAGGGATTC GTCACGAAGC TTAGTCCGAT GGGCAAGGCG CTGATTTATT CGACGTACAT TCGCGGCGCG GCAAATGTGT CGTTCAGTTC GATTGCTGTG GACAAGAGCC ACTATGCGCA CGTTACTGGC 
TATTCGGATG GGAGCCAATA TCCGGCGACG AGCACGGCCG TGCAGGGCAC GTGCCACCAG GGACCGAGTG GCTGCCTGAC GCAGGCGGTG GTGACGAAGG TGAACGCGAC GGGCTCGGGA TTGTTGTATT CGAGCTACTT CGGCGCGAGC GACGCCAGCA ATAACTACTT CCCGGGGAAC ATAGGCAATG GCATCGCCGT GGACAACAAC GGCGGGTTCT ACATCACGGG GCGCACCAGC GCGGGGCTGA AGACGACCAG CAGCGCGGCG GAACCGAGTT ATCGTTCGAA CAGCAACAGC ACGGATGCGT TCGTGGCGAA GTTCAACGTG TATGGAACGT CTTCGGCTAC CAAGGTGATC GTGCTCTTGC CTTTGGACGG ATCGCTAGTG ACGGCAAAGG CCGGCGTCAG CGCAACGGCT CTCGGGAGTT CCAGCCCGGT GGCGTACATG CAGGTGTACG TGGATGGCGT GAGAAAGGCG CAGGTTTCCG GCAGCACGAT CCTAACCGTC GTTTCGCTGG GGACGGGCCA ACACCGGATT ACAGCGCAGG CGATCAACAA AGACAGCTCG ATTGCGAAGA GCACGGTGTA CGTCACCGCT AAGTAG
|
Protein sequence | MPRHSSLRLL LASLCLLVSF TLPALAQGID LPLSFEPNLG QSDPAVRFLS HGKGYGIYLS QNTTVLQMGS DRLALQVAGG QAPSAIRGEE PLTGKVNYLR GADRSSWLRG VPTYARVRMS SVYPGVDLVY YGNHRQLEYD FVVHPDADAK QIGFAAKDAE LRLNRDGQLT MTAGAAEVHW HAPVAYQEID GKRHAVEAKY EIAGSMIRFH VGAYDHSHDL VIDPVMVYST YVGGNGGETG DVGNAIAVDA AGNAYIAGVA SSTNFPVTSE AMQPSSRGND DAFVAKINPQ GTGYVYATYL GGGGQDIAWG IAIDGAGNAY VTGQTGSGLH GQAAFPTTAG AYQRTQNANV LNNSVFVAKL SADGTDLLYS TYLTGTNDST ASGIAVDGGG NAYVLTNTAG GFPVSGAAYQ KTAGTDQCPY EQFADGQAQV VTKVNATGSA LVYSTYVGHG CDYGAGIAVN TAGEAYIVGH TQDSAYPVTS GEVGSTFGGV VDGFVTRLNA SGSGIVYSTF LGGSLADFAN AVALDSSGYA YIAGGTDGDF PTTSSAYQTT ASNNGYRKGF VTKLSPMGKA LIYSTYIRGA ANVSFSSIAV DKSHYAHVTG YSDGSQYPAT STAVQGTCHQ GPSGCLTQAV VTKVNATGSG LLYSSYFGAS DASNNYFPGN IGNGIAVDNN GGFYITGRTS AGLKTTSSAA EPSYRSNSNS TDAFVAKFNV YGTSSATKVI VLLPLDGSLV TAKAGVSATA LGSSSPVAYM QVYVDGVRKA QVSGSTILTV VSLGTGQHRI TAQAINKDSS IAKSTVYVTA K
|
| |
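The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of stripping the whitespace, computing a GC fraction for comparison with the reported GC content, and checking that a coding sequence ends in a stop codon. It assumes the three standard stop codons (TAA, TAG, TGA), which translation table 11 uses; the helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Demonstration on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGCCCCGCC ACTCGTCGCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.7
```

Pasting the complete gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `ends_with_stop` would allow a comparison with the GC content and gene length fields of the record.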