Gene Acid345_4747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4747 
Symbol 
ID4070685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5610387 
End bp5612792 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content60% 
IMG OID637986791 
Productcell surface glycoprotein (s-layer protein) related protein-like 
Protein accessionYP_593820 
Protein GI94971772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0992088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCC ACTCGTCGCT TCGCCTGCTG CTTGCCAGCT TGTGTCTCCT CGTGTCTTTC 
ACTTTGCCGG CCCTCGCCCA GGGCATTGAT CTGCCACTCA GCTTCGAGCC CAATCTTGGG
CAATCCGATC CGGCGGTGCG CTTCTTGTCG CATGGAAAGG GATACGGAAT CTATTTGTCG
CAGAACACGA CAGTTCTGCA GATGGGTTCC GACCGGCTCG CGTTACAAGT CGCGGGCGGG
CAAGCTCCCA GCGCAATCCG TGGCGAAGAG CCATTGACGG GCAAAGTGAA TTACCTGCGT
GGCGCCGATC GTTCGAGTTG GCTGCGAGGG GTTCCGACCT ATGCGCGGGT GAGGATGAGC
TCGGTGTATC CGGGCGTGGA CCTGGTTTAC TACGGCAACC ATCGCCAGCT GGAATACGAC
TTCGTGGTGC ACCCCGACGC GGATGCAAAG CAGATTGGGT TTGCGGCGAA GGACGCTGAG
CTTCGGCTGA ACCGCGATGG TCAGCTCACG ATGACAGCGG GCGCTGCCGA AGTGCATTGG
CATGCGCCGG TGGCTTACCA GGAGATTGAC GGCAAACGCC ACGCGGTAGA GGCGAAATAT
GAAATTGCGG GTTCGATGAT CCGGTTTCAC GTCGGTGCGT ATGACCATTC GCACGATCTC
GTGATTGACC CGGTGATGGT TTATTCAACG TACGTCGGCG GCAACGGTGG TGAGACGGGC
GATGTTGGCA ACGCAATCGC GGTGGATGCG GCGGGGAACG CCTACATCGC AGGAGTGGCG
TCATCTACCA ATTTTCCAGT GACGAGCGAG GCGATGCAGC CGTCGTCGCG TGGGAATGAC
GATGCCTTCG TGGCGAAGAT CAATCCGCAG GGCACGGGCT ACGTGTATGC CACTTATCTT
GGAGGCGGCG GACAGGACAT TGCCTGGGGG ATCGCGATTG ATGGCGCAGG CAACGCGTAC
GTCACGGGGC AAACTGGTTC CGGACTGCAT GGACAGGCGG CGTTTCCGAC GACAGCGGGC
GCTTATCAGC GCACGCAAAA TGCAAATGTG CTGAACAACA GTGTGTTCGT TGCAAAGCTC
AGCGCGGATG GCACCGACCT GCTCTACTCG ACGTATCTGA CTGGCACGAA CGATTCTACG
GCGTCGGGAA TTGCGGTGGA CGGCGGGGGA AATGCTTATG TGCTCACGAA CACCGCGGGC
GGATTCCCGG TATCAGGCGC CGCATATCAG AAGACGGCAG GCACAGACCA GTGTCCGTAC
GAACAGTTTG CTGACGGCCA GGCACAAGTG GTCACGAAAG TAAATGCGAC GGGATCGGCG
CTGGTGTACT CGACGTATGT CGGCCACGGA TGCGATTACG GTGCGGGCAT CGCGGTGAAC
ACCGCGGGCG AAGCTTACAT TGTGGGGCAT ACGCAGGACA GCGCTTATCC GGTAACAAGC
GGTGAGGTGG GATCGACGTT CGGCGGTGTG GTAGATGGAT TCGTGACGCG CCTCAACGCG
AGCGGAAGTG GGATCGTGTA TTCCACGTTC CTTGGCGGTT CTCTAGCTGA TTTTGCGAAC
GCGGTCGCGC TGGATTCTTC GGGATATGCG TACATCGCGG GTGGCACGGA TGGCGACTTT
CCCACGACTT CGAGCGCGTA TCAGACAACG GCGAGCAACA ACGGCTACCG CAAGGGATTC
GTCACGAAGC TTAGTCCGAT GGGCAAGGCG CTGATTTATT CGACGTACAT TCGCGGCGCG
GCAAATGTGT CGTTCAGTTC GATTGCTGTG GACAAGAGCC ACTATGCGCA CGTTACTGGC
TATTCGGATG GGAGCCAATA TCCGGCGACG AGCACGGCCG TGCAGGGCAC GTGCCACCAG
GGACCGAGTG GCTGCCTGAC GCAGGCGGTG GTGACGAAGG TGAACGCGAC GGGCTCGGGA
TTGTTGTATT CGAGCTACTT CGGCGCGAGC GACGCCAGCA ATAACTACTT CCCGGGGAAC
ATAGGCAATG GCATCGCCGT GGACAACAAC GGCGGGTTCT ACATCACGGG GCGCACCAGC
GCGGGGCTGA AGACGACCAG CAGCGCGGCG GAACCGAGTT ATCGTTCGAA CAGCAACAGC
ACGGATGCGT TCGTGGCGAA GTTCAACGTG TATGGAACGT CTTCGGCTAC CAAGGTGATC
GTGCTCTTGC CTTTGGACGG ATCGCTAGTG ACGGCAAAGG CCGGCGTCAG CGCAACGGCT
CTCGGGAGTT CCAGCCCGGT GGCGTACATG CAGGTGTACG TGGATGGCGT GAGAAAGGCG
CAGGTTTCCG GCAGCACGAT CCTAACCGTC GTTTCGCTGG GGACGGGCCA ACACCGGATT
ACAGCGCAGG CGATCAACAA AGACAGCTCG ATTGCGAAGA GCACGGTGTA CGTCACCGCT
AAGTAG
 
Protein sequence
MPRHSSLRLL LASLCLLVSF TLPALAQGID LPLSFEPNLG QSDPAVRFLS HGKGYGIYLS 
QNTTVLQMGS DRLALQVAGG QAPSAIRGEE PLTGKVNYLR GADRSSWLRG VPTYARVRMS
SVYPGVDLVY YGNHRQLEYD FVVHPDADAK QIGFAAKDAE LRLNRDGQLT MTAGAAEVHW
HAPVAYQEID GKRHAVEAKY EIAGSMIRFH VGAYDHSHDL VIDPVMVYST YVGGNGGETG
DVGNAIAVDA AGNAYIAGVA SSTNFPVTSE AMQPSSRGND DAFVAKINPQ GTGYVYATYL
GGGGQDIAWG IAIDGAGNAY VTGQTGSGLH GQAAFPTTAG AYQRTQNANV LNNSVFVAKL
SADGTDLLYS TYLTGTNDST ASGIAVDGGG NAYVLTNTAG GFPVSGAAYQ KTAGTDQCPY
EQFADGQAQV VTKVNATGSA LVYSTYVGHG CDYGAGIAVN TAGEAYIVGH TQDSAYPVTS
GEVGSTFGGV VDGFVTRLNA SGSGIVYSTF LGGSLADFAN AVALDSSGYA YIAGGTDGDF
PTTSSAYQTT ASNNGYRKGF VTKLSPMGKA LIYSTYIRGA ANVSFSSIAV DKSHYAHVTG
YSDGSQYPAT STAVQGTCHQ GPSGCLTQAV VTKVNATGSG LLYSSYFGAS DASNNYFPGN
IGNGIAVDNN GGFYITGRTS AGLKTTSSAA EPSYRSNSNS TDAFVAKFNV YGTSSATKVI
VLLPLDGSLV TAKAGVSATA LGSSSPVAYM QVYVDGVRKA QVSGSTILTV VSLGTGQHRI
TAQAINKDSS IAKSTVYVTA K