Gene Acid345_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3895 
Symbol 
ID4072230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4607876 
End bp4611325 
Gene Length3450 bp 
Protein Length1149 aa 
Translation table11 
GC content57% 
IMG OID637985919 
ProductTonB-dependent receptor 
Protein accessionYP_592969 
Protein GI94970921 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0561409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGAT CAAGAGCAAG GTACACACTT CTGTGCGCGC TGCTATTTCT GTGTTCCGCC 
ATGTTGTTTG GTCAGGCGGA GACAGGTTTG ATCACAGGCA CCGTCGTGGA TGTTTCCGGC
GCAGTTGTCG GCGGAGCGAC GGTGACAGTG ACGGACGTGA ATACGGGCGC GCAGCGAACC
GCTACCACCA ATAACGATGG TTCCTACACG GTTTCCAACC TCAAACCGTC GATGTACGAA
GTCGTGATCG ACAAGCAAGG CTTCACCAAA TACACCCGCA GGATCGCGGT GACCGTCGGA
TCAAGAAATG AACTTTCGGC GCAGATGAGT GTGATGGGCG GCGGTACTAC CGTCGAAGTG
ACCGCGGAAT CAGGCGGCGC CGCGGTGAAC ACCGAAACCC AGACGCTTTC ATCGGTCGTG
AGCGGCGCGC AGATCACCGA ACTCCCAACT CTCACCCGCA ATCCATACGA CCTCGTTGCC
ACCGCCGGCA ACGTGACAGA AGACACCACC GGTACCATGC GCGGTGCAGG CTTCTCGATC
AACGGTCAGC GCTCAGCATC AACCGATGTA CTGTTAGACG GTGGTGAAAA TGTCGATATG
TTCACCGCTT CCGTCGGGCA GCAGGTTCCG CTCGATTCCG TGCAGGAATT TCGCGTTGTC
ACCAGCAACT TCACCGCAGA ATATGGCCGC GCGGGCGGCG GCGTCCTGAA CGTCGCTACC
AAATCTGGCG CCAATGCCTT CCATGGCACC GCGTATGAGT TCAACCGCAT ATCTGCGCTG
GCGGCGAACA CCTGGGAGAA CGACACCAAC GATATCCCCA AGTCCACGTT CACGCGCAAT
CAGTTCGGAT ATTCGGTGGG TGGGCCAATC ATCAAGAACA AACTGTTCTT CTTCTCCAAC
ACCGAATGGA TCCGGGTCCG CAGCAGTTCG AACCAGATCG TCTCGATCAT TGATCCTGCG
ATGTTCCCGA ACCTGGCTCC GAACTCGGTA GCTGCGCTGT CGTATGCTGA CGTGCGCTCG
AACGCGACTT TGCTCGGCTC TACCTCATGC GCAGCCGATG CTCTGTGCTC CCCTCTGCTC
GCGAGCAATG GCGGACCGTT GCCCAATGGC TCGCCGTTCA CCCAACAGTT GTCGTACACG
GCGCCCGCAG AGGCAGGTGG CGGCCTTCCG GAAAACACCT GGATGACCGT CAACCGCTTC
GACTACAACA TGACCGACAA AACCACCTTC TTCGGCCGCT ACGCTGGATA CCACGAAGAG
GATTTCAACG GGACCGTCAA CAGCAGCCCG TACTCCGAGG GATTCGATAC CGGCCAGAAC
ATCTTCAACA ACAACGTGCT TATCAACATG ACGCACGTGT TCACTCCCAA CATCGTGAGC
CAGTCGAAGT TTGATTTCAA CCGACTGAAC TTGCTGCAAC CGCTCGGAAC CCAGCCCGTG
GGGCCAACGA TGTACGTCTC CTCGCAAGGC GTGCCCACCT CCGGCGGATA CTCGCTGATT
TTCCCCGGAT ATAGCGAATT CACGCCAGGT AACTCAATTC CATTCGGCGG CCCGCAGAAC
CTCTATCAAT TCTTCCAGGA TGTCTCATGG ACGAAAGGTC GTCACCAGCT GCGTTTTGGC
GGACAGTACA TTCACATTCG CGATAACCGC ACCTTCGGCG CTTATGAAAA CGCTGTGCAG
TATCTCAGCA CGGGCGCACC CGTGACCGCG GGTGGAAATA CTTATCGCGG CAACACCGCC
GGCATTTACA ACCTAGTCGC CGGCAACATC GCGAACATGC AAGTCGCGGT TGACCCTCGC
GGAGCGTTCC CCGGAGACAA CATCTCTCTT CCGGCAGGCG CACCGAGCTT CTCGCGTAAC
AATCGCTTCA ACGACGGCGC GTTCTACCTC CAGGATTCCT GGAAAGTAAC CAGCCGCTTG
ACGCTCAACT ACGGCGTGCG CTGGGAGTAC TACGGTGTGC AGCACAATGC CGATCCCTCG
CTCGATTCCA ACTTCTACGA AGGGTCCGGT GCGACGCTGC CGATCCAGGT TGAGAACGGG
ACCGTGCAGA TTGCCAATCA GAGCCCGGTC GGCTCTCTCT GGGAGCCTTC CAAACACAAC
TGGGGTCCGC GCCTCGGCTT CGCCTGGGAC GTCTTCGGTG ATGGCAAAAC GGCGATACGC
GGTGGTTGGG GCATGAGCTA CGAGCGGAAC TTTGGCAACG TGACCTTCAA TGTCATTCAG
AACCCACCGA ATTACGCAGT TCTGAATGCA GTGAACACGC CCGTGACGCT CGACAACTTC
GGGCCTCTCT CCGGCAGCAG CGGCAGCGTA GTTCTTCCTC CAACCACTCT GCGTGCCGTG
CAGCCCAACA TTGACAACGC GTACACCGAG TTCCGCAGCC TATCGCTGGA ACGCGAGGTA
CTAAAAAATA GCCTGGTTGC CTTTGAATAC AGCGGCTCGA ACGGCGTTCA CCTGTATGAC
ATCGGCAACA CGAACGTGTT TTTCCCGGGG TATGCTGGCT ACGGTGATTA CTTCGATCCT
GCTACCTATC ACTCTGGAGT TGCGTGTTAT CCCGGATGCC GCCTGAACCA GCAGTATTCG
AACATCAACA GCCGTGGCAG CCGCGGATTC TCGCGCTACA ACGGCCTGAA CACACGCTTC
ACCACCAACA ATCTCTTCAA CAAAGGTCTG CAGCTCAACT TCAACTGGAC GTGGTCACAC
TCGATTGACA ACTTGAGCTC AACCTTTAGC GAAGGCAACA ACGGCGCGTT CCAACTCGGT
TACGAAAACT ACTATGCTCC GCAACTCGAC ACCGGCAATT CCGAGTTCGA CGTCCGTCAC
CGCATCGCGG TTAGCGCAGT CTGGGACCTG CCCTGGATGA AGAACGCGAG CAATGCGTTC
GTTCGCCAGG CGCTCGGTGG GTGGAGCTTT TCTCCCCTGA TCACCTACCA TACCGGCTAC
CCGTTTTCGG TCTATGACTG CACCAACGGA ATCAGCCAGT GTCCGCGCTA CTTGCCGACG
GGTGGTGAAC GCGACGGCTT CGCCAATTCA TCAACCTACG CTGGTGGAGG CGTCTTCAAT
TACCTGAATG CCGGCTCGCT CGTCGCAGCT CCTGGCTTCG GAATGCCGGG TGTCGGCGGT
TCGAGCCAGG TTCCGGAAGC GCCTTGCCAG GGAGCGATCG GATGCAACTG GGCCGTCGGT
CCGCGCAACA TGTATACCGG CCCCGGCAAT CACCAGTTCA ACGCGGTTAT CGGGAAGACG
TTCAAACTCA CCGAACGGTT CAACTTGCAG TTCCGCGGCG AGATGTACAA CGTCTTCAAC
AACCACAACT ACTTCCTGCT CACGTCGAAC GCCGACGTCA GCAGCGGTGC GCTGGGAAGC
CCGTTCTTCG TACAAGCTGT TAAGGGCGGC TTCGGCAATC CGACGGACGA ACGTCGTAAT
GTTCAGTTCG GCTTGAAGTT GATTTTCTAA
 
Protein sequence
MIGSRARYTL LCALLFLCSA MLFGQAETGL ITGTVVDVSG AVVGGATVTV TDVNTGAQRT 
ATTNNDGSYT VSNLKPSMYE VVIDKQGFTK YTRRIAVTVG SRNELSAQMS VMGGGTTVEV
TAESGGAAVN TETQTLSSVV SGAQITELPT LTRNPYDLVA TAGNVTEDTT GTMRGAGFSI
NGQRSASTDV LLDGGENVDM FTASVGQQVP LDSVQEFRVV TSNFTAEYGR AGGGVLNVAT
KSGANAFHGT AYEFNRISAL AANTWENDTN DIPKSTFTRN QFGYSVGGPI IKNKLFFFSN
TEWIRVRSSS NQIVSIIDPA MFPNLAPNSV AALSYADVRS NATLLGSTSC AADALCSPLL
ASNGGPLPNG SPFTQQLSYT APAEAGGGLP ENTWMTVNRF DYNMTDKTTF FGRYAGYHEE
DFNGTVNSSP YSEGFDTGQN IFNNNVLINM THVFTPNIVS QSKFDFNRLN LLQPLGTQPV
GPTMYVSSQG VPTSGGYSLI FPGYSEFTPG NSIPFGGPQN LYQFFQDVSW TKGRHQLRFG
GQYIHIRDNR TFGAYENAVQ YLSTGAPVTA GGNTYRGNTA GIYNLVAGNI ANMQVAVDPR
GAFPGDNISL PAGAPSFSRN NRFNDGAFYL QDSWKVTSRL TLNYGVRWEY YGVQHNADPS
LDSNFYEGSG ATLPIQVENG TVQIANQSPV GSLWEPSKHN WGPRLGFAWD VFGDGKTAIR
GGWGMSYERN FGNVTFNVIQ NPPNYAVLNA VNTPVTLDNF GPLSGSSGSV VLPPTTLRAV
QPNIDNAYTE FRSLSLEREV LKNSLVAFEY SGSNGVHLYD IGNTNVFFPG YAGYGDYFDP
ATYHSGVACY PGCRLNQQYS NINSRGSRGF SRYNGLNTRF TTNNLFNKGL QLNFNWTWSH
SIDNLSSTFS EGNNGAFQLG YENYYAPQLD TGNSEFDVRH RIAVSAVWDL PWMKNASNAF
VRQALGGWSF SPLITYHTGY PFSVYDCTNG ISQCPRYLPT GGERDGFANS STYAGGGVFN
YLNAGSLVAA PGFGMPGVGG SSQVPEAPCQ GAIGCNWAVG PRNMYTGPGN HQFNAVIGKT
FKLTERFNLQ FRGEMYNVFN NHNYFLLTSN ADVSSGALGS PFFVQAVKGG FGNPTDERRN
VQFGLKLIF