Gene Acid345_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1501 
Symbol 
ID4069248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1827378 
End bp1830749 
Gene Length3372 bp 
Protein Length1123 aa 
Translation table11 
GC content58% 
IMG OID637983510 
ProductTonB-dependent receptor 
Protein accessionYP_590577 
Protein GI94968529 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.273267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT CTTACCTGGC GATTCTGCTC GCCACGGTAT TGGTTTTGTT GTGCGCGTTT 
CCCGCCTTAG CACAATCCAA TACAGCAAAG TTGACCGGAA CAGTCGCAGA CGCGCAGGGT
GCAGTCGTCC CCGGCGTCAG CATTACTGTC ACCAGCTCGA AAGGTCGTGC AACTACGGTC
CAGAGCAATG CTTCCGGCTT CTACACGATT GCTGCGTTGG ATCCCGATAC CTATAACGTG
GATGTGAAGC AGTCCGGATT CAAACCCATC ACGCAAAAGA TCACGCTTCA GACTGCCCAG
CAGGCGGAAC TGAACTTCAC CATGCAATTA GGTTCGACCT CAGAGACCGT TGAGGTTACC
AGCGATGTTC CGCTCGTCGA CGCCATCAGC TCGAACGTGA GCGCCGTCAT CGTCGGCCGC
CAGATTACCG AGCTCCCGCT CAACGGCAAC AACTTCACGC AGCTTGCAAC CCTCGTCCCG
GGCGTCACTC GCGGTATGCC GGCCAATCAA CAGTCTGGAG AGGGCAACCA GGCCGAAACA
TTCCGCTATG CGGGCAACGG CGGTTCCGCC ATCTCCGTAA ACGGTCTTCG TCAGCAGGCG
AACAACTTCC TGCTCGATGG TTTCGACAAC AACGAGAGCC TGGTGAACAC GATTGTGTTC
TTCCCGCCGA CCGAGGCGAT CCAGGAATTC CGCGTTGACA CCAGCGTTGC GCCGGCAGAA
TTCGGCCGCG CCGGCGGTGG CATCGTCAAC TCGTCTATCA AGTCAGGCAC CAACAGCTGG
CACGGCTCTG CATTCGACTA TCTTCGCAAC AGCGTCCTGG ATGCGAAGAC CTACGCATTG
GGTCCGAATG ACCCGAAGCC CCCCTTCCGC CGCAACCAGT TTGGCGGCGC AATTGGCGGC
GCGATCATCA AGAACAAGCT TTTCATTTTC GGTGATTATC AGGGCCTTCG GCAGGCGCAG
CCGGCCAGTC TCGACTTCGC GACTGTCCCG ACCGACATGA TGCGCAGTGG CGATTTCTCC
GAATTGTTGG GACTGCAACG CGTGGACGGA ACGGCGGTAG CGCCCATCCA AATTCGCCAT
GCGGGCGCCA ACAGTTCCGC CGATTACGTG AACTACGTTG GGAATATCAT CCCCAGCGCT
GACTTCGTTG GCGCCGGCCA AGCTTATTTG AACGCCTATC CGGAGCCGAA TGTCTCCGCC
ACGAACACAC ACTGCGGTCT GGTCTCGACC CTCGATGGCC TCTGCCTGCA GCAGAACTAC
ATGGTCCAGC GGCAGCAGAT CCAGAACTTC GATGATTTCG ACATCCGTGC CGACTATGTC
CTTCGCACCC AGGACACGAT GTTCCTGCGC TACAGCTATG GGCACGAAAC CGACGTCACG
ACGTCGCGCC TGCCTGCACT GCCCGCCGGA TTTGGTTCCG GCGATCAGTT CAGTATGCCT
CGCTCCTGGG CATTCGGTGA AACCCACACT TTCTCCCCAA ACATCCTGAA CGAGTTCCGC
TTCGGATGGA TTCGCGCGAA CCTCGGCTAT CTGCCGCCGT TCGACGACCA GGCCTTGTCG
GCAAATCTCG GAATTCCGAA CGCCAACACT TCGTCTCTGC TTGGAGGCGG CGCCCTTATC
GGTGGTTACA ACGGTCAGCT GGAATACACC GGGGACTACG GTCCGTACCA GGTTCCGGAA
AACACTTATC AACTCGCCGA CAATGTCAGC TGGATCAAGG GTCATCACAG CTTCAAGTTC
GGCGCGAACC TAATGTGGCG ACAAGTCAAC CTCTTCCGTC CCAAGGCGGG CAAGGGTTAC
TTCTTCATCT CCGGTAACGG TGGCGATGCC TTCTCGACCG GATACGAAGT CTCCGACGTT
CTCGCCGGTT TCATGAACAA CTACCAGGTT GGTCCGGCAC TTGGATTCGC TGGGACCCGC
AGCTGGGAAA ACGGCATTTT CGCTCAGGAC GATTGGCGCA TTACCAATCG GCTGACGTTG
AACCTCGGCT TCCGCTGGGA TGTGCTCACA TGGCCCACCG AAGTCCACAA CCAGATGTCG
AACTTCGACC TGAACCCCGG CAGCGGTACC TATGGACAGG TGGTTCTGCC TGGAACCCAC
GGCTACAACG ATTCGTTCGT TCCGACCGAT TGGCATAACT TCGCTCCTCG CGTCGGTTTT
GCCTACAACC TGTTTGGCAA CGGTAAGTCC GTGGTGCGCG GCGGTTATGG AATGTTCTAT
TTCGTAGACC GTGGCGGCAT CGACAACCAG ATGGCGCAGA ACGCTCCGTT TGCGGGCGTC
TCGCAATACA ACTACACCAA CGGCTATCGC TTCACGCTCG GCGGCCAGGC TCCACTGAAC
TCAGTGGATC CGACGCAAGG CGGCGCGGTC GGCATGCCGG ACAAGAACGA TTTCAACATC
GACTTCACCA ATCCGGCGAA CCTCTCGATG GTGTCGTACC TTAACCGCAA CGTACCGGCC
AGTGTGCAGG AGTGGAACCT CCAGTTCGAA CAGGAACTTG GCGTCAACAC CTCCTTCATG
CTCGCTTACG TGGGCAACAA GGGCACGCAT CTTACCTCGT ACCACGACTT CAACCGCCAG
TTCTACAACC AGGCCAATGG CGACAAGAAC TTCCCAGCCA TGGGCAGCCT CACCGTTCAC
GATACTAGCG GCAACTCAAA CTACAACTCG TTGCAGGCGC AGTTGACTCG CCGCATGACC
AAGGGCCTGC AGTTCAACGC TTCGTACACC TGGGCCCACG CAATCGACGA TTCCCAGGGC
GCATTCGACG CCAACAATGG CGTTGTCGAT TACTTCAACC TGGCGCATGA GCGCGCGAAC
TCGTTGCTCG ACTTCCGTCA ACGCTTCGTG TTCAACGCTC TGTACGAGCT GCCTTTCGGC
CACGGACGGC AGTGGGGCAA TAGCTGGAAC GGCGTCACCA ACGCGGTCCT TGGTGGATGG
CAGTTCAACC CGATCCTGAC GCTCAGCAGC GGATCACCGT TCGACCTCGG TGGAATCGGT
AACCCGCAGA CGCGTCCTGA CCTGGTCGGC CAGCTGCATC AGTTGAACAG CGTGAACGGC
ATGTGGTTCG ACACCAGCGC GTTCGCGGTT CCCGCCTCGA ACGGCACCGG CGTCTTCCTC
GCGCCTGGCA CCGCTCCTCG CAACCCGTTC ACCGGACCGG GAACCGAGAT CTTCGACTTC
TCTGTCCAGA AGACGTTCGC GATCACTGAA CGCGTCCGGA CCGAGTTCAC GTCGCAGTTC
TTCAACGCCT TCAACACCCC GCAATTCGCA CAGCCGGATG GCAACTTCTA CGATGGCAAC
TTCGGTCGTG TGACCAACAC TCGCTTGAAC AGCGAACGCC AGGTCCAGTT CGGACTTCGC
GTTCTCTTCT AA
 
Protein sequence
MKKSYLAILL ATVLVLLCAF PALAQSNTAK LTGTVADAQG AVVPGVSITV TSSKGRATTV 
QSNASGFYTI AALDPDTYNV DVKQSGFKPI TQKITLQTAQ QAELNFTMQL GSTSETVEVT
SDVPLVDAIS SNVSAVIVGR QITELPLNGN NFTQLATLVP GVTRGMPANQ QSGEGNQAET
FRYAGNGGSA ISVNGLRQQA NNFLLDGFDN NESLVNTIVF FPPTEAIQEF RVDTSVAPAE
FGRAGGGIVN SSIKSGTNSW HGSAFDYLRN SVLDAKTYAL GPNDPKPPFR RNQFGGAIGG
AIIKNKLFIF GDYQGLRQAQ PASLDFATVP TDMMRSGDFS ELLGLQRVDG TAVAPIQIRH
AGANSSADYV NYVGNIIPSA DFVGAGQAYL NAYPEPNVSA TNTHCGLVST LDGLCLQQNY
MVQRQQIQNF DDFDIRADYV LRTQDTMFLR YSYGHETDVT TSRLPALPAG FGSGDQFSMP
RSWAFGETHT FSPNILNEFR FGWIRANLGY LPPFDDQALS ANLGIPNANT SSLLGGGALI
GGYNGQLEYT GDYGPYQVPE NTYQLADNVS WIKGHHSFKF GANLMWRQVN LFRPKAGKGY
FFISGNGGDA FSTGYEVSDV LAGFMNNYQV GPALGFAGTR SWENGIFAQD DWRITNRLTL
NLGFRWDVLT WPTEVHNQMS NFDLNPGSGT YGQVVLPGTH GYNDSFVPTD WHNFAPRVGF
AYNLFGNGKS VVRGGYGMFY FVDRGGIDNQ MAQNAPFAGV SQYNYTNGYR FTLGGQAPLN
SVDPTQGGAV GMPDKNDFNI DFTNPANLSM VSYLNRNVPA SVQEWNLQFE QELGVNTSFM
LAYVGNKGTH LTSYHDFNRQ FYNQANGDKN FPAMGSLTVH DTSGNSNYNS LQAQLTRRMT
KGLQFNASYT WAHAIDDSQG AFDANNGVVD YFNLAHERAN SLLDFRQRFV FNALYELPFG
HGRQWGNSWN GVTNAVLGGW QFNPILTLSS GSPFDLGGIG NPQTRPDLVG QLHQLNSVNG
MWFDTSAFAV PASNGTGVFL APGTAPRNPF TGPGTEIFDF SVQKTFAITE RVRTEFTSQF
FNAFNTPQFA QPDGNFYDGN FGRVTNTRLN SERQVQFGLR VLF