Gene Acid345_3469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3469 
Symbol 
ID4069045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4090697 
End bp4093030 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content62% 
IMG OID637985491 
ProductHep_Hag 
Protein accessionYP_592544 
Protein GI94970496 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.259932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0166964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC GTTGGCTATT CGCAGCTGTC CTCACTTGTG TTTCGGTGTT CGCCCAACAA 
GAATCTGCAG TTGTGGCCTC GCCCACAGCG GCAGTCGTTC CGCGACTGAT CCGCTTCTCC
GGCCAACTCA CCGAGAGCAA CAAGACAGTC GGCATCACGT TCACCCTGCA TGCATCGCAA
AAAGACGATC GCACTCTCTG GACGGAAACC CAGAACGTGA AAGTGGACGC AACCGGCAAA
TACACGGTGC TGCTCGGCGC GACGAAGGCC GACGGGATAC CGATGGAAAT GTTCGCGTCA
GGCGAAGCGC AATGGCTTGG GATCCGCATC GAAGGGCAGA AAGAACAGTC TCGCGTCCTG
CTCGTCAGCG TGCCTTACGC GTTGAAGGCG GCAGAAGCCG AAACCCTCGC CGGCCACAGC
GCCACCGAGT TCGTCACTTC CGAAAAACTA AGTGTCGCGG TGCATGAAGA GCTAAATGGA
CAGAGCGCCA CATCTACGCC CGGCACTACG AAAACGGCAG ATCCCAAAGC CAGGACCAGC
GTGGTCAGTG CCGGAGCAAC AAACTTCAGC GCCAACACCG CAGACCAGGT CGTAAAAGTC
ACGCAGGTCG GTACCGGTTC GGCGCTGATC GCGACCAGTC CGCAGGCCAA TGCCGTGTTG
GGCAGCGCAA CCACGGCCAC TGCCTACGGC GTCACTGGCT CCAACACCGC CAGCACGGGC
GTCGCGGTCG GCGTACGCGG CACCACCGTC GCTGACAACG GCATTTCCAT CTACGGCACG
GCAAGTGGCA CCACCGGCAC CGCCACCGGC GTTAAAGGCA TTACCGGCGC GCCGAACGGA
TACGGCGTCT TCGGACAAAA CACTTCCACC ACCGGCCCGG CCATCGGTTT CCGCGGCGCG
ACAGCTTCGA GCAGCGGCAC TGCAATCTAC GGAACGTCCA CCGCGGGCAG CGGCTTGACC
ATCGGCCTGC GTACCTCGGT TGCTAGCGCG GATGGCACCT CCGCCGTCCT GCAAAATACC
GCTGGCGGCA AGATCCTCAG CGGGCAATCC GGCGGCGCTC TCACCGAAGT CTTCTCCGTC
TCCGGCACCG GCGACATCGC AGCAACTGGC CTCACCACAA CCGGCAACAT CACTGCCGGC
GGAGGTTTCC ACGGCAACGG TGCAACCCTC AGCAACGTGG TCGTCCTGGA TCCCGCCGCC
GACCAGGTTG CCGACACCGG GGCAGCCGAC ACTTCAGGGG CCGCGATGAT CGCGCTCAAC
GGCGTGACAC ATAACGGCTC TACCGCCAAC AACGCATACT TCCACGTCCA GCAGGACAGC
GGCGCACTCT TCTCGGGCAC GCTTGGCATC GGCAAAATTC CTGCGACCGG CGCCGGCTAT
CGCATGATGT GGCATCCCTA CAAGGCCGCA TTCCGAGTCG GCGGAGTGAC TGCCACAGAC
TGGGACGACC CGAATATCGG CTTTTACTCG TTCGCTGGAG GTCACGACAC CCTCGCAAGC
GCCTTCGGTT CGTTTGCGTT CGGGGACGGC ACGCTCGTCT CAGGCACCGA TGCAGTGGGT
TTTGGGAATG CGAACCACGT AACCGGAACC ATTGGCTTAG CCATTGGGGC CAGCAACTAT
GCCAGTGGTT TCGGCTCCAC CGCTATCGGA TATACCGAGT GGTCTCAAGG CCAGGGATCC
GTGGCGATCG GTTATCGCAC GGGTGCCTGC AACGACTACG TCGTCGCGTT GGGCCACCGG
GCAACCAATG ACCACGCCCA GGCCGATGCC GTCGCTCCAC CTTGCTCAAC CGCCGGAACA
CCCAGCGGAT ACACCGGCAC CTTCATCTGG GGCGACGAAA GCACGACCAA CAACGTTGCG
AACCAGGCTA ACAACGAATT CCGTATTCGC GCCTCGGGCG GGGTCCGCCT GCGAACTTCC
GTCAATGCCA GCAGCGCGCT CGGCACCAAC AGCAACACCG GATGCGACCT CCCTGCCGGG
TCCGGCGTCT TCAGCTGCGC TTCGTCGCGC ACCGTAAAGG AGAACTTCGC GTTCCTCAGG
GGCACCGACG TTCTCGCTCG CCTGCGCGCC ATGCCCGTCT CCACGTGGAA CTACAAAGCT
GAAGGCGCAG AAGTCCGCCA CATGGGCCCC GTCGCCGAAG ACTTCCGCGC CGCGTTCGGA
CTCGGTGAGA ACGAAACTAC CGTCGGCGTC AACGACCTCG CCGGCGTAAG CCTGGCCGCA
GCCAAAGCCC TCGACGAAGA AAACGCGCGC CTGAAGAAAG AACTCCGCGA ACAAAAGGCC
CTGATCAAGG CCCTTAACGC GCGCCTGACC AAGCTCGAAC AATCCAGGAA ATAA
 
Protein sequence
MKLRWLFAAV LTCVSVFAQQ ESAVVASPTA AVVPRLIRFS GQLTESNKTV GITFTLHASQ 
KDDRTLWTET QNVKVDATGK YTVLLGATKA DGIPMEMFAS GEAQWLGIRI EGQKEQSRVL
LVSVPYALKA AEAETLAGHS ATEFVTSEKL SVAVHEELNG QSATSTPGTT KTADPKARTS
VVSAGATNFS ANTADQVVKV TQVGTGSALI ATSPQANAVL GSATTATAYG VTGSNTASTG
VAVGVRGTTV ADNGISIYGT ASGTTGTATG VKGITGAPNG YGVFGQNTST TGPAIGFRGA
TASSSGTAIY GTSTAGSGLT IGLRTSVASA DGTSAVLQNT AGGKILSGQS GGALTEVFSV
SGTGDIAATG LTTTGNITAG GGFHGNGATL SNVVVLDPAA DQVADTGAAD TSGAAMIALN
GVTHNGSTAN NAYFHVQQDS GALFSGTLGI GKIPATGAGY RMMWHPYKAA FRVGGVTATD
WDDPNIGFYS FAGGHDTLAS AFGSFAFGDG TLVSGTDAVG FGNANHVTGT IGLAIGASNY
ASGFGSTAIG YTEWSQGQGS VAIGYRTGAC NDYVVALGHR ATNDHAQADA VAPPCSTAGT
PSGYTGTFIW GDESTTNNVA NQANNEFRIR ASGGVRLRTS VNASSALGTN SNTGCDLPAG
SGVFSCASSR TVKENFAFLR GTDVLARLRA MPVSTWNYKA EGAEVRHMGP VAEDFRAAFG
LGENETTVGV NDLAGVSLAA AKALDEENAR LKKELREQKA LIKALNARLT KLEQSRK