Gene Acid345_4728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4728 
Symbol 
ID4070666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5590047 
End bp5592203 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content59% 
IMG OID637986772 
ProductTPR repeat-containing protein 
Protein accessionYP_593801 
Protein GI94971753 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.346608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACCA AGATCGGCCG CAACGACCGT TGTCCGTGCG GCAGCGGGAA AAAATACAAG 
CTATGCTGCC TGGTGCGTCC GTCGAACGCA GCACGGCTCA TGATGGCACG CGAACACCAC
GAGGCCGGAC GGCTGCAACC CGCAGCAAAG ATCTATGAGC AGGTCTTACG GGGCGATCCG
AATAACGTCG AGGCTCTTCA TTCGCTTAGC ATCCTCGCCA GCCAGATCGG CGAGACGGCT
ACGGCAGAGC GCCTGATGCG GCAGGTTCTC AGTTTGCAAC CGGAACACGT CGGCGCACTC
AGCAATTTAG GGATCACGTT GCAATCCCAA GGACGCCAAG AAGATGCGAT TGCCTGCTAC
GAGAAGGTAA TCGCGCTCCG CCCTCACCAC GCCGAAGCGC ATAACAACCT TGGCAATCTG
CGCTTGGCGC AAGGCGATTT GGAGCAGGCG ATTGCGAGTT ACCAGCGCGC GCTGGACTTG
AAGCCTGACT ACGCGGACGC GCATTACAAC TTGGGAAATG CCTATCAACG TCGCGGAAAT
TGGACGCAAG CAAGAGAAAG TTATAGACGT GCCGTGGCGT CACGTCCCGA ATTTCCAGAA
GCACAAAACA ATCTCGGAGT AGTGCTGCGG GAGATGGGCG AGACGTCCGC GGCGATTGAG
GCCTTCGAAC GGGCAATTGC GCTTCGCGCG GAATATGCAG ACCCGCTCAA CAATCTCGGT
GTCGCGCTCC AGGAACAGGG CCGAATGTCG GCCGCGGTCG AACACTATCA CCAAGCGATC
GCGTTGCGTC CTGCGGATGT GGAAGCGCAT TTCAATCTCG GCAGCGCACT ACAGGAACTC
CACCGTACCG ACGAGGCAAT CGCCGCCTAT CAAAGTGCGC TGGAGATTCA ACCTGGCTAC
TTGCCCGCGT ACAGCAATTT GCTGCTGCTT TACGCTTCAA CAGGTTGCGT TTCACCCGCG
GAAGAGCTCG CGTTCGCACT CGGCTGGGAA CGGGCAGCCC TCACCGAGGA GGAACGCGCG
GAAGCCAGAA GTAGGAGGTT CGTCCGGACA CATCTCGCAG GAAGAAAGCT GCGGATAGGC
ATCGTATCGG CGGAGCTGGG CGAACACGCC GTTGCCGATT TTCTTGAGCC GCTGCTGAGT
GAGATTGACC GCAGCCAGTT CGAATTGCTC TTGTTCCCGA CTCGGCTGCG CGATGGCGCA
CGCACCCAGC GTCTGCATGC GCTTGGCGAT AAGGTCATTT CGCTGGCGCA GGTTCCGGAT
GCTGCGGCAG CAGAGGTTAT ACGCAAAGAA GGCGTAGACG TGCTGATCGA TACCACCGGG
CATACTCGCG GCTGTCGCCT CGGAATCTTC GCGCATCGGG CGGCGCCGGT GCAGATGACG
TGGATCGGCT ACTGGAGCAC GACAGGACTC ACCGAAGTGG ATTGGGTGCT CGCGGATGAC
AAGCTGCCGG CCAGCTTCGA TGCTCATTTC TGCGAAGGCA TCTGGCGAGT ACCTCGGTTG
CCCTTGGTGT ATCGCGGCGA CACTGCTCTG CCTCAGAGCG CATGGACACC GAGCGCAGAC
GGCACGCTTT GGTTCGGAAG TCTTAACCGG TACTCGAAGA TCGGCCAGGA GAGCCTCGAT
CTTTGGGCGA AGGTGATGGA AGCAGTACCG AAGTCGAAAC TCCTTCTCGA GGATCGGACC
GCAGATGATA CTGATGCGCA TCAGCGCATA TCGGCGGAAC TCGCGACCCA CGGTATCGGC
GCCGATCGAA TCGAGTTCGA GCCATACATC CCCGGACACG AGCGCCATAT GCGCCTCTAC
GACCGCGTTG ACGTCGCACT CGACACCATC CCGCTCAATA GCGGTACGAC GGCTTGCGAC
GCTCTCTGGA TGGGGGTGCC TCTGGTTGCG ATGGAGGGGA ACCGTACCGC ATCACGCATT
GCGGCAGGAT TTCTGCGCGC CATCGGCCGT ACAGAGTGGA TTGCGGATAG CGAGCAGAAC
TACATTTCGA AGGTTGTCGA ACTGTCGAAT AATGTCGAAC TGCGCAAGCA ATTGCGCGGC
TCGCAGCGTC AACGGATGGT CGAAAGTTCA TTGTGCGATG CCCGCGGACT GGCGCGCGAA
CTTGAGCAGA CGTTCGTGCA GATGTTCGAC CGCTGGACGG CCGCGCAGAC GTCTTGA
 
Protein sequence
MTTKIGRNDR CPCGSGKKYK LCCLVRPSNA ARLMMAREHH EAGRLQPAAK IYEQVLRGDP 
NNVEALHSLS ILASQIGETA TAERLMRQVL SLQPEHVGAL SNLGITLQSQ GRQEDAIACY
EKVIALRPHH AEAHNNLGNL RLAQGDLEQA IASYQRALDL KPDYADAHYN LGNAYQRRGN
WTQARESYRR AVASRPEFPE AQNNLGVVLR EMGETSAAIE AFERAIALRA EYADPLNNLG
VALQEQGRMS AAVEHYHQAI ALRPADVEAH FNLGSALQEL HRTDEAIAAY QSALEIQPGY
LPAYSNLLLL YASTGCVSPA EELAFALGWE RAALTEEERA EARSRRFVRT HLAGRKLRIG
IVSAELGEHA VADFLEPLLS EIDRSQFELL LFPTRLRDGA RTQRLHALGD KVISLAQVPD
AAAAEVIRKE GVDVLIDTTG HTRGCRLGIF AHRAAPVQMT WIGYWSTTGL TEVDWVLADD
KLPASFDAHF CEGIWRVPRL PLVYRGDTAL PQSAWTPSAD GTLWFGSLNR YSKIGQESLD
LWAKVMEAVP KSKLLLEDRT ADDTDAHQRI SAELATHGIG ADRIEFEPYI PGHERHMRLY
DRVDVALDTI PLNSGTTACD ALWMGVPLVA MEGNRTASRI AAGFLRAIGR TEWIADSEQN
YISKVVELSN NVELRKQLRG SQRQRMVESS LCDARGLARE LEQTFVQMFD RWTAAQTS