Gene Acid345_1201 details


Gene Information

Locus tag: Acid345_1201
Symbol:
ID: 4072613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1483739
End bp: 1485772
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 58%
IMG OID: 637983211
Product: TPR repeat-containing protein
Protein accession: YP_590278
Protein GI: 94968230
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase
TIGRFAM ID:
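
The coordinates and lengths listed for this gene agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the last codon of the CDS is taken to be a stop codon. Both are assumptions; the record does not state its coordinate convention. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1483739, 1485772)
print(length, protein_length_aa(length))  # 2034 677
```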


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0532149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGCAAA CTCCGCGCCC TTCCCGGCGC GTTTTCGTTG TCCTCGGTGT TGTCGTCCTT 
CTCGGCGTCC TCTACCTGCA GTTGGGCCTT TCTGCTAACG CAAACTCCAT CACCTGGGAC
GAAGACGACC ACATCTTCGC CGGCTACATG ATGTGGAAGA CCGGCGACTT CGGGTTGAAT
CCCGAGCACC CGCCGCTGGT AAAGCTCGTC GCAACCCTGC CGATGCTTAC GATGCATCTG
AACCAGCCTC CCCTGCAGCA GCGGTCGTTC AAGATCGAAG CCTACCTCGA CGGTCGCGAC
ATGCTCTTCA AGAGCGGCAA CGATGCCAAC GCGATCCTGC TGCGCGTACG ACTGGCAGCC
GCACTCTTCA CGGTTCTTTT GGGACTGCTG GTCTTTCTGG TCGGGCAAGA GATGTTCGGC
ACCGTTGCGG GCTTCATTGG ACTGGCCTTG CTAGTGTTCG ATCCCAATAT CCTCGCCCAC
GGCATGGAGG TCGCCACCGA TACCGGCATT TCCTGTTTCA TGCTGGCCAC GGTGTACGCG
TTCTATCGCT ATAACAAAAA GCCGACGATT GGCAGACTGC TGCTGGTTGG GCTCGCCCTC
GGTTGTACGC TGGGCGTGAA GCATACCGGC ATCCTCGTTC CGCCGATGGT TATCCTGCTC
GCAGCCGTGG AACTGTTCTT AGGCCCCCGC GCGGAAGAAG GGGGTGTCGA CCACGCCGAG
ACTCGCAGCC ACATGGCGAA GCGCCTGGCG GTTGCCGTGC TCGCGATCTT CGCGATCACC
ATCTTCGTTG TGTGGGCGAT GTACGGCTTC CGCTACGCGG CCCGCCCCGA CGGCATGGTG
CTGAATCCCT CGCTGAATGA GGCCATTGCG AACATCTCGC GGCCGCACGA ACAGTGGGGC
ATGCGCATGA TCGCGCACTA CAAGCTACTA CCCGAGTCCT ACATCGCCGG CGTTACCGAC
ATCCGTAATA TGTCCGACTT CTACACCAGT TACGTGCTCG GCAAGAGTTA TAGGCACGGC
GTCTGGTTCT ACTTTCCGTT CGCGTTCGTC ATCAAGTCCA CTCTCGCGCT GCTCGCGCTG
TGCGTGATCG CATTATTCGC AATCGCCACG AGAAAACTGA AGCACGGCCG TGAACTGCTC
TTCCTCGCAG TACCTGCCGG GTTCTACCTG TTCATTGCGA TGTTCTCGCG CATGAATATC
GGGGTGCGTC ACATCCTGCC GGTATATGCG TTTCTTTTCG TAATGATTGG GGGCGCCTGT
GCGGCGCTCA TCCAGCGCAA TCGCATTTGG GTTTATGTTG TCAGTTTCCT GCTCGCCTAT
CAGGCTTTCG TCGAGGTCCG CATCTACCCC GCGTACATGG CCTACGCCAA CGAATTATGG
GGAGGTCCGA CCCAGACATG GAAACTGCTT GCCGATTCCA ACGTCGATTG GGCACAACAA
CTGAAGCGCA CGAAGAAGTA CATCGACCAG AACAACATCA AAGATTGCTG GATGATCTAC
TTCGCCTATC CGGTAGTCGA TTACCACGAC TACGGCATCC CATGCCGTCC GCTGCCGACC
ATGGACCAGA TGTGGATTGG CGACGAACTG GAGGTGCCCA CGGAAATCGA TGGCCCACTG
TTCTTAAGCG CCGGAGACCT CACCGGCTTT GAGTTCGGCG AAGGCCATAT GAATCCCTAC
GAGCAGTTCA AGAGTATGAA GCCTGTGGCG AATATCGACT ACGGCATCTA TGTATACGAG
GGCCATTTCA ACATCGCGCA GGCCGCGGCC GTCAGCCATA ACTTCAAAGC CGGGCATCGG
CTGGAAGCCA GGCAATATGA TGCGGCACTC GCGGAAGCGC AGCAGGCGGT ACAACTCGAT
CCGCAGATGG CGGCCGCGCA TTTGAGCGTT GCCCACGCCC TCGACGGGCT CGGTCGCAAG
GACGAAGCAA AAGCCGAATA TCAGAAGGCA CTGGAGTTGG CGCAGACCAT TCAGCCGACG
TTCCAGGAAG GCACCGCCAA CGCCGCTAAG GAGAAGCTCG CAAGTTTGAA ATGA
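
A GC content figure like the one in the Gene Information fields can be recomputed from the sequence text. The sketch below is one plausible way to do it, not necessarily how the database derives its value. It assumes the sequence is pasted as shown here, in space-separated blocks of ten, and the function name is hypothetical. The call at the end uses only the first 20 bases of the gene, so its result describes that fragment rather than the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters; this drops spaces and line breaks.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, in the record's block layout.
print(gc_percent("GTGCCGCAAA CTCCGCGCCC"))  # 75.0
```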
 
Protein sequence
MPQTPRPSRR VFVVLGVVVL LGVLYLQLGL SANANSITWD EDDHIFAGYM MWKTGDFGLN 
PEHPPLVKLV ATLPMLTMHL NQPPLQQRSF KIEAYLDGRD MLFKSGNDAN AILLRVRLAA
ALFTVLLGLL VFLVGQEMFG TVAGFIGLAL LVFDPNILAH GMEVATDTGI SCFMLATVYA
FYRYNKKPTI GRLLLVGLAL GCTLGVKHTG ILVPPMVILL AAVELFLGPR AEEGGVDHAE
TRSHMAKRLA VAVLAIFAIT IFVVWAMYGF RYAARPDGMV LNPSLNEAIA NISRPHEQWG
MRMIAHYKLL PESYIAGVTD IRNMSDFYTS YVLGKSYRHG VWFYFPFAFV IKSTLALLAL
CVIALFAIAT RKLKHGRELL FLAVPAGFYL FIAMFSRMNI GVRHILPVYA FLFVMIGGAC
AALIQRNRIW VYVVSFLLAY QAFVEVRIYP AYMAYANELW GGPTQTWKLL ADSNVDWAQQ
LKRTKKYIDQ NNIKDCWMIY FAYPVVDYHD YGIPCRPLPT MDQMWIGDEL EVPTEIDGPL
FLSAGDLTGF EFGEGHMNPY EQFKSMKPVA NIDYGIYVYE GHFNIAQAAA VSHNFKAGHR
LEARQYDAAL AEAQQAVQLD PQMAAAHLSV AHALDGLGRK DEAKAEYQKA LELAQTIQPT
FQEGTANAAK EKLASLK
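
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the listed initiation codons and is read as methionine at the start of a CDS. The Python sketch below illustrates that rule with a hand-built codon table. The function name is hypothetical and the code is an illustration only, not the pipeline that produced this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS fragment given as text, ignoring whitespace."""
    dna = "".join(seq.upper().split())
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon in first position is read as Met.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a single trailing stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGCCGCAAA CTCCGCGCCC TTCCCGGCGC GTTTTCGTTG TCCTCGGTGT TGTCGTCCTT"
print(translate_cds(first_line))  # MPQTPRPSRRVFVVLGVVVL
```

The printed residues match the first 20 residues of the protein sequence. Passing the whole gene sequence to the same function would be the natural way to check the full 677-residue translation.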