Gene Acid345_0736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0736 
Symbol 
ID4069078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp904350 
End bp906008 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content59% 
IMG OID637982742 
ProductTPR repeat-containing protein 
Protein accessionYP_589815 
Protein GI94967767 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.325309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.93082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCA CAGCGCAAGC CGTTGTAACC GCGCCCATCA CCGCCAAAAC TTGGGTGATG 
CGGATCTTTT GCGCGGTGTT CGGAGCTGCG CTGATTCTCG GGTGCCTGGA AGGCGCTCTG
CGGGTTTTCG ACGTTGGATT TCCGACTTCG TTGACCGTTC CCTGCACCAT GGAACAGCAA
CCGGCGGCTT GCTACAACCT GTTCTTCACG GCGCCGTATT TTCCTCCAGG CATCATTCAC
ACGCCGACTC TATTCGCGGT TCCCGCAGTG AAAGCGCAGA ACACGTACCG AATTTTCGTG
CTCGGCGAAT CGGCGGCAAT GGGAGACCCT GATCCGGCGT ATGGGTTCAG CCGCTATCTC
GAAGTCATGT TGCGCAATCG GTATCCGCAG ATGAGGTTCG AGGTAATGAA CACGGGCACC
GTTGCCATCA ATTCGCATGT TGGATTGCCG ATCGCGCGCG AGATTGCGAA GCTCAAACCC
GACGTGGTGA TCATCTACTC GGGGAACAAT GAGGTCGTGG GGCCGTATGG CGCGGGGACG
GCGTTTGCGG CGTCTGCCAT GGAATTACCG GCGATTCGAA GCAGCATCTG GTACCACACC
ACGCGCACGG GACAACTGCT GACCAAGCTC GGGATGCAGA AGTTGGAATG GCGCGGCATG
GAGATGTTCC TCGACAAGCA GGTGCCGCAG TCGTCGCCTC TAATGCCTTA TGTTTACGCG
AACTTCGAAG CCAACCTACG CGACACGATC GGGGTCTTGC GCGGAGCGGG AGCGACAGCG
ATTGTCTCAA CCGTAGCGAC GAACTTGCGC GACTGCGCAC CTTTCTCTTC GCTGCATCGG
GCAGGGTTGA GCAAAGAAGC GTTGCAGCGG TGGGACACTC TGGTGAATGA GGGCGCGAAG
TTGGAAGAGG CCGGTGCTCA TTCTGAAGCA CTGAAGCTCT ACGCGCAAAC TCTTGCGATC
GACGACGAGT ATGCGGAATT AGAGTTTCGA ATTGCGAGAG TGCAACTGGC GCTCGGCAAG
CGCGAAGAGG CGCTCAAGCA CTTCGAACGC GCCCGCGACC TTGATACCCT GCGCTTCCGC
GCCGATAGCC GAATTAACGC GATCAATCGC AGCACGGCGG AATCCGGCGG CGCGGAGTTG
GTGGACGCAG AACAACTTCT GTATGCGAAC GCCGTTGATG GCATCACCGG AGGCGATCTC
ATCTATGAAC ACGTTCATTT GACGCCGACA GGAAATTACC TGCTCGCGCG CGCGATGTTT
CTGAAGATTG CCGGCAAGCT ATCGCCGACG GCGGGCGAAG CCGACGTGCC GTCAGAGTCT
GAATGCGAGG AATGGCTCGC GCTTACCGGA CACGATCGGA TCCGAATCGC GCACGAGATG
GCGGAACGGT TGCAGAAGCC GCCGTTCACG AACCAATCGA ACCACTCCGA GCAGCTGCTT
CGAATTTCAA TGCAGGCGCA GCAGGCTGAC GAGAGCCCGC AGGACACGGC AGCGCAATAT
CAACGGGCGC TGCAACAGGC GCCGAATGAC CATCTTCTTC ACTATGGCTT TGGGCGCTTT
CTCTTCCGCT ACAATCCCGA CGCTGGCGCG AACGAACTGC GGCAATCGCG GCCGTGGGAC
GGCTTTCCGG TCTTCGCGCC TAACGGTCAG ATATTTTAG
 
Protein sequence
MSATAQAVVT APITAKTWVM RIFCAVFGAA LILGCLEGAL RVFDVGFPTS LTVPCTMEQQ 
PAACYNLFFT APYFPPGIIH TPTLFAVPAV KAQNTYRIFV LGESAAMGDP DPAYGFSRYL
EVMLRNRYPQ MRFEVMNTGT VAINSHVGLP IAREIAKLKP DVVIIYSGNN EVVGPYGAGT
AFAASAMELP AIRSSIWYHT TRTGQLLTKL GMQKLEWRGM EMFLDKQVPQ SSPLMPYVYA
NFEANLRDTI GVLRGAGATA IVSTVATNLR DCAPFSSLHR AGLSKEALQR WDTLVNEGAK
LEEAGAHSEA LKLYAQTLAI DDEYAELEFR IARVQLALGK REEALKHFER ARDLDTLRFR
ADSRINAINR STAESGGAEL VDAEQLLYAN AVDGITGGDL IYEHVHLTPT GNYLLARAMF
LKIAGKLSPT AGEADVPSES ECEEWLALTG HDRIRIAHEM AERLQKPPFT NQSNHSEQLL
RISMQAQQAD ESPQDTAAQY QRALQQAPND HLLHYGFGRF LFRYNPDAGA NELRQSRPWD
GFPVFAPNGQ IF