Gene Acid345_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2353 
Symbol 
ID4069165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2784235 
End bp2787381 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content59% 
IMG OID637984369 
Producthypothetical protein 
Protein accessionYP_591428 
Protein GI94969380 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATCGAA AGTATTCGGC GCTCCGCTCA AGCGCCGCAG TGGGGATTCT TCTTTTCTTC 
TGCTCTTTCT TACTGCCGGC ACAGGTAGAT CCCGCGCTCT TCTCACAGTT GCAATGGCGA
TTCGTCGGCC CTTTCCGTGG TGGACGCTGC GATTCCGCAA CCGGCGTTCC GGGGAATCCT
GCAGTGTTCT ACTTCGGCTC TGTCGGCGGC GGTGTATGGA AGACGACGGA TGCTGGTGTG
ACATGGAAGC CCATCATGGA CTCCCAGCCT GTTGGCTCCA TCGGCGCGAT CGCCGTCGCT
CCATCGAATT CCGATGTTAT TTATGTGGGC AGCGGCGAAG CCGACATGCG TTCGCAGATC
AGCTACGGTA ACGGTATGTA CAAGTCCGCC GACGCCGGCA AGACCTGGAC GCATATCGGT
CTTGAAGACA CGCGTCAAAT CGGCCGCGTG ATCGTTGATC CCAAAGATCC GAACATCGTG
TATGTCGCCG CGTTGGGCCA TGCCTACGGC GCGAACAAAC AACGTGGCGT TTTCAAGTCC
ACCGACGGCG GTCAGAACTG GCAATCCATT CTCTTCAAAG ATGACGACAC CGGCGCCATT
GACCTCGCCT TCGATCCGCA GGACAGCCAA ACCATCTACG CCGCGATGTG GCAAACCCGT
CGTCCGCCAT GGAATGTTTA CCCCGCCTCG AACGGTCCAG GAAGCGGCTT GTACAAATCC
ACCGACGGCG GCGCGCACTG GTCCCAACTC ACGAACGGAT TGCCGACGGA AGGCCTCGGA
CGTATCGGCA TCGCCATCGC GCCGAGTGAT CGCAATCGCG TGTATGCCAT CGTGGATGCG
AAGCAGGGCG GTCTCTACCG CTCTGACGAC GCCGGTGCAA CATGGCAGTT GCTCGACAAC
GAGCGCCGCA TCTGGGGACG CGGCTGGTAC TTCGGCGTGA TCGCGGTCGA TCCCAAGAAC
CCCGATCTCG TCTACATCTC AAACACTTCG ATGTACCGCT CCACCGATGG GGGCAAGTCA
TTCGTCTCCT TCAAAGGCGC GCCCGGCGGA GACGACTACC ACGGCCTATG GATCGCCCCC
GAAGACGGGA AGCGCATGAT CGTTGCCAGC GACCAGGGCA CCGTTGTTTC GCTCAATGCG
GGCGAAACAT GGAGTTCCTG GTACAACCAG CCGACCGCGC AAATCTATCG CGTAGCGACT
GACAATTCGT ATCCCTACTG GATTTACGGA GCGCAGCAAG ATAGCGGCGC CATCGCCGTG
AAGAGCCGAA GCAAATATTC GTCGATCACT GAGCGCGACT GGAAGGGCGT GGAAGTCGGT
GGTGAAAGCG GTATGCTTGC GCCCGATCCG AACGATCCGA ACACGGTCTT CGGCGGCATG
GTCTCGAAGT ACCAGGCTGA CCTCTCCCAG GACCAGGACG TCTCGCCCAC GCTCGGCCGC
GAAGGAACGT GGCGTCAAAC CTGGACGTTG CCTCTCGTCT TCTCGCCTGC CGATCCACAC
AAGCTTTACT TCAGCCACCA GGTGCTTTTC CGCAGCGACA ACGGTGGCAA GTCATGGGCC
GCCATCAGTC CCGATCTCAC GCGCGACAAT CCTGGCGTTC CCGCAAACCT GGATCCCATC
ACCGCGAAGT ACGGCCTCGA TTCGCCGCGC AAAGGCGTCA TCTATTCGAT CGCGCCGTCG
CCCCTCGATG CCAACTTACT TTGGATTGGC ACCGACGACG GACTCATCCA CCGCTCCACC
GACGACGGCA AACACTGGCA GAACGTCACG CCCGCCGAAA TCACGGCGTG GAGCAAAGTC
GCTACGCTCG AAGCCTCACA TTTCGACAAG AACACCGCCT ACGCCGCAGT GGACCGTCAT
CGTCTGGAAG ACTACAAGCC ATACATCTAT CGCACCCGCG ATGGCGGCAA GACATGGCAG
CAAATCGCCG GCAATTTCAA TAGCTATGTG AATTTCATTC GTGAAGACTT GGTCCGCCCC
GGCTTGCTAT TTGCCGGCAC GGAATTGAGC CTGTGGGTGA GCTTCGACGA TGGCGAGCAC
TGGCAGTCCT TCCAGAAGAA CCTGCCAACC GTTTCCATGC GTGACATCAC GATTCACGGA
GACGACATCG TGGTTGCGAC CCATGGCCGC GGCTTCTACG TGATGGACAA CATCACGTCG
CTTCGCGAGC TAAAGCCGCA GACGAATGAA GAGGACGCCC ACTTCTACAA ACCGCAAGTC
GCCACCCGCA CTCGTCCTGG CGGCGACGAA GCCACGCCCT ATCCACCAGA AATTCCTCAC
GGAACAAATC CGCTGACAGG CGCGATCTTC GACTATTACT TGAAGACCGA CGTTCCCGGC
CCGATCACAC TCAATATCTT CGACGCCAAA GGCACGAGCA TCCGCAAGTT CTCCAGCGAA
AACAATCCCA AGCAGCCGGA TGAGAAGACC CTGGTGTTTC CCGCGTTCTG GGTGAAGCTT
CCGACGCCGC TGGCCACTAA CGCAGGCGCG CATCGATTCG TGTGGGACAT GCATTACGAA
CTCAACGGAG CGAGCGGCGG ATACCGCCAG GCCGCCGGCA CGTGGGCGCT CCCAGGCGAA
TATTCCGCAG TCCTAATCGT TGCAGGGAAG ACTTACAAGC AGCCGTTCAC CATCCGCATG
GATCCGCGTG TGAAAACTAC GCCCGCGGAT TTGCAGCGTC AATTCGCTGT CTCGCAGCGC
GCCGCCCAAG TGCTTGCAAA GCTCAACGCG GCGGTGGCAA AGGGAACGGC CATCGAAAAA
CAACTCGGGA CAACATCAGC GCCGATTGAA CCTTTCCGTC AGCATCTCAG CGCGGTGCTA
GGGCCTGCCG ATCTCGGTTA TGGGGCATCC AGCACTCCGA TTGACACCGA CACCACGAGC
CTGCGCCATC TCCAGGGAAG CTTCCGGTCG GTGCTCTACG CCTTGCAGAG CGCTGACGCA
GCGCCTACTC CGGACCAAGA AGCCGCGCTC GTAAAATTCG AGCAGACGTT CGCGTCCACC
GATAAGCAGT GGACTGCGTG GCTGGCGACC GACTTGCCAA AGCTGAACGA TCAATTAAAG
GCAGCGGGAC AGAAGGAAAT CTCGAAAGGT CCGGAACCGG AAGAGGACGA CTTCGATTAC
GGCAACGATA AAGATCGAGA CCAGTAG
 
Protein sequence
MDRKYSALRS SAAVGILLFF CSFLLPAQVD PALFSQLQWR FVGPFRGGRC DSATGVPGNP 
AVFYFGSVGG GVWKTTDAGV TWKPIMDSQP VGSIGAIAVA PSNSDVIYVG SGEADMRSQI
SYGNGMYKSA DAGKTWTHIG LEDTRQIGRV IVDPKDPNIV YVAALGHAYG ANKQRGVFKS
TDGGQNWQSI LFKDDDTGAI DLAFDPQDSQ TIYAAMWQTR RPPWNVYPAS NGPGSGLYKS
TDGGAHWSQL TNGLPTEGLG RIGIAIAPSD RNRVYAIVDA KQGGLYRSDD AGATWQLLDN
ERRIWGRGWY FGVIAVDPKN PDLVYISNTS MYRSTDGGKS FVSFKGAPGG DDYHGLWIAP
EDGKRMIVAS DQGTVVSLNA GETWSSWYNQ PTAQIYRVAT DNSYPYWIYG AQQDSGAIAV
KSRSKYSSIT ERDWKGVEVG GESGMLAPDP NDPNTVFGGM VSKYQADLSQ DQDVSPTLGR
EGTWRQTWTL PLVFSPADPH KLYFSHQVLF RSDNGGKSWA AISPDLTRDN PGVPANLDPI
TAKYGLDSPR KGVIYSIAPS PLDANLLWIG TDDGLIHRST DDGKHWQNVT PAEITAWSKV
ATLEASHFDK NTAYAAVDRH RLEDYKPYIY RTRDGGKTWQ QIAGNFNSYV NFIREDLVRP
GLLFAGTELS LWVSFDDGEH WQSFQKNLPT VSMRDITIHG DDIVVATHGR GFYVMDNITS
LRELKPQTNE EDAHFYKPQV ATRTRPGGDE ATPYPPEIPH GTNPLTGAIF DYYLKTDVPG
PITLNIFDAK GTSIRKFSSE NNPKQPDEKT LVFPAFWVKL PTPLATNAGA HRFVWDMHYE
LNGASGGYRQ AAGTWALPGE YSAVLIVAGK TYKQPFTIRM DPRVKTTPAD LQRQFAVSQR
AAQVLAKLNA AVAKGTAIEK QLGTTSAPIE PFRQHLSAVL GPADLGYGAS STPIDTDTTS
LRHLQGSFRS VLYALQSADA APTPDQEAAL VKFEQTFAST DKQWTAWLAT DLPKLNDQLK
AAGQKEISKG PEPEEDDFDY GNDKDRDQ