Gene Acid345_2384 details

Gene Information

Locus tag: Acid345_2384
Symbol:
ID: 4071382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2817135
End bp: 2819252
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 60%
IMG OID: 637984400
Product: hypothetical protein
Protein accession: YP_591459
Protein GI: 94969411
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
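
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2817135
end_bp = 2819252

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2118
print(protein_length)  # 705
```

Under these assumptions the computed values agree with the listed Gene Length (2118 bp) and Protein Length (705 aa).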


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.249366
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.331232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCGC GCTCGAACGC ACTGGGAAAC GCTTCCTCCT CTTATCTCCG CTCCGCCCTA 
CACCAGCCCA TCCACTGGCA CCAGTGGGGT CCGGAGGCCT TTGCCGCTGC GCAGCAGGAA
AACAAGCCGA TTCTTCTCGA CATCGGGGCA GTGTGGTGCC ACTGGTGCCA TGTCATGGAC
CGCGAGTCCT ACGACGATCC AGAAGTTGCA GACATCCTTA ACCGTGAGTT TATCGCCATT
AAAGTAGACC GCGACGAGCG TCCCGACGTA GACAGCCGAT ACCAAACGGC TGTCGCCGCG
ATTACCGGCC AAGGAGGCTG GCCGCTCACC GCGTTCCTCA CCACCGAAGG CAAACCCTTC
TATGGCGGAA CTTACTTCCC TCCCCGCGAC GCCCACGGCC GTCCGGGCTT CAAGAAGATT
CTGCTGGCGA TTGCCGACGC CTACAAGAAC CGCCGCGACG ACGTATTGCG CGAAGCCGAC
GGCATGATGA CCGCATTGCA CCACGCCGAA GGCCTCGCCG GTCACGGTGG CGATTTCAAT
CCACGCGTCA TCACCATGAT GGTGCAGTCG GCGCTCAATT CCTTCGATCC CAAGAACGGT
GGCTTCGGCT CTGCGCCGAA ATTTCCTCAT GCTTCGATCG TCGAGGTCCT GCTCGATTGG
TACGCGCGCA CCGGTGAAGA CGGCGCCGCC AACGTCGCGC GCACTACGCT TGAGAAGATG
GCGCAAGGCG GCGTGTACGA CCAGATCGCT GGCGGCTTTC ATCGCTATTC GGTAGACGAG
AACTGGATCG TTCCGCACTT CGAGAAGATG TCGTACGACA ACTCGGAGCT GCTGCGCAAT
TACGTGCACG CGGCGCAGCT TTTTCCCGAC GCTGCCTTTG CTGAAACTGC GAAGGACATC
ATCCGCTGGG TAGATTCCAC GTTGACAGAT CGCGAGCACG GGGGCTTCTA CGCCTCTCAG
GATGCCGACA TCAACCTCGA GGATGACGGC GACTATTTCA CCTGGACCGT GGACGAAGCC
AAAGCTGCCC TCACCGCGCA GGAATTTGAA GTTGCGGCGC TGCACTACGA CATCAACGAA
GTCGGCGAAA TGCACCACAA CTCAGCGAAG AACGTCCTCT GGATCCGCGC CGAGGTCGAA
GAAATCGCGA TGCGGCTGTC GCTCAAGCCG GACCAGATCC GGATGTTGCT GAACTCTGCG
AAACAGAAGA TGCTCGTCGC ACGTCTCCAG CGTCCCACGC CATACATCGA CAAGACCGTC
TATGTGAACT GGAATGCGAT GTTCGTGAGC GCCTACCTCG CCGCCGGGCG TGTACTCGGA
ATGAAAGACG CCCACCACTT CGCGCTACGC ACGCTCGATC GCATTCTCGG ACAGTGGAAC
GACAAGCAGC AGTTGCCACA CGTGATCGCC TACTCCGATC CCAACGCCGT GCTGCGCGAA
AGCCGGGGTT TGCTCGACGA TTACGTCTTC ACCGCACTCG CTTGCCTCGA TGCCTACGAA
GCCACTGGCG ATCTCACCTA CTTCCGCTGC GCGCAACAGA TCGCCGACAC CGCCATCGCA
AAATTCGGCG ATGCCACTTC GGGCGGCTTC TTCGACGCCG AACCTACAAC CGAGCAAGTC
GCCCTCGGCG CGCTCTCGGT GCGCCGCAAA GCCTTCCAGG ATTCACCAAC GCCCGCGGGT
AATCCCGCTG CCGCAATCCT GATGCTCCGA CTCCATGCGT ATACCAACGA CACGCGCTAT
CGCGATAAAG CCGAAGACAC CCTCGAAACG TTCGCAGGCG CCGTCGAACA GTTTGGGATT
TATGCCGGAA CCTATGGTCG CGCCGCGATC TGGTTTTCAA AACCCCACAC CCAGGTTGTA
ATCATCGGCA CCGACGCCTC CGCCGCGGAT CTCGAACGCG CCGCATTTCA GACCTTCGCC
GAGAACTTGT CGGTCATTCG TCTCGCACAA GCCGATGCGC ACCTGCTTCC GCCCGCACTG
GCAGAGACCA TACCCAACGT TCCAGGCGTC AATGATGGCC GCGCTGTCGC CGTCGTCTGC
TCGAACTTTG CCTGCCAGCC TCCCATTACA TCCGCGCAGG ATCTTACAGA CACTTTAAAG
AAACTACTTC GCTCCTAG
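
The record reports a GC content of 60% for this gene. As a minimal sketch of how such a figure could be computed from the block-formatted sequence above, the hypothetical helpers `clean_sequence` and `gc_fraction` below strip the spaces and line breaks and count G and C bases. How the database itself computes or rounds the value is not stated, so this is only an illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and newlines used to lay out the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence: 13 of the 20 bases are G or C.
first_blocks = "ATGTCGTCGC GCTCGAACGC"
print(f"{gc_fraction(first_blocks):.2f}")  # 0.65
```

Passing the full gene sequence to `gc_fraction` gives the whole-gene value to compare with the listed 60%.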
 
Protein sequence
MSSRSNALGN ASSSYLRSAL HQPIHWHQWG PEAFAAAQQE NKPILLDIGA VWCHWCHVMD 
RESYDDPEVA DILNREFIAI KVDRDERPDV DSRYQTAVAA ITGQGGWPLT AFLTTEGKPF
YGGTYFPPRD AHGRPGFKKI LLAIADAYKN RRDDVLREAD GMMTALHHAE GLAGHGGDFN
PRVITMMVQS ALNSFDPKNG GFGSAPKFPH ASIVEVLLDW YARTGEDGAA NVARTTLEKM
AQGGVYDQIA GGFHRYSVDE NWIVPHFEKM SYDNSELLRN YVHAAQLFPD AAFAETAKDI
IRWVDSTLTD REHGGFYASQ DADINLEDDG DYFTWTVDEA KAALTAQEFE VAALHYDINE
VGEMHHNSAK NVLWIRAEVE EIAMRLSLKP DQIRMLLNSA KQKMLVARLQ RPTPYIDKTV
YVNWNAMFVS AYLAAGRVLG MKDAHHFALR TLDRILGQWN DKQQLPHVIA YSDPNAVLRE
SRGLLDDYVF TALACLDAYE ATGDLTYFRC AQQIADTAIA KFGDATSGGF FDAEPTTEQV
ALGALSVRRK AFQDSPTPAG NPAAAILMLR LHAYTNDTRY RDKAEDTLET FAGAVEQFGI
YAGTYGRAAI WFSKPHTQVV IIGTDASAAD LERAAFQTFA ENLSVIRLAQ ADAHLLPPAL
AETIPNVPGV NDGRAVAVVC SNFACQPPIT SAQDLTDTLK KLLRS
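
The protein sequence above can be checked against the gene sequence by translating codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence and the final 18 bases.
print(translate("ATGTCGTCGC GCTCGAACGC ACTGGGAAAC"))  # MSSRSNALGN
print(translate("AAACTACTTC GCTCCTAG"))                # KLLRS
```

The two fragments reproduce the first and last residues of the listed protein sequence; the final line can be translated on its own because every preceding 60-base line holds a whole number of codons.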