Gene Acid345_0795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0795 
Symbol 
ID4068576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp985164 
End bp986699 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content60% 
IMG OID637982802 
Productthiol oxidoreductase-like 
Protein accessionYP_589874 
Protein GI94967826 
COG category[C] Energy production and conversion 
COG ID[COG3488] Predicted thiol oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.39359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCG GTCGTTCCTG GATGTTGATG TGCTCGTTGC TAACCGTCCT GTTGTTGACC 
ACCCTTGTCG CAGCTCAGAA AGATCCCGGA GTACGAAAAG GTGCGCCTGG CGCCGGCACT
CCGCTCAATG GATTAACGCC AATAGAACTC AACATGTTCT TTGAAGGATT CCAACGCACT
GTCCAGTTGG AAGGTGTTTG CGATGACTGT ACAGATGTCA CGCTGGGTAC TTTTGTGGAT
CCGGCCAAGG CCAACTTGGT GACCCAGACG AACTCGTCCG GGCTCGGCGT CCGCTTCAAT
GGAGACCAGT GCAGCTCTTG CCACAACCAG CCGGCAGTGG GCGGTTCGGG TGGCTTCATG
GTTCCTAACC CACAAGCTCC GGCGAATCTT CAGCGCCCGC CTGAGAACCC GATGTTCGAT
CTCATCCCGC ATCGGAAGGG CGCGACGAAC GCGGTACCGT CGTTCATCCA TCAGTATGGG
CCGATCCGCG AAGTACGTTT TGCACGGAAG CCTGACGGCT CGCCCGATGG CGGAGTCCAT
CAGCTTTTCA GCGTCGTCGG CCGTTCTGAC ATCTTTCCCG CCGGCCAGGA AAATACCTGT
ACGAGTGCCG TGTTGCCGCC GACGGACTTC GAGTCGCAAT ACCGTTCTGG TAACCTGCGC
TTTCGGATAC CGCTGCAACT CTTCGGGCTC GGCATCATCG ACGGAATCCA GGACCGCGAG
ATTCTCGGCC GGCACCAGGC GACAGCGTCG GTGCGCCAGC TCTTCGGTAT CCAGGGAGTG
CCCAATCACA GTGGCAATGA CGGCACGATC ACGCGTTTCG GTTGGAAGGC ACAGAACAAG
TCGATCGCGA TCTTCTCCGG CGAGGCCTAC AACGTGGAGA TGGGCGTGAC CAACGATCTG
TTCACGCAGG CGACCGACGA GTCACCGCTG TGCACCGCCG ACAAGAGCGA GCCCAACGAC
ATCACGCGGC TCGACCCTGA CGACACGCGC AACCAGAGCT TCTACAACCC GAACCACGAG
GTCGCCGACT GGCTGATGTT CGCGATCTTC ATGCGCTTCC TGGACGCGCC GCAACCGGCG
ACGTTCACGG ACAGCGCCCA ACATGGGCAG CAGCTCTTCG GCACGGGGCC GGACAATCCG
GGTGTCGGCT GCGTGCTCTG CCATACCGCA ACCATGAATA CTCCGGCGAG GAGCGAGACC
CCTGCGCTGG AGAATCTGAC GGTGCATCCG TATACCGACC TGCTCATCCA TCACATGGGA
AGCGGCCTGG CGGACGACAT CACGCAAGGA CAAGCAACCG GCGACATGTT CCGCACTACG
CCACTCTGGG GAGTCGGCCA GCGCATGTTC TTCCTGCACG ATGGCCGAAC CAGCGACTTG
CTGCAGGCCA TTGAAGCGCA CGCTTCAGGC GGCGATTCGC ACGGGATGAA ACCGTACGGC
TACGGGCCAT CGGAGGCGAA CGCCGTGATC CGGAAGTTCA ACGCACTGCC TGCGAAAGAC
CAGCAATCGG TGCTCGATTT CCTGAGAGCA CTGTGA
 
Protein sequence
MKFGRSWMLM CSLLTVLLLT TLVAAQKDPG VRKGAPGAGT PLNGLTPIEL NMFFEGFQRT 
VQLEGVCDDC TDVTLGTFVD PAKANLVTQT NSSGLGVRFN GDQCSSCHNQ PAVGGSGGFM
VPNPQAPANL QRPPENPMFD LIPHRKGATN AVPSFIHQYG PIREVRFARK PDGSPDGGVH
QLFSVVGRSD IFPAGQENTC TSAVLPPTDF ESQYRSGNLR FRIPLQLFGL GIIDGIQDRE
ILGRHQATAS VRQLFGIQGV PNHSGNDGTI TRFGWKAQNK SIAIFSGEAY NVEMGVTNDL
FTQATDESPL CTADKSEPND ITRLDPDDTR NQSFYNPNHE VADWLMFAIF MRFLDAPQPA
TFTDSAQHGQ QLFGTGPDNP GVGCVLCHTA TMNTPARSET PALENLTVHP YTDLLIHHMG
SGLADDITQG QATGDMFRTT PLWGVGQRMF FLHDGRTSDL LQAIEAHASG GDSHGMKPYG
YGPSEANAVI RKFNALPAKD QQSVLDFLRA L