Gene Acid345_4508 details

Gene Information

Locus tag: Acid345_4508
Symbol:
ID: 4070186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5349581
End bp: 5350606
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 59%
IMG OID: 637986547
Product: aldo/keto reductase
Protein accession: YP_593582
Protein GI: 94971534
COG category: [R] General function prediction only
COG ID: [COG1453] Predicted oxidoreductases of the aldo/keto reductase family
TIGRFAM ID:
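
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function name `expected_lengths` is hypothetical and not part of any database tool.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_len, protein_len = expected_lengths(5349581, 5350606)
print(gene_len, protein_len)  # 1026 341
```

Under these assumptions the result agrees with the Gene Length (1026 bp) and Protein Length (341 aa) fields of this record.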


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.532493
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.56936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG AGCAAAAGAA CCGCGAATTG GATCGGCGTG AGTTCCTCGG GATCTCCGTC 
GGAGCAGCGA TGCTGTTCGG CATGAATGGC TATCCAGGTC TCGCGCAAGA CAAGACCGGT
GTGCAATACC GTACCCTCGG CCACACCGGC GAACGTGTCT CGTGCATCGG CCTCGGTGGA
TACCACATCG GCATCCAAGG CTCGGAAGAC GAGAGCATCC GCATCATTCG TCGAGCGATC
GATGGCGGCA TCAACTTCCT CGACAACTGC TGGGACTACA ACGACGGTGG CAGCGAAGTC
CGCATGGGCA AGGCCCTGCG CGATGGCTAT CGCAAGCGTG CCTTCCTAAT GACCAAAATT
GACGGGCACG ACGGTAAGAC CGCGACGAAA CAACTCGAAG ACTCCCTCCG TCGCCTCCAG
ACGGATCACC TCGACTTGTT GCAATTCCAC GAAGTCATTC GGCCCACCGA TCCCGACAGG
ATCTTTGCCG CCAACGGCTC ATTCGAAGCG ATGCAGAAGG CCAAGCAGGC TGGAAAGATT
CGTTACCTTG GCTACACCGG CCACAAGGAT CCCGAGATCC ACCTGAAGAT GTTGAGCACG
GCGCTCGCGC ACAATTGGAC GCCCGACTCC GTCCAAATGC CGCTGAACGT CATGGACACA
CACTTCAACA GCTTCGAACA CAAAGTCCTG CCCGAACTGG TGAAGCACAA CATCGGCGTG
CTCGGTATGA AGCCCATGGG CAGTGGCGTC ATCCTGCAGA GCAAGGTCGT CACGCCCGTC
GAGTGCCTGA CCTATGCACT CAGCTTGCCG ACCAGCGTCG TCATCACTGG TTGCGATTCC
ATGCAAGTCG TCGATCAGGC GCTGAAGGTC GCGCGTGAGT TCAAGCCGCC CAGCGAAAAA
GAAGTCGCAG CTCTTCGCGC GAAAACCGCA CCGGTTGCGA TGGCAGGGAA GTACGAGCTC
TACAAGACTT CGAGCAATTT CGACGGCACC GCTCACAACC CGCAGTGGCT CGGCGGGGCA
CCTTAG
 
Protein sequence
MTDEQKNREL DRREFLGISV GAAMLFGMNG YPGLAQDKTG VQYRTLGHTG ERVSCIGLGG 
YHIGIQGSED ESIRIIRRAI DGGINFLDNC WDYNDGGSEV RMGKALRDGY RKRAFLMTKI
DGHDGKTATK QLEDSLRRLQ TDHLDLLQFH EVIRPTDPDR IFAANGSFEA MQKAKQAGKI
RYLGYTGHKD PEIHLKMLST ALAHNWTPDS VQMPLNVMDT HFNSFEHKVL PELVKHNIGV
LGMKPMGSGV ILQSKVVTPV ECLTYALSLP TSVVITGCDS MQVVDQALKV AREFKPPSEK
EVAALRAKTA PVAMAGKYEL YKTSSNFDGT AHNPQWLGGA P
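
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCGACG AGCAAAAGAA CCGCGAATTG"))  # MTDEQKNREL
```

Pasting the full gene sequence into `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content field of this record.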