Gene Acid345_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3654 
Symbol 
ID4072257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4323445 
End bp4324440 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content60% 
IMG OID637985677 
Productaldo/keto reductase 
Protein accessionYP_592729 
Protein GI94970681 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.488784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACT CCACTGCTGC GATCCCTCAA CGCAAATTCG GCAAAGCCGA CGCCACGGTT 
TCCTGCGTCG GCTTCGGTGG TCACCACGTT GGCGACGCCA CCGACGTGAA AGAAGCCGTC
AAGCTGATCC ATCAGGCCGT AGATGCCGGC ATCACCTTCT TCGACAACTG CTGGGAATAC
CATCGTGGTA AGACCGAAGA CTGGATGGGC CAAGGCCTCA AGGGTCGTCG CGAAAAGGTC
TTCCTGATGT CCAAGGTCTG CACCCACGGG CGCGATGCCG ACCTCGCCAT GCGCATGCTC
GAGCAGTCGC TCAACCGCCT CCAGACCGAT CATCTCGACC TCTGGCAGAT TCATGGCGTC
TCATTCGACA ACGATCCCGA GCTCTTCATT CGCCCGAACG GCGCGGCCGA GGCCCTCCGC
AAAGCCAAAG AACAGGGCAA AGTCCGCTTC GTAGGATTCA CCGGTCACAA GGATCCCGAC
ATCCATCTCG CGATGCTGAA TACCGGCTTT CCCTTCGACG CCGTGCAGAT GCCGCTGAAC
CCGTTCGACT ACCACTTCCG CAGCTTTCAG GGAAAAGTGC TGCCCGAGTT GCAGAAGCGC
GGGATCGCGG CACTTGGCAT GAAGCCCATC AGCGGCCATG GCGACGCGGT AAAGCGCGGT
GTGCTGAGTG GAGAAGAATC GCTGCGTTAC GCGATGAGCC TGCCCGGCGT GACCACGACC
ATCACCGGCA TCGACAAACA GGAAGCGCTC GACCAGGCGA TCAAAGTCGC GCGCGGCTTC
CAACCCATGA CCGAGCAGGA AATGTCCGCC CTTCGCGATC GCGTGAAGCC CTACGCCGGC
GATGGACGCT ACGAACTCTA CAAAGTCTCG CTCAAGTTCG ATAATCCCGA GGCGCGCATG
GCACACGATT TCCCCCTCGA CATGCAGTCC GTCGAGGTGA AGGAAATGAT GAAGGCCACC
GAAAATACCG GCAAGCCGTT TCCGGAGGCG AAATGA
 
Protein sequence
MADSTAAIPQ RKFGKADATV SCVGFGGHHV GDATDVKEAV KLIHQAVDAG ITFFDNCWEY 
HRGKTEDWMG QGLKGRREKV FLMSKVCTHG RDADLAMRML EQSLNRLQTD HLDLWQIHGV
SFDNDPELFI RPNGAAEALR KAKEQGKVRF VGFTGHKDPD IHLAMLNTGF PFDAVQMPLN
PFDYHFRSFQ GKVLPELQKR GIAALGMKPI SGHGDAVKRG VLSGEESLRY AMSLPGVTTT
ITGIDKQEAL DQAIKVARGF QPMTEQEMSA LRDRVKPYAG DGRYELYKVS LKFDNPEARM
AHDFPLDMQS VEVKEMMKAT ENTGKPFPEA K