Gene Acid345_4044 details


Gene Information

Locus tag: Acid345_4044
Symbol:
ID: 4072466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4778435
End bp: 4779469
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 60%
IMG OID: 637986075
Product: aldo/keto reductase
Protein accession: YP_593118
Protein GI: 94971070
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
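
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4778435, 4779469
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed gene length (1035 bp) and protein length (344 aa).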


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATATC GACTGCTCGG CGGTTCTGGA CTCAAGGTGC CGGTGCTCAG CTTCGGCGCA 
GCGACATTTG GCGGTAAGGG CGAATTCTTC GGCGCGTGGG GTAAAACCGA CGTAGCCCAA
GCCTCTCGCA TGGTGGACAT GTGCGTGGCA GCGGGCGTCA ACTTCTTCGA TACCGCCGAC
ATTTATTCAC AAGGCGCCTC CGAAGAAATT CTTGGTGAAG CGATCAAGGG CAAACGCTCG
AACCTCCTGA TCTCGACAAA AGCCACATTC CCCATGGGCG AGGGACCAAA CGATCTCGGC
TCCTCTCGTT ATCACCTGAT CCAGGCGTGC GAAGCCAGCC TGCGGCGCCT ACAAACGGAT
TACATCGACG TCTATCACCT CCACGGCTTC GACTATTCCA CGCCGATCGA AGAAACGCTG
CGCACCCTCG ACACGCTCGT GACCAGCGGC AAAGTGCGTT ACATCGCGTG CTCGAATTTC
TCTGGCTGGC ACCTGATGAA GTCGCTGGCG ATCTCCGAGA AATACGGCTG GTCGCGCTAC
GTCGCGCACC AGGTGTACTA CTCGCTCATC GGCCGCGACT ACGAGTGGGA ATTGATGCCG
CTCGGAATCG ACCAGAAAGT CGGCGCCATC GTATGGAGCC CGCTGGGCTG GGGCCGCCTC
ACCGGTAAAA TCCGCCGCGG CAAACCGTTG CCGGAGGTGA GCCGCTTGCA CAAAGCCGCT
GACGGCGGGC CGATCGTCGC CGACGAATAC CTCTATAACG TGGTGGACGC GCTGGACGAA
GTGGCGAAAG AGGTCGGCAA AACCATCCCG CAGGTGGCGT TGAACTGGCT TCTCCAGCGC
CCAACCGTGG CAAATGTGAT CATCGGCGCG CGCAACGAGG AGCAGTTGGC GCAAAACCTG
GGCGCGGTGG GGTGGAATTT ATCCACCGAG CAGGTGCGGC GGCTGGATGC CGCCAGCGAT
GTGACTCCGA TCTATCCCTA TTGGCATCAA CGGCAATTCG TCTCACGCAA TCCTCTGGCG
CAGATTTCTG CATAA
 
Protein sequence
MEYRLLGGSG LKVPVLSFGA ATFGGKGEFF GAWGKTDVAQ ASRMVDMCVA AGVNFFDTAD 
IYSQGASEEI LGEAIKGKRS NLLISTKATF PMGEGPNDLG SSRYHLIQAC EASLRRLQTD
YIDVYHLHGF DYSTPIEETL RTLDTLVTSG KVRYIACSNF SGWHLMKSLA ISEKYGWSRY
VAHQVYYSLI GRDYEWELMP LGIDQKVGAI VWSPLGWGRL TGKIRRGKPL PEVSRLHKAA
DGGPIVADEY LYNVVDALDE VAKEVGKTIP QVALNWLLQR PTVANVIIGA RNEEQLAQNL
GAVGWNLSTE QVRRLDAASD VTPIYPYWHQ RQFVSRNPLA QISA
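
The relationship between the gene and protein sequences can be spot-checked by translating a few codons. The sketch below is a minimal translator, not the database's own method: it builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for internal codons (the tables differ in permitted start codons). The `translate` helper and the two test fragments, taken from the first 30 and last 15 bases of the gene sequence, are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: translation table 11 uses the same amino-acid assignments
# as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").replace("\n", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGAATATC GACTGCTCGG CGGTTCTGGA"))
print(translate("CAGATTTCTG CATAA"))
```

The two fragments translate to the first ten and last four residues of the listed protein sequence, with the final TAA acting as the stop codon.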