Gene Acid345_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0232 
Symbol 
ID4073082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp246094 
End bp247134 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content61% 
IMG OID637982233 
Productoxidoreductase 
Protein accessionYP_589311 
Protein GI94967263 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCAAGC CTATCCGTGT CGCCGTGATT GGAGCGGGAG CGTTCGGCAA GAACCATGCC 
CGTGTTTATT CCGAACTGCA TCGTGAGACC CAGGCCGGCA CCCTGCCGCA GGGCACGCCC
GGCGTAGAAC TCGTCGGCAT TGTCGACAGC GACACCGCTC GCGCCGCGCA GATCGCCGCC
ACTCACGAAA CTCGCGCTTA TGCTTCGATC CGCGAACTGC TGGCGCACGG CGGAGCCGAT
GCCGTCTCCA TTGCGGTGCC CACGAAATTC CATCGCAATG TGGCGCGTGA AGCTCTCGAA
GCCGGCTTCG ACGTGCTGAT TGAAAAGCCG CTCGCCGCCA CGCTCGAAGA AGCAGACGAG
ATCATTGACC TCGCGGCCGA GCATCGCCGC ATCGCGCAGG TCGGACACCT CGAACGCTTC
AATCCTGCTG TCCGCGCGAC GATCCCGCTC ATCACCAAGC CCATGTTTTT CGAGATTCAC
CGCCTGAGCC TCTTCGGTCC GCGCGCTCTC GACGTGGATG TCGTCCTCGA CCTCATGATC
CACGACATCG ACATCATCCT GTCGTTCGTT GCAGGCGAAG TGGTTGAAGT GCGCGCCGTC
GGTCTGCCGG TGCTCTCGCA GAAGGTGGAT ATCGCCAGCG TGCGCCTGCA ATTCGCAAAC
GGATGCATCG CCAATCTCAC TGCGAGCCGA GTTTCGACGG AACGCGTGCG CAAACTGCGC
TTCTTCCAGC CCAACCAATA CGTGTCGATC GATTACGCGC GCCAGGATGT GCTGGTGTTT
ACGGTGAACA AGATGGCGCA GCACACGCCC AACGATGGTG GCCTGCCGCA AATGCCGGAT
GTAATGCCGT CGAAACCGCC GGTGGTGCCG GAAGAGCCGC TGCGCTCGGA AATCCTCAGC
TTCCTGCAAG CAGTGCAAAC TCGGGAAACG CCAGTGGTGT CGTTGGAAGA TGGCCGCCGA
GCTTTGGCGG TTGCGTTGGA AATTCAGAAG GCGATTGAAG AACACGCCGA GCGGGCGCAT
CTGGATGCGC TGACGAAGTA G
 
Protein sequence
MSKPIRVAVI GAGAFGKNHA RVYSELHRET QAGTLPQGTP GVELVGIVDS DTARAAQIAA 
THETRAYASI RELLAHGGAD AVSIAVPTKF HRNVAREALE AGFDVLIEKP LAATLEEADE
IIDLAAEHRR IAQVGHLERF NPAVRATIPL ITKPMFFEIH RLSLFGPRAL DVDVVLDLMI
HDIDIILSFV AGEVVEVRAV GLPVLSQKVD IASVRLQFAN GCIANLTASR VSTERVRKLR
FFQPNQYVSI DYARQDVLVF TVNKMAQHTP NDGGLPQMPD VMPSKPPVVP EEPLRSEILS
FLQAVQTRET PVVSLEDGRR ALAVALEIQK AIEEHAERAH LDALTK