Gene Acid345_3652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3652 
Symbol 
ID4072255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4320556 
End bp4321569 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID637985675 
Productoxidoreductase 
Protein accessionYP_592727 
Protein GI94970679 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.502003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACAT CGAAAGTCCG ATTTGGAATT GTGGGGTTCG GTCTGCACGC GGTAAAGCGC 
CTGATGCCTG CGTTTGGCAT TGCGGAACAC AGCGAGGTCG TTGCGCTTTC GCGGCGCGAT
GCGGCACGCG CCCGTGCATC GGCCGAACAG TTCGGCATCG CGAATGCTTT TGCTTCCGTC
GAAGAACTCG CGCACTGCGC CGAGGTTGAC GCCGTCTTCA TCGCTTCGCC CGACGCGCTG
CATTTGCACG ACACGCTGGT TTGCCTTCGC GCAGGCAAGC CGGTGCTCTG CGAGAAACCG
ATGGCGATGA ATGCGATGGA AGCCGAGCAG ATGGTCGGGG CGGCGAAATC GGCGGCGCTG
CCGCTGGGGG TTGCACAGGT GTTTCGCTTT GAGGACAGCA CCCGGCGTCT TCGTGAACGC
GTCCGCAACG GCAACATCGG GCGGCCGGTG CTGGCGCGCA CCGAATTCTG CTATCCCGGT
ATCGACAGCG CGCGTACGTG GATCACCGAT CCGACCCTCG CGACCGGCGG TCCGATTGCC
GACGTCGGCG TGCACTGCAT TGATGCGCTA CGTTTCATTC TCGGCGACGA AGTCACTGCC
GTAAGCTGCA CGGCTGTGAA AGACGAGCTC TCCGAACCGG TGGAGTGCGC AGGTGTGGTC
ACACTCGAGT TCATGCGCAA GACACTGGCG ACGGTGTCGG TTTCGACGCG GGCGCAGTAC
CGCACGTTCA TCGAGATTAC CGGCGAGACG GGAGTGCTGA CGGCCTTTGA CGCGTTAAAC
GTCGAGCGGC CCCTGACCAT CGAGTACCGG CCCCACGCTG ATCCGCGACA GGTCGAGCGC
GAAGAGGTTT CCAACCAGCT TGCGTACGCG AGGCAGCTGG ACGCCTTCGC CGCCACGGTG
CGCGGAGAGT CTGCCTTCCC CGCACCGGGC TCGGAGGGGC AGCGCAACCA GCAGATTCTG
GACGCAGCGT ACGCGAGCTG GCGCAGCGGG CAGCGCCAGG TCCTGCAGTC ATAG
 
Protein sequence
MPTSKVRFGI VGFGLHAVKR LMPAFGIAEH SEVVALSRRD AARARASAEQ FGIANAFASV 
EELAHCAEVD AVFIASPDAL HLHDTLVCLR AGKPVLCEKP MAMNAMEAEQ MVGAAKSAAL
PLGVAQVFRF EDSTRRLRER VRNGNIGRPV LARTEFCYPG IDSARTWITD PTLATGGPIA
DVGVHCIDAL RFILGDEVTA VSCTAVKDEL SEPVECAGVV TLEFMRKTLA TVSVSTRAQY
RTFIEITGET GVLTAFDALN VERPLTIEYR PHADPRQVER EEVSNQLAYA RQLDAFAATV
RGESAFPAPG SEGQRNQQIL DAAYASWRSG QRQVLQS