Gene Acid345_2084 details

Gene Information

Locus tag: Acid345_2084
Symbol:
ID: 4069683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2498220
End bp: 2499218
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 59%
IMG OID: 637984099
Product: pyridoxal-5'-phosphate-dependent enzyme, beta subunit
Protein accession: YP_591159
Protein GI: 94969111
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID:
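
The start, end, and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2498220, 2499218)
print(length)                     # 999
print(protein_length_aa(length))  # 332
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) fields listed in this record.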


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.432383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0544348
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTCCC TTGAATCTTT CCGCCAAGCT CAAGAGCGTC TCCGCGGCAT TGCGGCGCGA 
ACCCCGTTGG TCCAATACTT CCCTCCCGGC CACACCCACA ACGACCGCAC TCTGCGCATC
AAGCCAGAGT CGCTGCAGCC GATTGGCTCG TTCAAACTCC GCGGCGCTTA CAACAAGATC
GCAACGCTTT CCGACGAGGA ACGTCAAAGA GGTGTAATCA CCTATTCGAG CGGAAACCAT
GCGCAGGGCG TAGCCTATGG TGCGCGCGCC ATGGGTGTGA AGGCGTGCAT CGTAATGCCG
CGCAACGCTC CCAAGGTCAA AATGGACGCG ACCAAGGCCC TCGGGGCCGA AATAGTAACT
GTAGGACCAG CCAGCTCCGA GCGCCGTAAG AAGGCGGAAT CGTTGGCTCA GGAGCATGGC
TACGCGATCG TCCCGCCCTA TGACGACGAA CAGATCATCT CCGGCCAAGG CACCGTCGGC
ATGGAAATCT ACGAAGACCT CCCTGAGGCC GACATCGTAC TCGTCCCCAT TGGCGGCGGC
GGTTTGATCA GCGGGGTCTC CGCAGCACTG AAGATGAGCG GCAGTAGGGC GAAGATCATC
GGCGTCGAGC CCGAACTGGC CAACGACGCC CAGCAGTCGC TGCGAACCGG TAAGATCGTC
ACCCTCCCTG CTGAGCGTGT TTCGAGTACC CTTGCCGACG GCCTCCGCAC TCAGTCCGTT
GGAGATTTGA ACTACGAAAT CATCAAGCAA TATGTAGACG ACATCGTCAC GGTAGAAGAG
GACGAAATCC GCGAGGCCAT GCGCCGCATG ATGAGCGAGT CGCGCCTCGT GGTTGAACCA
AGTGGCGCCG TGACATTTGC TGCTTATCTC TTCCACGAAA AAGAGCTTCC TGCCGGCCGC
AACGTGGTTG TCGTAATGAG CGGCGGCAAT ATTGAACCAT CGCTGCTCGC GCAAGTAATG
ACCGAGTCCG ACGCGCAATC CGCGCAGACT GGGCGCTAG
 
Protein sequence
MVSLESFRQA QERLRGIAAR TPLVQYFPPG HTHNDRTLRI KPESLQPIGS FKLRGAYNKI 
ATLSDEERQR GVITYSSGNH AQGVAYGARA MGVKACIVMP RNAPKVKMDA TKALGAEIVT
VGPASSERRK KAESLAQEHG YAIVPPYDDE QIISGQGTVG MEIYEDLPEA DIVLVPIGGG
GLISGVSAAL KMSGSRAKII GVEPELANDA QQSLRTGKIV TLPAERVSST LADGLRTQSV
GDLNYEIIKQ YVDDIVTVEE DEIREAMRRM MSESRLVVEP SGAVTFAAYL FHEKELPAGR
NVVVVMSGGN IEPSLLAQVM TESDAQSAQT GR
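
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC fraction. The following is a minimal Python sketch of that check, not the method used to produce this record. It builds the codon table from the standard amino acid ordering, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the input is assumed to contain only the letters A, C, G, and T plus whitespace.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# The first 21 bases of the gene sequence listed above.
print(translate("ATGGTCTCCC TTGAATCTTT C"))  # MVSLESF
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after applying `clean` to both) gives a quick consistency check, and `gc_percent` on the same input can be compared with the GC content field.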