Gene Acid345_3812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3812 
Symbol 
ID4071096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4506702 
End bp4507706 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID637985835 
ProductADP-glyceromanno-heptose 6-epimerase precursor 
Protein accessionYP_592886 
Protein GI94970838 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.284553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.244227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATCG TTACCGGTGG CGCCGGATTC ATTGGAAGCA ATCTCGTTCA TGAACTGAAT 
GCCGAGGGAA TCACCGACGT CCTTGTTGTT GACAATCTTG CAAATGCAGC AAAGTTTGAA
AATTTGCTGG GCGCGAAATT CGCCGACTAC ATGGATAAAC GGGCCTTTCG AGCCGCAATT
CGAGAGAGGT CGCTGGGAGC CCCGAAGATA GAGGCTATCT TGCATCAAGG AGCCTGTTCC
AACACGCTAG AAGACGATGG CGTGTACATG ATGGACAACA ATTATCAGTG TACTAAGGAG
CTCCTGCATT TCGCGATTGA ACAGGGAGCG CGCTTCGTCT TTGCTTCCAC GGCCGCCGTA
TACGGGCTAG CAGGTCCTGG ACATTTTGCG CCAATTCCAG GGAATGAGCG TCCGCTCAAT
ATTTACGGCT ATTCGAAATT AATGTTCGAC AATTATTTAC GCCATAAGAT AGCAGCAGAC
GAAGTGTCAA TCACGGCTGT GGGTCTGCGG TACTTCAACG TCTACGGGCC GCGCGAGCGT
CACAAAGGAC GTATGTCTTC AGTGATCCAT CATTTCACGG GACAGATGAA GAAAGAGCAG
AAACTGCGGA TGTTCCAAGG ATCCGGCGGT TATGGAGATG GTGAACAAAG AAGGGATTTC
GTATATGTCC GCGACCTCGC AAGGATGAAT TTATTCTTCG CGCAGCTCGG ACGTTTCGAG
GCGGCTAAAG GCGAACCAGA GAGGACATAC CGTGGCATCG TCAACGCTGG CACTGGACTG
AGCCGAAGCT TCAATGATGT CGCGGCTGCA CTAATGACGA TTCACGGAAA GGTCCCGGTC
GAGTACATGC CGTTTCCATC CGATCTAATT GGTCGATATC AGCATTTCAC CGAGGCAGAC
ATATCGGGAC TCCGCAAACT CGGCTGGATT GAGGAACCGA CCACGCTGGA AGCAGGCATC
GACGAGACAT ACGCGACACT ACGGCAGTTG GGCCGCGAGT CTTGA
 
Protein sequence
MVIVTGGAGF IGSNLVHELN AEGITDVLVV DNLANAAKFE NLLGAKFADY MDKRAFRAAI 
RERSLGAPKI EAILHQGACS NTLEDDGVYM MDNNYQCTKE LLHFAIEQGA RFVFASTAAV
YGLAGPGHFA PIPGNERPLN IYGYSKLMFD NYLRHKIAAD EVSITAVGLR YFNVYGPRER
HKGRMSSVIH HFTGQMKKEQ KLRMFQGSGG YGDGEQRRDF VYVRDLARMN LFFAQLGRFE
AAKGEPERTY RGIVNAGTGL SRSFNDVAAA LMTIHGKVPV EYMPFPSDLI GRYQHFTEAD
ISGLRKLGWI EEPTTLEAGI DETYATLRQL GRES