Gene Acid345_3499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3499 
Symbol 
ID4072757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4127145 
End bp4128338 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content59% 
IMG OID637985521 
Productglycosyl transferase, group 1 
Protein accessionYP_592574 
Protein GI94970526 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCG TCGTTTTTCT CGCTCCGTAC GTCAAACACT ATCGCTTGCC TTTCTTCGAT 
CTGCTACACG CCGTGCTACA GAGTGACGGC ATCGAGATGC GCGTACTCTA CGGGCATCCG
AACTCGCTTC ATGAAGCGCG CAAGGACAAT GCGGTGCCGC CGGACTCTTA TGGCCGGCCG
GTGAAGAGCT ACTGGATTGC CGACCGGTTT GTGTATCAAG CAGCGTGGCG GGAGATCAGC
GAGGCGGACC TGGTCATCAC TCCGGCAGAA AACAAAATGC TGTTGAACCC GATGCTGATT
GCTCTGCGTT CGGCAGGAGT GAAACGCGTC GCATTCTGGG GAAAAGGCGA CATCCAACCT
GCCAGGCTTT CGCAGCCGGA CCGATGGCTG CGACACCGGC TGGCCGCGAC CGTGGACTGG
TGGTTCGCAT ACACACCACA GTCGGCGCAG AACCTGCGCA AGAACGGTGT GAGCTGCGGC
ATCACGCCGG TGGGAAACAC GATTGATACT GCGGAACTAC AACGAGAGTG CGACGCGATC
TGCTCGGCAT GCGTGCAGGA GGCGCGCGTC GGAATGGACA TCCGGCCGGG GCCAGTCGGC
ATTTATTGTG GCAATTTGAG CCGGAACAAG CACCTCGATT TTCTCTTCGC AGCCGCGAGG
CGGATTGCAC AGGAGATCCC CGAGTTTACC CTTCTCGTAG TTGGGAATGG TCCGCTACGG
AATCAGGTGG AACTGGTCGC CGTACATGAG CGATTCGTCC ATTATGTTGG CCCGCGGATA
GGTCGAGAAA AAGCGCTTTT GTTGAAGATG GCGGATGTTT TCCTCCTTCC GGGTGCAGTC
GGTCTGGCGA TCCTCGATGC GTTCGCGGCA CAACTTCCGC TGATTACAAC GGCACTTCCG
GACCACGGTC CGGAGATCAG CTATCTCACT CCGGGAAAGA ACGGCCTCAT CACGGCGCAC
GAGGAGCAGG CGTATGCGTC GGCTGTAGTT TCACTACTGA AGGCCCGGTC CCGACTTGTG
GCGATGAGCC AGGCCGCGCG GATCGGCGCG GAAGAGCACA CCATGGAAAC AATGGTCCGA
AACTTCCGTC GCGGCATCCT GGAATGCCTT GGATTGCCAG TGGTGGACTC GAAAATAGAG
TCGCGGCGCG AGTCTCTCGC CAAATTGCAG TTCGTTCCAA ATGGAAGAAC GTAA
 
Protein sequence
MRRVVFLAPY VKHYRLPFFD LLHAVLQSDG IEMRVLYGHP NSLHEARKDN AVPPDSYGRP 
VKSYWIADRF VYQAAWREIS EADLVITPAE NKMLLNPMLI ALRSAGVKRV AFWGKGDIQP
ARLSQPDRWL RHRLAATVDW WFAYTPQSAQ NLRKNGVSCG ITPVGNTIDT AELQRECDAI
CSACVQEARV GMDIRPGPVG IYCGNLSRNK HLDFLFAAAR RIAQEIPEFT LLVVGNGPLR
NQVELVAVHE RFVHYVGPRI GREKALLLKM ADVFLLPGAV GLAILDAFAA QLPLITTALP
DHGPEISYLT PGKNGLITAH EEQAYASAVV SLLKARSRLV AMSQAARIGA EEHTMETMVR
NFRRGILECL GLPVVDSKIE SRRESLAKLQ FVPNGRT