Gene Acid345_3362 details

Gene Information

Locus tag: Acid345_3362
Symbol:
ID: 4071280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3988710
End bp: 3989978
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 57%
IMG OID: 637985384
Product: N-glycosyltransferase
Protein accession: YP_592437
Protein GI: 94970389
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
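
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the record above.
start_bp = 3988710
end_bp = 3989978

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per amino acid, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1269
print(protein_length)  # 422
```

Both values agree with the Gene Length and Protein Length fields.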


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCCCGC TGGCTTACGA ATCGCTTCTC AAAGCCTTGC GCGCGGTAGA CCACACCGTG 
GTGTACGTGT ATGCGCTGCG GTTTTACGGC CTCTATCCGA TCCTGATGAG TTGGGTGTGG
ATCTCGCTGT CGCTTTTCTT CCGTCGTCGA CAGGAAGATA CCGAAATGGA AATGTCGGGC
CCTGCCCCGA TGGTCTCGAT TCTCGTACCC GCGTTTGCTG AAGCGGAGAC GATCGACGAC
ACCATTGAAG CGCTTCTGAA GCTCGATTAT CCGAACTACG AAGTCATCCT CGTGAACGAT
TGCTCACCGG ACAACACCGC CGAAGTCGTT CGCCAATATC TCGACGATCC GCGCATCAGG
CTATTGAACA AGCAGGTGAA CGAAGGCAAG GCCATGGCTT TGAACGATGC GTTGCCGATG
TGCCGCGGCG AGATTCTTGT GGTGATTGAC GCCGACATCA TCGTGTCGCG CGATCTTCTG
AATTACATGG TGCCGCACTT TGCCGGCACG CGCGTGGCAG CCGTGACCGG CAATCCGCGG
GTACGCAACC GGGTCTCGAT CCTGCAGCAC CTGCAGGCGG TGGAATTCTC TTCGATCGTC
TCAATGCAGC GCCGTGCGCA ACGCGTATTG GGCCGCGTGT TGACCGTGTC TGGCGCGGTT
TTCGCGGTTC GCCGCAGCGC TTTACTCGAG CTTGGTGGGT TCACACCGCA CATGGCGACC
GAAGACATCG ACCTGACCTG GCGTTTGCAG ATGAAATTCT GGGATGTCCG TTACGAACCG
CGCGCCGTGG TGTGGATGCA GGTGCCGCTC AGCTTGCGCG AGTTGTGGAA GCAGCGAAAG
CGTTGGGCGC GCGGGCTCGT CCAGGTGCTC AAGCGCCATC GCGAAGTACC GACCAACTGG
AAGATGCGTC GCATGTGGCC CATCTTTTAC GAATCGATCT TCTCGATCCT GTGGTCGTAC
GTCTTCGTGC TGATGACCTC GTACTGGCTG ATTTCCTTGG CAGTTGGCTA CGCGCCACGA
GGCGTATCGC CGTTCCCAAA TTTCTGGGGA ATGATGATCG CTACGACCTG TCTTTTGCAG
CTATTCATTG GCGCGTGGGT TGACCGGCAG TACGACCCGG GAATTATGTG GTCGTTTCCG
GAAGCAGTTT TCTATCCGGT CATTTATTGG ATGTTGATGG CACTGATTAC TTCGTTCTAC
ACGATTCCGG CGTTGTTCAA GAAACCGCCG AGAGTACAGA CGTGGCGAAT TCGGCGGGGT
CCTGCATGA
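
The GC content field can be recomputed from the gene sequence above. A minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown here (`gc_percent` is a hypothetical helper name, not something the database provides):

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Example with the first block of the gene sequence.
print(gc_percent("TTGATCCCGC"))
```

Passing the full gene sequence to the function gives a value to compare with the GC content field in the record.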
 
Protein sequence
MIPLAYESLL KALRAVDHTV VYVYALRFYG LYPILMSWVW ISLSLFFRRR QEDTEMEMSG 
PAPMVSILVP AFAEAETIDD TIEALLKLDY PNYEVILVND CSPDNTAEVV RQYLDDPRIR
LLNKQVNEGK AMALNDALPM CRGEILVVID ADIIVSRDLL NYMVPHFAGT RVAAVTGNPR
VRNRVSILQH LQAVEFSSIV SMQRRAQRVL GRVLTVSGAV FAVRRSALLE LGGFTPHMAT
EDIDLTWRLQ MKFWDVRYEP RAVVWMQVPL SLRELWKQRK RWARGLVQVL KRHREVPTNW
KMRRMWPIFY ESIFSILWSY VFVLMTSYWL ISLAVGYAPR GVSPFPNFWG MMIATTCLLQ
LFIGAWVDRQ YDPGIMWSFP EAVFYPVIYW MLMALITSFY TIPALFKKPP RVQTWRIRRG
PA
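
The protein sequence corresponds to the gene sequence translated with translation table 11. Note that the gene starts with TTG, which is read as methionine (M) when it serves as the start codon. A minimal sketch of that translation, assuming the standard NCBI layout of the codon table and the table 11 list of alternative start codons (`translate_cds` is a hypothetical helper name):

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 allows as translation starts; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGATCCCGC TGGCTTACGA ATCGCTTCTC AAAGCCTTGC GCGCGGTAGA CCACACCGTG"
print(translate_cds(first_line))  # MIPLAYESLLKALRAVDHTV
```

The output matches the first 20 residues of the protein sequence. The final nine bases, CCTGCATGA, give the closing PA followed by the TGA stop codon.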