Gene Acid345_2849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2849 
Symbol 
ID4070368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3387584 
End bp3388582 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content56% 
IMG OID637984867 
Productglycosyl transferase family protein 
Protein accessionYP_591924 
Protein GI94969876 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.659984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCA TCTCCGTCAT GACGCCTTGC TACAACGAAG AAGGCAACGT GCAGGAAGTG 
TACCAGCGAG TGCGGGCCGC GATTGCGGGC CTCGGGCCGG GATACATTTA CGAGCACGTG
TTCATTGACA ATGCGTCGCG CGACAACACA TGGGCGGAGC TTCGCAAACT GGCGGCAGCC
GACAAGAACG TCAAAATTAT TCGCAATACG AGGAATTTCG GTCACATTCG CTCGCCCATG
CACGCATTCC ATCAGTGCAG CGGCGATTGC GTGATCGGGC TCGTTGCCGA TCTGCAGGAC
CCGCCGGAGA TGATTCCGCA AATGGTGGCC AAGTGGGAGG AGGGCTTCCC CGTCGTTGTG
TGCGTGAAAA CCGGCAGCGA CGAGCACGGC CTCATGTATT GGATCCGGAC GAAGTACTAT
CGGCTCGTGA ACCGCCTCTC TGGCGTGGAG ACTTACGAGA ACTTCACGGG CTTTGGGCTC
TACGACCGCA GAGTCGTGGA TGCAATTAAG AGTATGCGCG ATCCCTATCC GTATTTCCGC
GGGCTCGTGG CGGAAATCGG ATACCCGCAC TACTCGATCG AGTTTCACCA GCCGCTGCGG
CGGCGGGGCA TCACCAAGAA CAACTTCTAC AGCCTCTACG ACAATGCCAT GCTCGGCATC
ACGAACCTGT CGAAGGTGCC GCTGCGACTG GTGAGTTTTG CAGGCTTCTT AGGGGCGTTG
CTTAGCGTGT GCCTTGGCTT TGCATATCTC ATCTACAAGC TGGTTTTCTG GAAGAACTTC
TCCGTCGGAA TTGCGCCGCT GGTGATCGGT ATGTTCTTTC TGGCATCAAT CCAGCTGGTA
TCGCTGGGAA TCATCGGCGA GTACATTGGG CAAATCCATA CCCAGATTCA AGATCGCCCG
TTTGTTTTTG AGCAGGAACG CGTGAACTTC GAGTATCCGC CCGGAGAACC GCTCATATCG
GCGCTAACGG AGATTGCGAA CGAGGAACGG AAGGCGTGA
 
Protein sequence
MKSISVMTPC YNEEGNVQEV YQRVRAAIAG LGPGYIYEHV FIDNASRDNT WAELRKLAAA 
DKNVKIIRNT RNFGHIRSPM HAFHQCSGDC VIGLVADLQD PPEMIPQMVA KWEEGFPVVV
CVKTGSDEHG LMYWIRTKYY RLVNRLSGVE TYENFTGFGL YDRRVVDAIK SMRDPYPYFR
GLVAEIGYPH YSIEFHQPLR RRGITKNNFY SLYDNAMLGI TNLSKVPLRL VSFAGFLGAL
LSVCLGFAYL IYKLVFWKNF SVGIAPLVIG MFFLASIQLV SLGIIGEYIG QIHTQIQDRP
FVFEQERVNF EYPPGEPLIS ALTEIANEER KA