Gene Acid345_1265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1265 
Symbol 
ID4073235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1536522 
End bp1537652 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content58% 
IMG OID637983274 
Productglycosyl transferase, group 1 
Protein accessionYP_590341 
Protein GI94968293 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGTCG CGATCGACAT CCGCCGTATC AGCGACTTCG GTGTCGGCAC CTACATTCGC 
AATGTGGTGC GGACCCTGGG CCGGCTGGAC CGCGAAAACG AGTATCTCCT GCTCGGGACG
CCTGGTCGTA TTCACGATAT GGGGCAGCTG CAGGAGAACT TTTCGCACCT TGAGTGCCCT
GATAACGACT ACTCTCCGGC GTCTTATTTC GAATTCCATC GCGCGCTGAA GCGGCAAAAA
GTAAATGTGC TGCACGTCCC GCACCTGTTC TGGATCCCGC AAGGAATTCC GTGTCCGTAC
GTGGTGACGG TCCACGATCT TCTCGACTAC CTCTACCGCA GCAACAGCGC TTCGCCGGCG
AAGCGGTTTG CGCACTTTCA CTTCACGAAG CGTGTGCTGA ACAAGGCTTC GCGGATCTTC
GCGGTGTCGA AGTTCTCGAA GGAGGATACG GTGCGGCTTT TCGGCGTGCC GGAAGAGAAG
ATCGAGGTGG TCTATAACGC AATTGACGAC CGCTTCCGCC AGGGCCACAC CACCGACTCC
GACAAGTTGA TGATCGCCGA GCGTTACCAG GTGAACTATC CGTTCATCCT GTATGCGGGG
CGGATCAGTC CGCATAAAAA CGTGGTGCGC ATCATCGAAG CGTTTTCACT GTTGAAGTCG
GAGCTGGCGA AGGAAGACTC GTATCCCGAC CTGAAGCTGA TCATTATTGG CGATGAAGTC
TCGCGGCATC CTGATCTTCG GCGCGCGGTA ATCAAGGGTA GAGTCCAGCA GGACGTCCGC
TTCCTCGGGT TCGTGCCGAT CGAAGTGCTG CGAATCTTCT ACGACGCCGC CAAGGTGTTC
ATTTTCCCGT CGCTTTACGA GGGATTTGGG CTGCCGCCGC TGGAGGCGAT GTCGCACGGG
ACGCCGGTGA TCACAAGCAA TACCTCGTCA CTGCCGGAAG TAGTGGGGAA TGCTGCGGTG
CTGGTAAATC CGGAGAACGT CTTCGAGATC CAGCGCGCCC TGCAGCGCGT GCTGCTCGAC
CAGCCGTTGC GCGAGAAGCT CAAACTACGA GGCGAAGAAC AGATCAGGAA GTTCTCGTGG
GAGAACTCAG TGGGGCGAAT GCTGGAGATC TTCCGGCAGG TCGCCAAGTA G
 
Protein sequence
MKVAIDIRRI SDFGVGTYIR NVVRTLGRLD RENEYLLLGT PGRIHDMGQL QENFSHLECP 
DNDYSPASYF EFHRALKRQK VNVLHVPHLF WIPQGIPCPY VVTVHDLLDY LYRSNSASPA
KRFAHFHFTK RVLNKASRIF AVSKFSKEDT VRLFGVPEEK IEVVYNAIDD RFRQGHTTDS
DKLMIAERYQ VNYPFILYAG RISPHKNVVR IIEAFSLLKS ELAKEDSYPD LKLIIIGDEV
SRHPDLRRAV IKGRVQQDVR FLGFVPIEVL RIFYDAAKVF IFPSLYEGFG LPPLEAMSHG
TPVITSNTSS LPEVVGNAAV LVNPENVFEI QRALQRVLLD QPLREKLKLR GEEQIRKFSW
ENSVGRMLEI FRQVAK