Gene Acid345_3823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3823 
Symbol 
ID4071107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4519284 
End bp4520369 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content54% 
IMG OID637985846 
Productglycosyl transferase, group 1 
Protein accessionYP_592897 
Protein GI94970849 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.372091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.833222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTTCCG ACATCGAGCA TGAGCCGATT GGAAACGTGC GGTTTGTCAC GGGGCCCAAG 
TATGAACGCA AGACCGCAGC ACGACGATTC AAGACGTGGT TCAAGTATTG CTGGCAGGCG
ACAAGGCTCG CTTTCCGCAC CAAGGGTGAT CCGAAGCTGT TCATCGTGGC GCAACCGCCG
TTTCTCTCGT TGCTAGGCTA TTTGCAAAAG AAGTTGATGG GCCGCAGATA CTTTCTTTGG
ATTGACGATG TGTGGCCTGA CATCATTGTT GGACAAAAAA TGCGGGAAGG CTCTTCGTGG
GGCATTCGGC TCTGGGCTGG CTTTAACCGC GTGACTTTCA GGCATGCAGA GCACGTATTT
ACTCTTGGGC CATACATGAG AGACAAGGTC AGACAGTATG TGCCGGAGAA CATCCCGATA
ACTATCATTC CGACGTGGGT TGATATCGAT TCGATCCGGC CAATTCCGAA GGAGCAAAAT
CCGTTTGCCG CTGAACACGG ACTGGGCGAC AAATTGACAG TCCTCTATTC TGGGAACCTG
GGCTTGACCC ACGATATTCA GAGCATCCTT GAAGCAGCGC GCATTCTTCG TAATGAGGTG
TCTTTGCATT TCATGATCAT CGGCGCCGGG CCACAGTGGG ATTCAATCGA GCGATCGATC
AAGGAACATC AGGATGCGAA CGTGACGCTT CTGCCTCTGC AGCCGATTGA TGTTTTGCCG
TTCTCTCTGG CGACTGCTGA CATCGCGATT GCTTCGCTGG AACAGGGAAT TGAGGGAGTA
AGCATGCCGA GCAAGACCTA CTACAGCATG GCCGCAGGGT CGGCTATTGT CGGCATCTGC
GAGACGAACA GCGACTTGGC ACACGTGGTT CTTTCGAACC AATGCGGCGG AGTGGTTCGT
CCCAAGAGTC CGGAAGCTCT GGCTGAACTC ATTCTGCGAA TGGCCACAGA TCGGGAGCAG
CTAGGGCGAT TGCGAGAGAA CGCTCGCCAC GCGGCTGTGA ACTGTTATTC GCGGAGTGCA
AATACTCCGA AGTTGCGCGC GATTCTGGAA GGAAAAGTGG AACCGGTAGC ACAGGGCCAA
TCATGA
 
Protein sequence
MFSDIEHEPI GNVRFVTGPK YERKTAARRF KTWFKYCWQA TRLAFRTKGD PKLFIVAQPP 
FLSLLGYLQK KLMGRRYFLW IDDVWPDIIV GQKMREGSSW GIRLWAGFNR VTFRHAEHVF
TLGPYMRDKV RQYVPENIPI TIIPTWVDID SIRPIPKEQN PFAAEHGLGD KLTVLYSGNL
GLTHDIQSIL EAARILRNEV SLHFMIIGAG PQWDSIERSI KEHQDANVTL LPLQPIDVLP
FSLATADIAI ASLEQGIEGV SMPSKTYYSM AAGSAIVGIC ETNSDLAHVV LSNQCGGVVR
PKSPEALAEL ILRMATDREQ LGRLRENARH AAVNCYSRSA NTPKLRAILE GKVEPVAQGQ
S