Gene Acid345_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1835 
Symbol 
ID4072896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2215682 
End bp2216869 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content61% 
IMG OID637983844 
Productglycosyl transferase family protein 
Protein accessionYP_590910 
Protein GI94968862 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTGGT GGCACGCGCA TTGGGAGGCG CTCGTCGCCT TCGCGATTGG CTTGGTATGG 
CTCTCGCGTC TGCTTGCCGC GTCGCGTGGC ATGCCGAAGC TCGCAGAGAT CTCTCGTCCA
GAGTGGGACC TGCGGCCTGA TCCCGCGCCG CGTGTGAGTA TCGTGGTTTG CGCGCTGAAC
GAGGAAGGGA AGATTGAGCC GGCGCTACGC TCGCTGCTGG AACTGGATTA TCCCGATTAC
GAAGTTGTGG CGGTGGACGA CCGCTCGACG GATCGCACCG GCGAAATCAT GGACCGCATC
GCCGAGGAGT ATCGCGCCAA CGCGCATCAT CACTTGCGCG TCGTTCACGT CACCGAGTTG
CCGCCGGGCT GGCTCGGCAA GGTTCACGCT ATGTGGAGTG CGACCCGCGT CGCGGATGGC
GACTGGATCC TGTTTACCGA TGCCGACGTG GTCTTTCAGA AGGAAACTCT ACGCCGCGCG
ATCGCTTACG CAGAACGCGA GCGCGCCGAT CATGTCGTCC TGTTTCCGAC GATGCTGATG
TACACTTGGG ACGAGCGCAT GATGATCGCA TTCTTCCAGG CGATGTTCGT GTTTGGGCAT
CGTCCATGGA AGACCGCCGA CCCGAAGTCG CGCGACCACA TGGGCGTCGG CGCGTTTAAC
CTCATCCGCC GCAGCGTCTA CGAGAAGATC GGTACCTACG CGCGGATGAA GATGGCCGTG
GTGGACGACA TGAAGCTCGG CGAGATCGTA AAGAAGGAAG GTTACGCGCA ACGCAACGTC
TTCGGCCGCG ACCTCATTCA GTTACATTGG CATTCCGGAG CGCTCGGAGT CGTGCGCGGG
CTAACCAAGA ACTTCTTCGC CATCCTGCGC TTCAATCCTT TCCTTACGCT TGGCGTGATC
CTCGGCATGC TGCTCTTCAA TCTCACGCCG TTTGTCGGGG TCTTCCTGAC GCACGGATGG
GCGCGCGCGG GGTACGCGCT GGCCTTGGCT TCGATTGCCG GCATTTACTA CGGAATGTCG
GACCGCTCCA CGATTCCGTG GTATTACGTG GTGCTGCATC CAGTCAGCAC GGTGCTCTTC
GCGTACACCG TGGGGAGATC GATGGTGGTC ACGCTGGCGC AGGATGGGAT TACATGGCGC
GGAACGCATT ACTCGCTCAA CGAGCTGCGG AAGGGCGTAG ACGTTTAG
 
Protein sequence
MIWWHAHWEA LVAFAIGLVW LSRLLAASRG MPKLAEISRP EWDLRPDPAP RVSIVVCALN 
EEGKIEPALR SLLELDYPDY EVVAVDDRST DRTGEIMDRI AEEYRANAHH HLRVVHVTEL
PPGWLGKVHA MWSATRVADG DWILFTDADV VFQKETLRRA IAYAERERAD HVVLFPTMLM
YTWDERMMIA FFQAMFVFGH RPWKTADPKS RDHMGVGAFN LIRRSVYEKI GTYARMKMAV
VDDMKLGEIV KKEGYAQRNV FGRDLIQLHW HSGALGVVRG LTKNFFAILR FNPFLTLGVI
LGMLLFNLTP FVGVFLTHGW ARAGYALALA SIAGIYYGMS DRSTIPWYYV VLHPVSTVLF
AYTVGRSMVV TLAQDGITWR GTHYSLNELR KGVDV