Gene Acid345_2085 details

Gene Information

Locus tag: Acid345_2085
Symbol:
ID: 4069684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2499246
End bp: 2500286
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 62%
IMG OID: 637984100
Product: glycosyl transferase, group 1
Protein accession: YP_591160
Protein GI: 94969112
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
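
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2499246
end_bp = 2500286
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, expected_protein_length)  # prints "1041 346"
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.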


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.057085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATTC TGTACGTCGC TTATCCGCTT CATCACGTGT CCGACGCCAG CGCCGGCGGC 
GCCGAGCAAA TGCTCTGGAC ACTCGAACGA GAAATGCACC TCCGCGGACA TGAAACGACA
GTCGCTGCTT GTGCCGGTTC GCGCGTCAAC GGGCGGCTTT TCTCGACCGG TGATATCCCA
ACTCAGTCCG ACACTTTTGA AGAGCGTAAT CGCGAGCACC ACGCCGCCAT TCGCAGCCTC
CTTGCATCGG AATCTTTCGA TCTTATCCAC GACAAGAGCG GCTCCTTCTT CGCCAGCGCG
GCGGATGTCG CCGTCCCGAT CCTCGCCACC GCACATCTTC CCCGCAGCTT TTACCCGGGC
GTAAACTGGC ACGTGCTCGG CCACAACATC AACGTCAATT GCGTCTCCGC GACTCAGGCC
CACACCTTCG CCGACGTGCC GAACCTCGTG GGATGGGTGC AGAACGGCAT CGCCATCGAC
CGATTCAAGT TCCGCGAACA GAAGGACGAC TATCTCCTCT GGCTCGGGCG CATCTGCGAA
GAGAAGGCCC CGCACCTCGC CATCGAAGCC GCCAAACGCA GCGGCAACCG ACTCATCCTT
GCCGGCCAGG TTTATCCCTT CACCTATCAC GAAGCGTATT TTGCGCGCGA AATTCAGCCG
CGCCTCGACG ATCAAATCAC ATTCATCGAC AGCCCCACTT TCGACGAAAA GCTCGACCTC
CTCTCCCGCG CCTCCGCTCT CCTCATCCCG AGCCAGGTCG ACGAAACCAG CTCCCTCGTG
GCGATGGAAG CTATGGCCTG TGGTACGCCG GTCATCACCT GGCGCCGCGG AGCCCTTCCG
GAGATCGTTG CCGACGGCGT CACCGGCTAC ATCGTCGATT CCCTCGAAGC CATGGTCAGC
GCTATTTCCG ACGTCAGCCG CATCCGCCCC GAGGCCTGCC GTGCCCGCGT GGAACAGCAC
TTCTCGGCCA GCCGCATGGC CGCAGACTAC GCCGCGGTTT ACCAGCGGGT TCTCGGACGA
AGCATTGGAG AAGCAGCCTG A
 
Protein sequence
MRILYVAYPL HHVSDASAGG AEQMLWTLER EMHLRGHETT VAACAGSRVN GRLFSTGDIP 
TQSDTFEERN REHHAAIRSL LASESFDLIH DKSGSFFASA ADVAVPILAT AHLPRSFYPG
VNWHVLGHNI NVNCVSATQA HTFADVPNLV GWVQNGIAID RFKFREQKDD YLLWLGRICE
EKAPHLAIEA AKRSGNRLIL AGQVYPFTYH EAYFAREIQP RLDDQITFID SPTFDEKLDL
LSRASALLIP SQVDETSSLV AMEAMACGTP VITWRRGALP EIVADGVTGY IVDSLEAMVS
AISDVSRIRP EACRARVEQH FSASRMAADY AAVYQRVLGR SIGEAA
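
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be stripped of whitespace, translated, and checked for GC content. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; special handling of alternative start codons is omitted, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGCGCATTC TGTACGTCGC TTATCCGCTT CATCACGTGT CCGACGCCAG CGCCGGCGGC"
print(translate(clean_sequence(first_line)))  # prints "MRILYVAYPLHHVSDASAGG"
```

Passing the full gene sequence through `clean_sequence` and then `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the cleaned sequence can be compared against the GC content field.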