Gene Acid345_2286 details


Gene Information

Locus tag: Acid345_2286
Symbol:
ID: 4073280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2708690
End bp: 2709667
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 59%
IMG OID: 637984302
Product: glycosyl transferase family protein
Protein accession: YP_591361
Protein GI: 94969313
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
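
The length fields above follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 978 bp) and that the protein length excludes the stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2708690
END_BP = 2709667


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 978 325
```

Both results match the Gene Length and Protein Length fields of this record.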


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.398997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAAGT ATTCGATCGT TGTACCTTTC CACAATGAAG AAGAGAACGT AACCGAGCTC 
TACGACCGCC TCAAGGTCGT GATGGAGACT GTCGGCGACA CGTTCGAACT GGTGTTCGTC
GACGACGGCA GTCGCGACTG CACCTTCAAA CTTCTTCAGC AGATCGCCGC GGTCGACAGC
CGCGTGGTGG TCGTTAAACT GCGTCGCAAC TTCGGCCAGA CATCGGCGCT TGCTGCCGGC
TTCCACAACG CGCAGGGCGA TTACGTAATT GCCATGGACG GCGACCTCCA GCACGATCCC
AACGACATCC CATTGTTCGT GGAAAAGGTG AACGAGGGCT TCGATATCGT CAGCGGCTGG
CGCAAAGTGC GCATTGATAA TTTCGTGCTG CGCCGCTTCC CGTCGCGGTG CGCCAACTGG
CTCATGGCCA AGCTCAGCGG CGTCAACATC CACGACTTCG GTACCACGTT CAAAGCGTAT
CGCCGCGACC TGCTGCACCT GGTGCCGCTC TACGGCGAGA TGCACCGCTT CATCCCGGCA
CTGGCGTCGT GGCACGGCGC AACCATCTGC GAAATTCCGA TCAAGAATGT GAACCGTGAG
CGCGGCGTCT CGCACTACGG CATCTCGCGC ACCTTCCGCG TTTTCTTCGA TCTGCTGACG
ATTCGATTCC TGCTCAAGTA TCTGTCGCGC CCGCTGCACT TCTTCGGGAC GGTCGGGATG
ACTGGTGTTA CCGCCGGTCT CGGCATTGCG CTGTGGATGA TGATCGACAA GCTCATCCAT
CACAGCGATG TCATGGCAGC ACACGGCCCA CTCATGCTTT TTGCTGCGGT GCTGATCGTT
GCCGGCGTGC AGTTGGTCGC TCTCGGTTTG CTCGGCGAGT TACAGGTACG CCACTACTAC
GACCCGACAG AACGCACACC GTATTCGGTG GAGCGCGTGC TGCGGTCGCA GGAAGAACAG
TCGCACTTAA CGGAATAG
 
Protein sequence
MPKYSIVVPF HNEEENVTEL YDRLKVVMET VGDTFELVFV DDGSRDCTFK LLQQIAAVDS 
RVVVVKLRRN FGQTSALAAG FHNAQGDYVI AMDGDLQHDP NDIPLFVEKV NEGFDIVSGW
RKVRIDNFVL RRFPSRCANW LMAKLSGVNI HDFGTTFKAY RRDLLHLVPL YGEMHRFIPA
LASWHGATIC EIPIKNVNRE RGVSHYGISR TFRVFFDLLT IRFLLKYLSR PLHFFGTVGM
TGVTAGLGIA LWMMIDKLIH HSDVMAAHGP LMLFAAVLIV AGVQLVALGL LGELQVRHYY
DPTERTPYSV ERVLRSQEEQ SHLTE
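
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial code named in the record), GTG is an accepted start codon and an initiating codon is read as methionine. The following is a minimal sketch of that translation and of a GC percentage calculation, not the database's own pipeline. The helper names are hypothetical, and the start codon set is the one usually listed for table 11.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs mainly in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons commonly listed for table 11 (assumption stated in the lead-in).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as M if it is a start."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_30 = "GTGCCCAAGT ATTCGATCGT TGTACCTTTC"
print(translate_cds(first_30))  # MPKYSIVVPF
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` can be used to compare against the protein sequence and GC content fields of the record.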