Gene Acid345_4718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4718 
Symbol 
ID4070656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5581086 
End bp5582417 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content61% 
IMG OID637986762 
Productthree-deoxy-D-manno-octulosonic-acid transferase-like 
Protein accessionYP_593791 
Protein GI94971743 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.647141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACTGC GGCTTAGGGT TAGTGGTAAG TACAATGGCT GCGTGTACTT CGTCTACAGC 
CTGCTGCTGG GTCTGCTGAT GCTGTTGGCC ACTCCGTGGT GGCTGCTGCA AATGGCCCGG
CATGCGAAGT ACCGCGCCGG ACTGGCGCAA CGATTCGGCG CCGTACCTGC TCAACTCAAG
AACATTCACC AGCGCGTGAT CTGGGTGCAT GCGGTGAGCG TGGGAGAAGT GCTCGCCGTG
AGCACGCTGG TGCGCCAACT GCGCGAACGC CATCTGAACC ATCGGGTACT CATCTCCACG
ACAACAGCGA CCGGCAATCA ATTGGCCCGG GATCGCTTCG GCTTCGACAA CGTCTTCTAC
TTCCCTCTGG ATTTTGGATT CGCGATTCGT CCTTACCTGA AGGCGCTGCG TCCGGAGATG
GTGGTCGTGG CAGAGACGGA GTTCTGGCCG AACTTCCTGC GGTTGTCGGG AAATGCCGGC
GCAAAGATTG CCGTGGTAAA TGCGCGGATT TCCGATCGCT CGTTTCCGCG ATACCGGAGA
TGGAAGAGCG TCTTCGCACG CGTGCTGCGG CCGGTGGGCG TGTTCCTGGC ACAGAGCGAA
GAGGACGCGC GCAGGATCAT CGAGATCGGT GCCTCGAAGG ACTGCGTACA TGTGGGCGGC
AACCTGAAGT TCGAGGTGAA TGCCACGGCG AATGCAGAAA TTGTGCATCG TGTGCGTGAA
GGCTTCGCGG ACGGCGGATC ACAGCCGCTG ATCATCGCAG GCAGTACGGT GGAAGGCGAA
GAGCCCATGC TGCTGCATGC TTTCGCCGAG GTGCTGAAGG AATATCCGCG GATGGCGGTG
ATCTTGGCAC CACGGCATCG GGAGCGTTTT GCGGCGGTGG CTAAGCTGGT GGCGGACTCG
CCGTTTGAGC TAGTGCGGCG TTCAGATTGG GCCGATGAGC CGCTGCCGCC GGGAACGGTG
CTGCTGCTCG ACAGCATCGG CGAACTCGCT TCTCTGTATG CGCTGGCAGA TGTCGCATTC
GTTGGCGGAA GCCTAGTACG CAAGGGCGGA CACAACATTC TGGAGCCGGC GCAACACGGC
GTGCCGATCG TGATCGGGCC GCATTACGAG AATTTTCGCG ACATCATTGC GATTTTCCAA
CGCGCCGACG CGGTGCGAAT CGTGGAAGCG CCGCAACTTG GCAGTGAGTT TATTCGGTTG
CTCAAAGATC ATGCGGAGCG ATCGGGGCCG TCGTCGTTGG GCCAGCGAGC GGCACAGGTG
ATGCGGGCGC AGGCGGGCGC AACCGAGCGG ACGATGCAGG CACTGGAAAA TTTTCTGGCG
GGTGGGCGAT GA
 
Protein sequence
MTLRLRVSGK YNGCVYFVYS LLLGLLMLLA TPWWLLQMAR HAKYRAGLAQ RFGAVPAQLK 
NIHQRVIWVH AVSVGEVLAV STLVRQLRER HLNHRVLIST TTATGNQLAR DRFGFDNVFY
FPLDFGFAIR PYLKALRPEM VVVAETEFWP NFLRLSGNAG AKIAVVNARI SDRSFPRYRR
WKSVFARVLR PVGVFLAQSE EDARRIIEIG ASKDCVHVGG NLKFEVNATA NAEIVHRVRE
GFADGGSQPL IIAGSTVEGE EPMLLHAFAE VLKEYPRMAV ILAPRHRERF AAVAKLVADS
PFELVRRSDW ADEPLPPGTV LLLDSIGELA SLYALADVAF VGGSLVRKGG HNILEPAQHG
VPIVIGPHYE NFRDIIAIFQ RADAVRIVEA PQLGSEFIRL LKDHAERSGP SSLGQRAAQV
MRAQAGATER TMQALENFLA GGR