Gene Acid345_0891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0891 
Symbol 
ID4069141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1110938 
End bp1112119 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content54% 
IMG OID637982898 
ProductO-antigen polymerase 
Protein accessionYP_589968 
Protein GI94967920 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.335212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGCGG CATCGCTCCA AATTCGCGCG GCAATTACAA ACGCAAGTCC TGTCGCGTTT 
TTGTTCGGCT GGTTCTTGTC AGCTCGCATA GCGTTGACAC TGCTTGCATT CCAGGCGAAT
CCCGCGTCCG GTAGCGCGGC TGAAATTGGC GTGCTGTTCC TTTTCGTTTT TCTAGCTTGG
ACCTTTACAA GCAGTCAACA GCAATCCCTA GATGGCGCAA CACCGCTTCG ATGGATCTGC
GCGTATCTTG CAATGACTGG GGTCAGCCTC TTCTGGTCGG TCACAGACTC CGTGGTGGTG
GGGTTAGCGT ATTGGGCAGG CCTCGCAGCG GAGTGTTTCG TAATTTACCT GATCATGAAC
TCAGGGGATG CGAACGAAAA CTGTGAGCGA ATCCTCTTCG GGTTCGTCGG AGGCGCAGCA
TTCGTCGGGC TTATTGCCTG GTACCTCCCT ACTCTCTCGG ACCTCCGAAT CGGAGATGAA
GATTTTCTAC ATCCCAATGC CTTGGGGTAC GTCCTTGCTC TCGCAACACT GTGCGGGATG
CATCTTGCCC GCAAGTCGCG GCTAGCGGGA CTGCTTGCGG TGTTTTGCGG AATTACGCTC
TGGAGGACAA TCAGCAAAGC GTGCATCGCA GGCTTCATTG CCTCTGCGGC GTTTTACCTC
TTGAGAGCGT CGCACTTGAG CCGCCGAGCC AAGATCGCCA TCTATGCGAT AGCGGCGAGC
AGCATCGTCT TTGGATGGAG TTTGGTTGAA GCCTATGTCG ATATGTATGA CCAAGGCAGC
CACATCGAGA CACTTACCGG ACGGACCACC ATCTGGAGCA TCGCCTGGGA AGAGGGAATT
AAAACACCGT GGCTGGGCCA TGGTTTCTAT TCTTTTCGCT TCGTCGTTCC AATGCTCGGC
GACTTTTTCC CTTGGCAGGC ACACAACGAG CTTCTTCAAC AATTGTTTTG TTACGGAGTC
GTGGGCTTGG CAGTGTTCGC CGTTTTATAC GTGTCCTTCG CCCGCTTTCT GTACGTCCAT
CGAGGCCACG AATGGTTCTC GCTCGTGGTG GCGATATTCG TATTCGTGTT GGTTCGAGGC
ATCGCCGATA CTGAGCGCTT TGATCTCAAC TTTCCCCTGT GGCTGTTGAC TCTGTTCACG
ATGGTTATTG CCCGGACGCA ACAGGAACGA GTGACAGCAT GA
 
Protein sequence
MSAASLQIRA AITNASPVAF LFGWFLSARI ALTLLAFQAN PASGSAAEIG VLFLFVFLAW 
TFTSSQQQSL DGATPLRWIC AYLAMTGVSL FWSVTDSVVV GLAYWAGLAA ECFVIYLIMN
SGDANENCER ILFGFVGGAA FVGLIAWYLP TLSDLRIGDE DFLHPNALGY VLALATLCGM
HLARKSRLAG LLAVFCGITL WRTISKACIA GFIASAAFYL LRASHLSRRA KIAIYAIAAS
SIVFGWSLVE AYVDMYDQGS HIETLTGRTT IWSIAWEEGI KTPWLGHGFY SFRFVVPMLG
DFFPWQAHNE LLQQLFCYGV VGLAVFAVLY VSFARFLYVH RGHEWFSLVV AIFVFVLVRG
IADTERFDLN FPLWLLTLFT MVIARTQQER VTA