Gene Acid345_0903 details

Gene Information

Locus tag: Acid345_0903
Symbol:
ID: 4069114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1131222
End bp: 1132379
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 56%
IMG OID: 637982910
Product: xylose isomerase
Protein accession: YP_589980
Protein GI: 94967932
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2115] Xylose isomerase
TIGRFAM ID: [TIGR02631] xylose isomerase, Arthrobacter type
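
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1131222, 1132379)
print(length, protein_length_aa(length))  # prints: 1158 385
```

Under these assumptions the computed values agree with the listed gene length of 1158 bp and protein length of 385 aa.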


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.122683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATG CTTACCAGCC GCGACCGGAA CACAAGTTCT CGTTCGGCCT TTGGACGATC 
GCGAACCGCG GCCGCGATCC TTTCGGCGAT GCAGTGCGCC CCACTATCCC TCCGAACGAC
ATTGTCGCTT TGCTCCGCGA GGTAGGCGCA TGGGGGGTCA ATCTTCACGA TAACGATCTT
GTTCCAATTG ACGCGACGCC ATCCGAGCGC GACAAGATCG TCCGCGATTT TCAGGCGGCC
TGCAGGCAGC ACGGCATCGT CGTGCCGATG GCGACGGTCA ACTTGTTCTT CGATCCAATC
TTCAAAGATG GAGCTTTCAC CGCAAACGAC GCCGATGTGC GTGCGTACGC TCTTCAGAAG
ACGATGCGCG CTATGGATCT CGGCGCCGAG TTAGGCGCTA AGCTCTTTGT GCTCTGGGGT
GGGCGCGAAG GAACTGAGAC TGATGCGTGC CGCCGTCCCG AAGAACCCTT CAAGCGGTTG
CGCGAAGCCA TCGATTATTT GTGCGAATAC AATCTCGACA AAAAGTATGG TTTCAAATTT
GCGTTGGAGG CCAAGCCAAA CGAACCTCGC GGCGACATAT ACATGCCGAC GACTGGTGCC
TATCTCGGTT TCATCCCAAC CCTTGCGCAT CCGGAGATGG TTGGTGTAAA TCCTGAGGTC
GCGCACGAGC ACATGGCGGG ATTGAACGCG CTTCACGCGG TTGCGCAAGC ATGGGAAGCG
GGCAAACTCT TCCACATCGA TCTAAACGAT CAGAACCCTG GGCGCTATGA CCAGGATTTT
CGTTTTGCAT CTGCAACCCC AAAATCAATG TTCTGGTTGG TGAAGTTCCT TGAAGACTCG
GGGTATCAAG GGCCGCGCCA CTTTGACGCG CACGCTTACA GGACAGAAGA CATCGCCGGC
GTAAAGGATT TTGCGCGCGG ATGCATGCGA AGCTACCTGA TCCTGAAGGA AAAGGCGCAG
CGCTGGAATG CCGACAAGGA GATCCAGCAA ATCTTCTCCG AGATCAACCC GCAAACCACC
GGCAGCTCGA AATATTCACA CGATGGCGCT CTGTCTCTTC TCAACCGCAC CTATGATCGC
GCAGCCATTG CGAAGCGCGG CCTGCAATAC GAGCGCCTCG ATCAGCTGAC TATGGAACTG
TTGTGGGGAA TACGGTAA
 
Protein sequence
MSDAYQPRPE HKFSFGLWTI ANRGRDPFGD AVRPTIPPND IVALLREVGA WGVNLHDNDL 
VPIDATPSER DKIVRDFQAA CRQHGIVVPM ATVNLFFDPI FKDGAFTAND ADVRAYALQK
TMRAMDLGAE LGAKLFVLWG GREGTETDAC RRPEEPFKRL REAIDYLCEY NLDKKYGFKF
ALEAKPNEPR GDIYMPTTGA YLGFIPTLAH PEMVGVNPEV AHEHMAGLNA LHAVAQAWEA
GKLFHIDLND QNPGRYDQDF RFASATPKSM FWLVKFLEDS GYQGPRHFDA HAYRTEDIAG
VKDFARGCMR SYLILKEKAQ RWNADKEIQQ IFSEINPQTT GSSKYSHDGA LSLLNRTYDR
AAIAKRGLQY ERLDQLTMEL LWGIR
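
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch only, under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product("TCAG", repeat=3)), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTGATG CTTACCAGCC GCGACCGGAA"))  # prints: MSDAYQPRPE
```

Passing the full gene sequence to `translate` should give a string comparable to the protein sequence above, and `gc_fraction` can be compared against the listed GC content.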