Gene Acid345_0326 details

Gene Information

Locus tag: Acid345_0326
Symbol:
ID: 4070088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 353576
End bp: 355078
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 60%
IMG OID: 637982329
Product: L-arabinose isomerase
Protein accession: YP_589405
Protein GI: 94967357
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2160] L-arabinose isomerase
TIGRFAM ID:
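
The coordinates and lengths listed in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 353576
end_bp = 355078

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1503
print(protein_length)   # 500
```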


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.562727
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC TGAAGAAGTT CGAGGTCTGG TTTGTTACCG GTAGCCAGCA TCTCTACGGC 
CCGGAGACGC TGGAGAAGGT CGCGGAACAT TCGCGCGAGA TTGCGGGTGG ACTCGATGCC
ACGCCGCAGA TGCCTGTTCG CGTCGTTTTC AAGCCGGTGC TCACCACCGC AGACGCCGTC
CACGAACTCT GTCGCGAGGC CAACAACGCC GCTCACTGCA TCGGTCTCGT CACCTGGATG
CATACCTTCT CACCCGCCAA GATGTGGATT GCCGGGCTGA AGGCGCTGCA GAAACCATTC
CTACATCTCC ACACGCAATA CAACCGTGAG TTGCCGTGGG CCACCATCGA CATGGATTTC
ATGAACCTGA ACCAGGCCGC GCATGGCGAC CGTGAGTTCG GCTTCATCGG CAGCCGCATG
CGCCTCGACC GCAAGGTCGT GGTTGGCTTC TGGCAGGATC TCGAAGTTAT CTCCGAGCTT
GGCACCTGGG CGCGCGCTGC GGCGGGTTGG CACGATGCGC AACATTTGAA AGTCGCACGT
TTTGGCGACA ACATGCGAAA CGTTGCCGTG ACTGAAGGCG ACAAAGTGCA GGCGAAAATC
CAGCTCGCCT ACTCGGTAGA TGGCTTCGGT GTCGGCGATC TCGTGGCCCG CATTCACGCC
GCAAGCGACA GGGATGTAGA CCATCTAGTA TCAGAATACG AGGACACCTA CACCCTCTCC
GAGCCGCTGA CCGCGAAGGG CAAGCAACGC GCGTCTCTGC TCGACGCTGC ACGCATCGAG
CTTGGCCTGC GCCATTTCCT CAAAGACGGC AACTTCCACG CCTTCACCGA CACCTTCGAA
GACCTCCACG GCCTTAACCA ACTCCCGGGC ATCGCGGTGC AACGTCTGAT GGCGGACGGT
TACGGCTTCG GCGCTGAAGG CGATTGGAAG ACTGCCGCGC TGGTTCGCAC CATGAAAGTG
ATGGCCGCCG GACTCGATGC CGGTACGTCA TTCATGGAGG ACTACACCTA TCACCTTGAG
AATGGCGGGC TCGTACTCGG GGCTCACATG CTTGAGATTT GCCCCTCGAT CGCCAGCGGC
AAGCCTTCGT GCGAGATCCA TCCCCTCAGC ATCGGTGGCA AGGGCGATCC CGTGCGCCTT
GTCTTCGACT CGCAGACCGG TCCTGCCGTC GTGGCGACAA TCGTGGACGT CGGCGAGCGC
TTCCGGATGG TCATCAACAA AGTGAATGTC ATTCCGCCCG AGGTGCCTTT GCCCAAATTG
CCCGTAGCGC GCGCTGTCTG GATTCCTGAG CCGAACCTGG CCGTGGCCGC CGCATGCTGG
ATCTACGCCG GCGGCGCACA CCACACCGGC TTCAGCTTGT GCCTTACCGC CCAACATCTC
CAGGACTATG CCGAAATGGC GGGCATCGAG TGCGTGCTGA TCGACAACGA CACCACTGTT
CACGCTTGCA AGAACGAGTT GCGCTGGAAC GACGCTTATT ACCGCTTGAC GGGTTGGCGC
TGA
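
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with one common way to compute a GC percentage. The record does not say how its "GC content" value was calculated or rounded, so the helpers `clean_sequence` and `gc_percent` are hypothetical, not the database's method. The sample input is only the first line of the sequence.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and return an uppercase sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGATCGATC TGAAGAAGTT CGAGGTCTGG TTTGTTACCG GTAGCCAGCA TCTCTACGGC"
seq = clean_sequence(first_line)
print(len(seq))                  # 60 bases in this line
print(round(gc_percent(seq)))    # GC percentage of this line only
```

To check the whole gene, pass the full text of the sequence block to `clean_sequence` instead of the single line.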
 
Protein sequence
MIDLKKFEVW FVTGSQHLYG PETLEKVAEH SREIAGGLDA TPQMPVRVVF KPVLTTADAV 
HELCREANNA AHCIGLVTWM HTFSPAKMWI AGLKALQKPF LHLHTQYNRE LPWATIDMDF
MNLNQAAHGD REFGFIGSRM RLDRKVVVGF WQDLEVISEL GTWARAAAGW HDAQHLKVAR
FGDNMRNVAV TEGDKVQAKI QLAYSVDGFG VGDLVARIHA ASDRDVDHLV SEYEDTYTLS
EPLTAKGKQR ASLLDAARIE LGLRHFLKDG NFHAFTDTFE DLHGLNQLPG IAVQRLMADG
YGFGAEGDWK TAALVRTMKV MAAGLDAGTS FMEDYTYHLE NGGLVLGAHM LEICPSIASG
KPSCEIHPLS IGGKGDPVRL VFDSQTGPAV VATIVDVGER FRMVINKVNV IPPEVPLPKL
PVARAVWIPE PNLAVAAACW IYAGGAHHTG FSLCLTAQHL QDYAEMAGIE CVLIDNDTTV
HACKNELRWN DAYYRLTGWR
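
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the tool that generated this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ only in the permitted start codons). The function name `translate` is hypothetical, and the sample input is the first three blocks of the gene sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGATCGATC TGAAGAAGTT CGAGGTCTGG"
print(translate(first_blocks))  # MIDLKKFEVW
```

The result matches the first ten residues of the protein sequence shown above. Translating the full gene sequence the same way should stop at the terminal TGA codon.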