Gene Acid345_0753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0753 
Symbol 
ID4068629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp929268 
End bp930848 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content58% 
IMG OID637982759 
ProductAlpha-L-arabinofuranosidase 
Protein accessionYP_589832 
Protein GI94967784 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.406878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCCTGC GCACAATTCT CGTTGTAGTG ACTCTTTTCG GTCTTCTTTC TAACGCAACG 
ATTGCGGCTG CCCAAAAGGT GCAGGTGTCG ATTGATCCAT CCCGGCCTGG GGTGAAGATC
GATCGTAATC TGTTCGGGCA ATTTGCTGAG CATCTAGGAC ATGGCATTTA CGAAGGAATC
TGGGTCGGCA GCGACTCCAC GATTCCGAAC ACCCGGGGCA TTCGCAACGA TGTGGTTGCC
GCACTGAAAG CGATCCATGT GCCGAATGTG CGCTGGCCCG GCGGGTGCTT CGCCGACGAG
TATCACTGGC GCGACGGGAT CGGGCCGCAA AGAGTGGTGA GGCTGAACCC GAACTGGGGC
GGCGTGATCG AACCCAACAC CTTCGGCACC CACGAGTTCA TGGACTTCAT TGGGCAGATC
GGGAGCGAGG CGTACGTGTC GGTCAACGTG GGCTCTGGTA CTCCGCAGGA AGCGTCAGAT
TGGCTGGAGT ACATGACGGC AGCTCAGGCA ACGACGCTCC AGAAGGAGCG CGCTGCGAAC
GGGCATCCGG CACCGTACAA GATCGCATTG CTGGGCCTCG GCAATGAAAG CTGGGATTGC
GGCGGCAACA TGACGCCCGA TTACTACCTG GACCGGATGA AGGTCTTCAG CCGATTCGTT
CGCAACTACA ATCCGGCGCA GACGGACAAG AACCAGATGT TGAAGATCGC AGTCGGTCCG
GGCGGAGGCG AAGAGCGCTG GACGGAGTGG ACCGATACGG TGATGAAGGC TTACCAGAAG
CACACGTGGA GCTGGGACAT CAACGGCCTC TCGATGCACA GTTACACGAC GGTGAAATGG
CCGCCGGCGT ACAAGTCCGT GGGGTTCGGA GAGGACGAGT ACGCGCAGAT TCTGAAATCG
ACGCTGGAGA TGGAAGACCT GGTCAAGAAG CATTCCGCGA TCATGGACAA GTACGATCCG
GAAAAGAAGG TCGCCCTCAT CGTGGACGAA TGGGGCAGTT GGTATGCGCC CTTGCCGGGG
AGCAATCCGG GCTTTCTCGT ACAGCAAAAC AGCATTCGCG ATGCGATCCT GGCCGCGCTG
AACATCAACA TCTTTGCTCG CCACAGTGAT CGGGTGCGCG GCGCGAACAT TGCCCAGATG
ATCAACGTGC TGCAGGCGAT GATCATCACC GATAAAGAGA AGATGGTGCT GACACCGACC
TACTATGTTT ACAAGATGTA CCTGCCCTTC CAGGATGCGA CTTTCGTTCC GGTGACATTT
GACGCGGGCA CCTACAAGCA CGGCGACAGC ACGCTGCCGC GCATCGATGC GCTCGCTGCG
AGAGGAAAAG ACGGCAAACT GTGGCTGGAG ATCACGAATG TGGACCCGAA CCAGACGGCG
GATGTGGAGT TGAATGTGAC TGGGTTTGCT ACGAAGTCTG CGTCGGGAGA AACGCTCGCC
GGACCGAAGG TCGACAGCGT GAATACGTTC GAGGCACCGA ACACGGTTGT GCCGAAACCC
ACATCGGCCC GCGTAGAGGG TGGAAAGGTG ATGCTTAAGT TGGAGCCCAA GTCCGTCACG
GTGGTGTCAC TGGAGCAATA G
 
Protein sequence
MFLRTILVVV TLFGLLSNAT IAAAQKVQVS IDPSRPGVKI DRNLFGQFAE HLGHGIYEGI 
WVGSDSTIPN TRGIRNDVVA ALKAIHVPNV RWPGGCFADE YHWRDGIGPQ RVVRLNPNWG
GVIEPNTFGT HEFMDFIGQI GSEAYVSVNV GSGTPQEASD WLEYMTAAQA TTLQKERAAN
GHPAPYKIAL LGLGNESWDC GGNMTPDYYL DRMKVFSRFV RNYNPAQTDK NQMLKIAVGP
GGGEERWTEW TDTVMKAYQK HTWSWDINGL SMHSYTTVKW PPAYKSVGFG EDEYAQILKS
TLEMEDLVKK HSAIMDKYDP EKKVALIVDE WGSWYAPLPG SNPGFLVQQN SIRDAILAAL
NINIFARHSD RVRGANIAQM INVLQAMIIT DKEKMVLTPT YYVYKMYLPF QDATFVPVTF
DAGTYKHGDS TLPRIDALAA RGKDGKLWLE ITNVDPNQTA DVELNVTGFA TKSASGETLA
GPKVDSVNTF EAPNTVVPKP TSARVEGGKV MLKLEPKSVT VVSLEQ