Gene Acid345_0328 details

Gene Information

Locus tag: Acid345_0328
Symbol:
ID: 4070090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 355532
End bp: 357187
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 60%
IMG OID: 637982331
Product: Alpha-L-arabinofuranosidase
Protein accession: YP_589407
Protein GI: 94967359
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
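
The gene and protein lengths listed above follow from the start and end coordinates. The minimal Python sketch below recomputes them, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 355532
end_bp = 357187

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1656 551
```

Both values agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields.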


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.48322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.88573
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTC GCCGTGATTT CCTTCGTTCC ACTGCCATTG GCGCTGCTGG ATTGGCGCTG 
ACCCGTTTCT CGCTGCCGTC GCTCGCGCAA AGCCGCAGCG TCGATTCCCG GATCGAGGTG
CTGCTCGACG AACCGCTCGG GACCATCTCG CCGAACATTT ACGGTCACTT CACCGAGAAC
CTGGCCGGGG TGTTTTACGA CGGAATCTGG GTTGGCGAGA ATTCGAAGGT CCCGAACGTC
GGCGGGCTGC GTAAGGCCGT GATCGACCAC ATGCGCAAGA TCAAGGCCCC CGTGATTAGA
TTCCCAGGCG GATGCTTTGC CGACACCTAC GACTGGCGCG ACGGCATCGG CCCACGCGAA
AAGCGGCCGC GGCGCCCGAA CTTCTGGGGC AATGGCGACC CGAAGCAAAA CGTAAAGCAC
AAGTACGACC CCAATGAGGT TGGTACCGAC GAGTTCATGC ATTTCTGCCG CGAGATTGGC
GCACAGGGAT ATCTTGCAGC CAACGTGCGC AGCCTTCCCG CGGAGCAATT CCAGCAGTGG
GTGGATTATT GCAATTCGCC TGCGGGGAGC ACGACGCTGG CGGAAACGCG CGCGACGAAC
GGCTCGCGTG AGCCTTACAA GGTGGAGTTC TGGGGCGTGG GGAACGAGTC TTGGGGATGC
GGCGGCGACT TCGAGCCCGG CCAATATGCG ATGGAATTTC GCCGCTACGC CACGTGGGTG
CCAGGCTTCG GAATGAACTT GAAATTCATT GCCTCCGGGC CAAACGTGGA GGACTACCAC
TGGACCAGCG GCTTCTTCGA GGCGATGCAG AAGCGGATGG GCTCCCTGCA CATGGTCTAT
GGCTGGGCGC TCCACCATTA CGCCTGGAAC CTGAGCCGCG GCAAAACCAA CGACTGGGAC
AAGGGCAAAG GCGAAGCTGT GAACTTCGAC GCCACCGATT GGTACGAGCT GATGAAGGAA
GGTCAGCGCA TGGAAGGTCT GATCGAGGGA CACTGGCAGG TGATGGGTGA AACCGATCAC
GAGCATCGCG TAAAGCTAGT AGTGGACGAG TGGGGCCCCT GGTACCGCCC GGGCAGCGAA
GCAACTCCCG GGGATGCGCT GGAGCAGATG CCAACCCTCC GCGACGCGGT GTTCAGTGGC
ATGACCCTCG ATATCTTCAA CCGCCATCCT GAAAAAGTGG CGATGGCGAA CTGCGCGCAG
CTCATCAACT GTCTGAACAG TTTGTACCTG GCGCATGAAG ACAATTTCAC GGTGACTCCA
GTTGGGCACG TGTTCGATAT GTATGCGCCT CACCAGGGCG GACAGTCGCT GCGCACGATC
TTCTCTTCAC CGCAGGTGAA GTACGAGCGC GACGGCACAC CGGCGACCTT CTGGGGGCTG
CGCGGTGCCG CGTCTTTAAC CGGGAAAAAA CTCTTCGTCA CGGCGGTCAA TCCGGATACC
AGCTCACCAC GAGAAAGCGA GATTGCGATC CGCGGGGCGA GCGCGGCTTC TGGCACGCTG
ACCGTGCTCT CGGCGCCCGA CATCCACGCG CACAACACCT TTGAACATCC GGATGCGGTA
GTACCCCGTC GGAATGAATT GAAGGTGAAT GGCGCGACGG TGTCGCTCGT GATTCCGCCA
GCCTCCGTGG TCGCAATCGA ATTGGAACTG AGTTGA
 
Protein sequence
MSTRRDFLRS TAIGAAGLAL TRFSLPSLAQ SRSVDSRIEV LLDEPLGTIS PNIYGHFTEN 
LAGVFYDGIW VGENSKVPNV GGLRKAVIDH MRKIKAPVIR FPGGCFADTY DWRDGIGPRE
KRPRRPNFWG NGDPKQNVKH KYDPNEVGTD EFMHFCREIG AQGYLAANVR SLPAEQFQQW
VDYCNSPAGS TTLAETRATN GSREPYKVEF WGVGNESWGC GGDFEPGQYA MEFRRYATWV
PGFGMNLKFI ASGPNVEDYH WTSGFFEAMQ KRMGSLHMVY GWALHHYAWN LSRGKTNDWD
KGKGEAVNFD ATDWYELMKE GQRMEGLIEG HWQVMGETDH EHRVKLVVDE WGPWYRPGSE
ATPGDALEQM PTLRDAVFSG MTLDIFNRHP EKVAMANCAQ LINCLNSLYL AHEDNFTVTP
VGHVFDMYAP HQGGQSLRTI FSSPQVKYER DGTPATFWGL RGAASLTGKK LFVTAVNPDT
SSPRESEIAI RGASAASGTL TVLSAPDIHA HNTFEHPDAV VPRRNELKVN GATVSLVIPP
ASVVAIELEL S
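
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the gene sequence relates to the protein sequence, the Python sketch below joins the blocks of the first gene sequence line and translates them with the standard codon assignments, which translation table 11 shares with table 1 for internal codons. The helper names (`join_blocks`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the sequence is used as sample input.

```python
# Codon table built from the NCBI base order TCAG.
# Assumption: translation table 11 uses the same amino acid assignments
# as the standard table; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCACTC GCCGTGATTT CCTTCGTTCC ACTGCCATTG GCGCTGCTGG ATTGGCGCTG"
dna = join_blocks(first_line)

print(len(dna))        # prints: 60
print(translate(dna))  # prints: MSTRRDFLRSTAIGAAGLAL
print(gc_fraction(dna))
```

The translated residues match the first two blocks of the protein sequence (MSTRRDFLRS TAIGAAGLAL). Passing the full gene sequence to the same helpers would allow the listed GC content and protein length to be checked in the same way.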