Gene Acid345_0907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0907 
Symbol 
ID4069118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1136677 
End bp1137735 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content56% 
IMG OID637982914 
ProductABC sugar transporter, periplasmic binding protein 
Protein accessionYP_589984 
Protein GI94967936 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.40288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0322478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACT TCATCCTCTT ACTCTGTATA GTGGCTCTGC TCGAATTCTC AACATCCTGC 
CACCGTGGGC ATGAAGAGGC GCGGCAATCT CGCGGCAAGG GCCCGGTGAA GATTGGATTG
TCGCTCGACT CGCTGCAATT GGAACGCTGG CAACATGATC GGGATGCCTT CGTGGCGAAA
GCCAGCCAAC TCGGAGCCGA AGTATTCATA CAATCCGCCA ACGGAGTGGA TGCTGTCCAG
ATCCGACAGT GCGAAAACTT GCTGACGATG GGCGTCGATG TTCTTGTCAT AGTGCCGCAT
AACGGCGAAG TCATGGCATC GGCGGTGCGC AGTGCTGAGG CGCAGGGCGT GCCCGTGATC
TCGTATGACC GACTGATTCG CGATTCGAAT GTGAGTCTGT ATGTCTCATT CGACAACAAG
CTGATTGGTG AGTTACAGGC GAAATACCTC TACGCCCGCG CACCAGCGGG CAATTACATT
CTTATCGGCG GTTCACCAAC GGATAACAAC GCTCACTTAA TTCGCGAAGG CCAAATGCAG
GTTCTTAGTC CCGCCATCAA GCGCGGCGAC ATTCGCATCA TCGCGGACCA ATGGGCCAAA
GACTGGCTGC CTAGTGAAGC TCTGCGTCAC ACCGAAAACG CTCTCACGCA GGCAAATAAT
CATGTCGCCG CGGTGGTGAC TTCAAATGAC AGCACCGCCG GTGGCGCCAT TCAGGCGCTT
GGAGAACAGG GCTTGGCGGG GAGAGTGCTT GTATCGGGTC AAGACACCGA CCTCGCGGCC
GCGCAGCGCG TCGTCGAGGG CACGCAGTCG ATGACCGTGT ACAAGCCGAT CAAGCCCCTC
GCAGAAAATG CAGCAGCGGC GGCGGTAGCG CTTGCGCGTG GCGAGAAGGT CCAGTCAAAT
TCGAACGTTA ACAACGGCGC GAAAGAAGTT CCTTCAATTC TTCTCGCGCC GATTGTTGTT
GATCGCACGA ACATCGATTC TACTGTCATT AAAGACGGCT TCTTGAAGCG CGAAGACATA
TACAAAAATG TCTCGCGCAC GCAGTGGCCG AAGGACTAG
 
Protein sequence
MRNFILLLCI VALLEFSTSC HRGHEEARQS RGKGPVKIGL SLDSLQLERW QHDRDAFVAK 
ASQLGAEVFI QSANGVDAVQ IRQCENLLTM GVDVLVIVPH NGEVMASAVR SAEAQGVPVI
SYDRLIRDSN VSLYVSFDNK LIGELQAKYL YARAPAGNYI LIGGSPTDNN AHLIREGQMQ
VLSPAIKRGD IRIIADQWAK DWLPSEALRH TENALTQANN HVAAVVTSND STAGGAIQAL
GEQGLAGRVL VSGQDTDLAA AQRVVEGTQS MTVYKPIKPL AENAAAAAVA LARGEKVQSN
SNVNNGAKEV PSILLAPIVV DRTNIDSTVI KDGFLKREDI YKNVSRTQWP KD