Gene Acid345_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1053 
Symbol 
ID4068702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1323081 
End bp1324373 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content60% 
IMG OID637983061 
Productsugar transporter 
Protein accessionYP_590130 
Protein GI94968082 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.55252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCCG CTCTCGGCGG CCTTCTCTTC GGTTTCGACA CCGCCGTCAT CTCCGGCGCT 
ACCCACGCTC TTACCGAAGC CTACTCGCTC TCGCCCAGTC TCCTCGGGGT TACAGTCTCG
AGCGCTCTCT GGGGGACGGT TGCAGGCTCC CTTCTCGCCG GTATTCCCGC CGATCGCATC
GGCCGCCGCG ACAGCTTGCG CTTTATGGCG GTGCTGTATC TCGTCTCGTC GCTCGGCTGC
GCCTTTGCCT GGAACTGGCC GGCGCTTCTC GTCTTCCGCG TTATCGGCGG CCTCGGCATC
GGAGGCTCCA GCGTCATCGG TCCTATGTAC ATTGCGGAGA TCGCTCCCGC CAAGTGGCGC
GGACGCCTCG TCGGCTTCTT CCAATTCAAT GTCGTCTTCG GAATCCTTCT CGCGTACTTC
TCCAACTATC TCGTCGGACT CGGATGCTTC GGCGCAAATG AATGGCGCTG GAAGCTCGGC
ATTCCCGCCG TACCCGCGGC GCTTTTCCTG ATCATGCTCT TCGGAATCCC GCGCAGCCCG
CGTTGGCTGG TGAAGAAACA GCGCGTCCCT GAAGCCCGCG AGTCGCTGCA ACTGATAGGC
GAAGAAGATT ACGAGAAGGA ACTCCACGAC ATAATCGAGT CCATCGACGC CGATCACGCG
CAAGGCGACA GCCTTTTTGA TCGCAAATAC TTGTTCCCGA TCTTCCTCGC GGTTTCCATC
GGCATGTTCA ACCAGCTCGC CGGTATCAAC GCCATCCTCT ATTACCTCAA CGACATCTTC
GCCCAGGCCG GATTCAACAA GGTCTCCAGC GATTTGCAGG CCGTCGCCAT CGGCGGCACC
AATCTCCTCT TTACCATGCT CGCCATGAGC ATCATTGACT ACGTCGGCCG CAAGACGCTC
CTGCTCATCG GCGCGGTCGG TACTGCGCTC TGTCTCGGTG GTGTCGCGTG GATCTTCCAC
GTCCACCAGC ACCAGGGCTA TCTGCTTTGG CTGCTTGTGG TCTACATCGC GTTTTTCGCT
TTCTCGCAGG GCGCAGTCAT CTGGGTCTAC ATCAGCGAGG TCTTTCCCAA CCGAGTGCGC
GCTGGCGGAC AAAGCCTCGG CAGCTCCGCG CACTGGATCA TGAACGCCAT CATTGCCGGC
GTCTTCCCGG CTCTCGCGGC AAAGTCCGGT TCCATTCCGT TCGCATTTTT CGCCGCGATG
ACGGCCATCC AGTTCTTCGT GGTGCTGTTC GTCTATCCGG AAACCAAGGG CATCACCCTC
GAAGCCATGC AGAAGAAACT CGGAATCTCA TGA
 
Protein sequence
MVAALGGLLF GFDTAVISGA THALTEAYSL SPSLLGVTVS SALWGTVAGS LLAGIPADRI 
GRRDSLRFMA VLYLVSSLGC AFAWNWPALL VFRVIGGLGI GGSSVIGPMY IAEIAPAKWR
GRLVGFFQFN VVFGILLAYF SNYLVGLGCF GANEWRWKLG IPAVPAALFL IMLFGIPRSP
RWLVKKQRVP EARESLQLIG EEDYEKELHD IIESIDADHA QGDSLFDRKY LFPIFLAVSI
GMFNQLAGIN AILYYLNDIF AQAGFNKVSS DLQAVAIGGT NLLFTMLAMS IIDYVGRKTL
LLIGAVGTAL CLGGVAWIFH VHQHQGYLLW LLVVYIAFFA FSQGAVIWVY ISEVFPNRVR
AGGQSLGSSA HWIMNAIIAG VFPALAAKSG SIPFAFFAAM TAIQFFVVLF VYPETKGITL
EAMQKKLGIS