Gene Acid345_1149 details


Gene Information

Locus tag: Acid345_1149
Symbol:
ID: 4069958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1432841
End bp: 1434064
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 62%
IMG OID: 637983159
Product: major facilitator transporter
Protein accession: YP_590226
Protein GI: 94968178
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
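
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1432841, 1434064)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Both values agree with the Gene Length and Protein Length fields listed in this record.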


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.299661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATTCGT CGGCCCCGAA CCCGGACAGA AGTACCAGCA TGAGCCTGAT CGCGGTCATG 
CTCGCGGGCG CGTGCACATT TCTCAACGTC TATTGCACCC AGCCTCTGCT GCCATTTCTT
CGACGCTTAT TTCACGCATC GGAACTCCAG GTCAGTCTTA CGGTGAGCGC GACCACGTTC
GGTGTCGCCA TCGCGGCGCC CATCGTCGGC TTAATGGCAG AGCGCATTGG ACGCAAGAAG
GTCATTGTTC CCGCGCTGTT CTGGCTTACC GTTCCAACTC TGCTCGCTGC AACCTCCACT
GGCATCTGGT CGATGGTGCT GTGGCGCTTC CTGCAAGGCA TCTGCGTCCC GGGAATTATT
GCGGTGATGA TCGCCTACAT CGGCGAAGAA TTCAGCGGCA TCAATGTAGG CAGCGTGATG
GCTTCCTATG TGACCGGCAC CGTGTTCGGC GGTTTTCTTG GCCGTTTCAT CGCCGGACTG
GTCGCCACCC ACTGGCATTG GCGCGCCGCT TTTGTCGTCA TCGGCGTCAT CAACCTGTGC
GGAGCGATCG CCGTTCGACA ATGGCTGCCC AAAGCCAGGA ACTTCAAAAA AGCCGAAGAC
ATTAACGCAA CGCTCAATGA CATGCGGATG CACCTGCGCA ATCCGCGGCT GCTGGCCACG
GTCGCCATGG GATTCGGCGT ACTCTTCTCG CTGGTCGGCG CATTCACTTA CGTGAACTTC
TACTTGGCAG CGCCGCCATT CCACCTCAGC AGTGCCGCTC TGGGCACGAT CTTCTGCGTC
TACCTGCTCG GTCTCATCAT TACGCCGCTC TCCGGACGCT TCCTGGACCG CAGCGGCTTC
CGCAATACCG CGATTGTGGC CACGGCGTTC GCGCTCACAG GACTTGCCTG CACGCTCTCG
CAGCACCTCA GCATAGTGAT CGTGGGCCTG GCGCTGTTCT CCTCCGGCAT CTTCATCTAC
CAGGCCGCCG CCACGGTACA GACCGGCATC AACGCCGGCC GCGCCCGCTC TTCTGCCGCC
GGACTCTACG TCACTCTCTA CTACATTGGT GGCAGCGTGG GCGCGACAGC GCTCGGTTGG
GTGTGGCTCT GGCGCGGCTG GCACGCCTGC GTTGCCGCGA TCGCCGTGGC CTCGTTGCTC
ACGCTCGTCT GTGCGTTCCT CAGCAGCTCG CCGACGGAGC GCATCCCTGC GCGCGTCGTC
ACCGAGTCTG CCGAGGTCAG CTAA
 
Protein sequence
MNSSAPNPDR STSMSLIAVM LAGACTFLNV YCTQPLLPFL RRLFHASELQ VSLTVSATTF 
GVAIAAPIVG LMAERIGRKK VIVPALFWLT VPTLLAATST GIWSMVLWRF LQGICVPGII
AVMIAYIGEE FSGINVGSVM ASYVTGTVFG GFLGRFIAGL VATHWHWRAA FVVIGVINLC
GAIAVRQWLP KARNFKKAED INATLNDMRM HLRNPRLLAT VAMGFGVLFS LVGAFTYVNF
YLAAPPFHLS SAALGTIFCV YLLGLIITPL SGRFLDRSGF RNTAIVATAF ALTGLACTLS
QHLSIVIVGL ALFSSGIFIY QAAATVQTGI NAGRARSSAA GLYVTLYYIG GSVGATALGW
VWLWRGWHAC VAAIAVASLL TLVCAFLSSS PTERIPARVV TESAEVS
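
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below illustrates this. The codon table is built from the standard NCBI layout, the start codon set is the one listed for table 11, and `translate_cds` is a hypothetical helper name, not a library function.

```python
from itertools import product

# Standard NCBI codon layout: bases in TCAG order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon always encodes methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAATTCGT CGGCCCCGAA CCCGGACAGA"))  # MNSSAPNPDR
```

The output matches the first ten residues of the protein sequence listed in this record.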