Gene Acid345_2380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2380 
Symbol 
ID4071378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2814134 
End bp2815558 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content59% 
IMG OID637984396 
Productmajor facilitator transporter 
Protein accessionYP_591455 
Protein GI94969407 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0389897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.03889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGGAGC CGCGAGGCAC CAGCGCGCAT CCAAACGGTA AGATGGGAGA GATGGCGAGC 
CATACCCCGG CGGGAGAAGG AATTCAAAAA TCGAAATTTC TGAATCGGCT GCCGGCGCTG
AAGGTCCGAA ACTTTCAGCT CTTTTTTGCG GGACAGCTGA TTTCGCTGAT CGGCACGTGG
ATGGACAACG TGGCCGAGGC GTGGTTGATC TACCGCCTGA CCGGTTCGTC GTTGAAGCTT
GGGACGGTCG GGTTTTGCAG CCAGATCCCG GTGTTCCTGT TCGCGCCGCT GGGCGGAATT
GTCGCCGACC GATATAACCG CCACAAAATC ATCATCGCCA CGCAGGCGAC TTCGATGGTG
CTGGCGGGCA TTCTTGCAAT CCTTACGCTT ACACATCGGG TGCAGGTCTG GCACGTGTTC
CTGCTGGCAG CGCTGATGGG TGTGGTAAAT GCATTCGACA TTCCCGCGCG TCAGGCCTTC
CTCTCCGACA TGGTCGGTCG CGAAAATCTG ATGAATGCCA TCGCGCTGAA TTCCTCGATG
TTTAACGGAG CACGCATTGT TGGGCCGGCA GTGGCCGGCA TTTTGGTGGC AAGCATTGGT
GAGGGCTGGT GTTTTGGCGC GAATTCGCTC AGCTATATCG CGGTCATCAC CGGGCTGCTC
ATGATGAAGC TGAATCTTCC AGTCCGCATT GCGAGCGGGA AGTCGCCGCT GCAGGACATC
GTCGAGGGGT TCCAGTTCGT CAAGGAAGCC GCGCCCATCC GGACGCTGCT CCTACTGCTT
GGATTGGTGA GTTTAGTCGG CATGCCTTAC TCCGTGCTGA TGCCGATTTT CGCGGACCAC
ATTCTGCATG GCGGCGCGAG AGGGCTGGGC ATCCTGATGG GCGCAACCGG CGTTGGTGCG
CTCGGCGGAG CGTTAACGCT CGCATTGAAG AATGGTCTTA AGGGGATTAG CCGGATTATC
AGCTACTGTG CATTCGGCTT CGGCACGAGT TTGATCCTGT TTTCGTTCTC GCGCTGGTTC
TGGCTCTCCG CGGCGCTCCT GATCCCGGTG GGCTACTCGA TGATGGTGCA GATGGCGAGC
TCAAACACGC TGCTGCAATC CATGACGCCA GATCGGCTGC GCGGGCGGGT GCTCGCTGTG
TATTCGATGA TGTTCATGGG TATGGCGCCG TTCGGAGCGT TATTCGCGGG AGCGATCGCT
GAGCGCATCG GTGCGCCTTG GACAGTGGCA GTCGGTGGAG TCGCCTGTAT TTGCGGCGGC
TTGTTCTTCC GGAGGAACCT GGCTACCTTC CGCGATGGGG CTCGCAAGAT GGTCCTCGCA
CAACAGATGG TCGGTGGCGA ACCGGCACCG GAAGTCACGG CGGGATCGCT GGTGCCAGCG
ACCGATGCGG AACTCGGCGA GGAGCCGATC AGTTCTACGT CTTAA
 
Protein sequence
MLEPRGTSAH PNGKMGEMAS HTPAGEGIQK SKFLNRLPAL KVRNFQLFFA GQLISLIGTW 
MDNVAEAWLI YRLTGSSLKL GTVGFCSQIP VFLFAPLGGI VADRYNRHKI IIATQATSMV
LAGILAILTL THRVQVWHVF LLAALMGVVN AFDIPARQAF LSDMVGRENL MNAIALNSSM
FNGARIVGPA VAGILVASIG EGWCFGANSL SYIAVITGLL MMKLNLPVRI ASGKSPLQDI
VEGFQFVKEA APIRTLLLLL GLVSLVGMPY SVLMPIFADH ILHGGARGLG ILMGATGVGA
LGGALTLALK NGLKGISRII SYCAFGFGTS LILFSFSRWF WLSAALLIPV GYSMMVQMAS
SNTLLQSMTP DRLRGRVLAV YSMMFMGMAP FGALFAGAIA ERIGAPWTVA VGGVACICGG
LFFRRNLATF RDGARKMVLA QQMVGGEPAP EVTAGSLVPA TDAELGEEPI SSTS