Gene Acid345_0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0654 
Symbol 
ID4069746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp805567 
End bp806898 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content56% 
IMG OID637982660 
Productmajor facilitator superfamily sugar transporter 
Protein accessionYP_589733 
Protein GI94967685 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.866655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.707022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATTA ACCGCCACGT GGTAAAGAGC ACATTCGTCA GCGCGCTTGG TGGTTTGCTC 
TTTGGATTCG ATACGGCAGT CATCGCGGGT ACCACTCATC AACTGACTGA GGTATTCAAG
CTCACCCCCA ATGCGCTGGG AATTACCGTC TCCAGCGCTC TGTTGGGAAC CGTTTTGGGA
GCGATCACTG CTGGATACCC AGGGCAGCGC ATCGGTCGGC GCGACAGCCT TCGCATCATG
GCCATTTTCT ATATGTTGTC AGCCTTGGGC TGCGCGCTCG CCTGGAATTG GAGTTCGCTG
CTCGTTTTCC GCGTGATCGG CGGGCTAGGT ATTGGCGGAT CGTCGGTGCT GGCGCCGATG
TACATTGCAG AGCTTTCCCC GCCGAAGTGG CGTGGACGCT TGGTCGGATT CTTCCAGGTA
AACATCGTTA TCGGCATACT CGTTGCGTAT CTCTCGAACT TCGTGATCGG CAGGATGAAC
TTCGGCGCGA TGGAATGGCG ATGGATGTTG GGTATCGCCG CGGCGCCGGC GGTTCTTTTC
TTCGTAATGT TGTTCTTCAT TCCGCGCAGC CCGCGGTGGT TAGCGATGAA AGGACGCACC
GCCGAAGCAC TTCGGGTTAT GCGTCTTACC GGGACTGACA ATCCGGAGGA AGAGCTTAAC
GGGATCGTGC GTTCCATCCA TCTTGAGCGC TCCACTAAAA GCGAGTCGCT CTTGCAGCGC
AAGTACCTCT TGCCAATCTT TCTCGCCGTT AGCGCTGGCG CGTTTAATCA GCTAACCGGC
ATCAATGCCT GCCTTTACTA CCTGAACGAT ATCTTTGCCG CCGCTGGAGC AAGCAAGTAT
TCCGCCGGTA TGCAGTCGGT ACTCATCGGC TGCACCAATC TGTTCTTTAC TCTGGTCGCG
ATGACGATGA TCGACAAACT CGGGCGAAAG AAACTTCTGC TGGTCGGCAC AACAGGTGTG
TGCATCTTCC TTGCGATCAT CGGACAGATT TTCCACAGCG GAGGCCATGG CGGCTCGCTG
ATCTGGCTGC TCATCGGCTT CATGGGCTTC TTCGCCATCT CGCAAGGCGC GGTGATTTGG
GTTTACATCA GCGAGGTCTT TCCCACACGT GTTCGAGCCC AAGGCCAAGC GCTTGGCACT
TCAACCTTGT GGATCACCAA CGCGCTGATC TCGTGGATGT TCCCGGTGCT CGCCGCGAAA
TCGAGCGCTA CGCCGTTCTT CTTCTTCGCG GCAATGATGT TCATTGACGT GCTGATTATC
GCGATGATCT ACCCGGAGAC GAGTGGCGTA TCTCTTGAGC AGTTGGAACA GAAGTTGGGC
GTAGTCGAAT AA
 
Protein sequence
MAINRHVVKS TFVSALGGLL FGFDTAVIAG TTHQLTEVFK LTPNALGITV SSALLGTVLG 
AITAGYPGQR IGRRDSLRIM AIFYMLSALG CALAWNWSSL LVFRVIGGLG IGGSSVLAPM
YIAELSPPKW RGRLVGFFQV NIVIGILVAY LSNFVIGRMN FGAMEWRWML GIAAAPAVLF
FVMLFFIPRS PRWLAMKGRT AEALRVMRLT GTDNPEEELN GIVRSIHLER STKSESLLQR
KYLLPIFLAV SAGAFNQLTG INACLYYLND IFAAAGASKY SAGMQSVLIG CTNLFFTLVA
MTMIDKLGRK KLLLVGTTGV CIFLAIIGQI FHSGGHGGSL IWLLIGFMGF FAISQGAVIW
VYISEVFPTR VRAQGQALGT STLWITNALI SWMFPVLAAK SSATPFFFFA AMMFIDVLII
AMIYPETSGV SLEQLEQKLG VVE