Gene Acid345_3976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3976 
Symbol 
ID4072449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4701522 
End bp4702745 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content61% 
IMG OID637986003 
Productmajor facilitator transporter 
Protein accessionYP_593050 
Protein GI94971002 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCC TTCGCAAGGT CTTCAAGGCA TTCTCCTATC GCGACTTCCG CCTCATGTGG 
TTTGGGGCGT GCACCTCCAG TATTGGCACG TGGATGCAGA TCGTCGCGCA GGGCTGGCTC
ATTTACCGGC TGAGTCATTC CGCCTTCCTG CTTGCGTTGG ACCAGTTCAT GGGCGGCATT
CCGATCTTCG TCCTCACCCT GATCGGCGGC GTGGTGGCCG ATCGCGTCGA ACGCCGCAAA
GTTCTGCTGA TCTCGCAGTT CATCCAGATG GCGAGCGCCA TCACCCTCAC CGTCTTAGTC
GCGACGAACT ACCTGAAAAC TGCACACGAT GTGTGGATGA TCATCACCTT GTCCTTCGTC
TCCGGCGTGG CGCAGGCTTT CGGCGGACCG GCTTATTCCG CGCTGATCCC GACGCTGGTG
GACCGCGAAG ACATGCCGAA CGCGATTGCG CTGAACTCCA TCCAGTTCAA CATGGCGGTG
ACGGTTGGCC CAGCGCTGGC AGGCATCACG CTCGCCAAGC TCGGAGAGAA GTGGTGCTTC
GGCCTGAATG CGGTGTCGTT CCTGGCGCCG GTGCTCTCTC TCCTGATCAT CTCGGCGCGG
TTCCAGCCGC AGCGCACCAA AGAGAGCGTG CTCACCAGCC TGAAGCAGGG CATCAGTTTT
GTTCGGCAGC GCGAAGCGAT GGTTGCGCTG ATTGTGCTCG CGTTCTGCAT GACCGCCCTC
ACCGCGCCCA TGCGCACCTA CTTCCCAGTC TTCGTGAAAG ACATCTTCCA CCGCGGCGCC
GAGACGTACG GTTGGCTGCT GTCAGCCATG GGCATTGGCT CGATCATTGG CTCGCTGATC
ATCGCGAGCC GCGGCAACAT CCACAACAAA GGGCGCGTGG CGCTGGTCAC CATGACCTGC
CTCGGCGCGG CGATTTCCGC CTTTGCGCTC TCTCGCGCTC TGCCGTTCAG TTATTTCTCG
GTGATCATCG TGGGTGCTTC CATGATGGCC GTCTTCGCCA CCGTGACCTC GCTGGTGCAA
TTGATTACGA CAAACGAAAT GCGCGGGCGC GTGATGAGCG TCTATAACTG CGCCTTCCGT
GGCGGCATGC CGATGGGCAA CCTCGTCACC GGCCAACTGG TTCCGATCTA CACTGCGCCA
ATCGTCGTCG GTGCGAGCGG GGGATTGCTG GTGCTGATTG CGATTTACTA CCTGCTGTTC
CAGCGCCGGG TCGCCGCGCT CTAG
 
Protein sequence
MLTLRKVFKA FSYRDFRLMW FGACTSSIGT WMQIVAQGWL IYRLSHSAFL LALDQFMGGI 
PIFVLTLIGG VVADRVERRK VLLISQFIQM ASAITLTVLV ATNYLKTAHD VWMIITLSFV
SGVAQAFGGP AYSALIPTLV DREDMPNAIA LNSIQFNMAV TVGPALAGIT LAKLGEKWCF
GLNAVSFLAP VLSLLIISAR FQPQRTKESV LTSLKQGISF VRQREAMVAL IVLAFCMTAL
TAPMRTYFPV FVKDIFHRGA ETYGWLLSAM GIGSIIGSLI IASRGNIHNK GRVALVTMTC
LGAAISAFAL SRALPFSYFS VIIVGASMMA VFATVTSLVQ LITTNEMRGR VMSVYNCAFR
GGMPMGNLVT GQLVPIYTAP IVVGASGGLL VLIAIYYLLF QRRVAAL