Gene Apre_1113 details


Gene Information

Locus tag: Apre_1113
Symbol:
ID: 8397900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1195951
End bp: 1197102
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 41%
IMG OID: 644995460
Product: major facilitator superfamily MFS_1
Protein accession: YP_003152861
Protein GI: 257066605
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
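
The coordinate and length values in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1195951
end_bp = 1197102
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)
print(expected_protein_aa, expected_protein_aa == protein_length_aa)
```

Under these assumptions the span and the derived protein length agree with the listed Gene Length and Protein Length.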


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000378306
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAAAAG AAAAAATATA TTGGCCGAAT TTCATCTTAC TTTTAAGTGT GACCTTTATG 
GCAATCATCT CAGAGCTTAT TCCTTCGGGG ATTTTGCCAG AACTTACAGA GGGTCTAAGG
ATTAGCGAAA CTCAGGCCGG AAATCTCTTG GGCTTTTATG CTATAGCGAG TGCAATTTTT
TCGATTCCTC TTATATCTGC TACGGTTGGA TTTTCTAGGA AGAAATTGCT CCTGGCTCTT
CTTATGGGCT TTGCCCTAGG CAATTTCCTT GTGGGAATTT CCACTACATA TGGCATCGCT
CTTCTTGGGA GGATGATAGG AGGAATTTGT GCAGGGATTC TTTGGCCCAT GGTTGCATCT
TTTGGGATGA AGTTAACTGA TAAGGCCCAC AAGGGTTTTG CCGTCGCCTT TATCATGAGT
GGGACGACCT TTGGTATGAG CCTGGGCCTT CCGATTTTTA CTGCAATAGG TAGAAATATT
TCTTGGAGAG CAGAGTTTTT TGCTGTTTCA CTCGTTATTT TCCTTATAGG AATCCTTATA
TATTTTATCC TACCAGAAGT TAGCGGTGAG ATTCGTGATA GGACGAACTC CCCTTTTACC
TTGATTAGAA ACAAGGGCGT GCTCATAGTA ATGCTTCTTA CTTCCCTTGC AGCCATTGCA
AACTACGGGG TTTACACTTA TACCACAAAC CTAATAAGGG CTATCGATTA CACAAGAGGA
ATTGGCTTTG CTCAAGTTTT GTTTGGTCTT GGCTCTATTA TTTCTGTCAT TATTGCAGCA
AAGGTAATTG ACAGGCACAT CAGATCTCTT ACTATTTTTA TGTTCGGATC AGCTCTTTTG
TCCCTAATCA TATTTGCTTT TTTCGCAAGT TATAGCCTCT TTTGTGATAT GGCCTTTTTA
CTTCGGGGAA TAGGATTTGG GGCCTTGGTT AGCCTATTTC AAACAGCGGT AGCTAGGCAG
GTTAGGGAAA ATGCATCGGC TGTAGCCACA TCCCTCCAGT CAGCAAGCTT TAACTTCTCT
ATAATGCTTG CAAGTTCCCT TGCGGGGAGC CTTCTTACAA ATTATTCTGT TAAATTTATG
CTTAGCTTTA TGTGTCTGGT CCTTGTAGTT GGCATATTTG TAGCGTTTTT ATCAAAGAAA
ACTTTATATT AA
 
Protein sequence
MEKEKIYWPN FILLLSVTFM AIISELIPSG ILPELTEGLR ISETQAGNLL GFYAIASAIF 
SIPLISATVG FSRKKLLLAL LMGFALGNFL VGISTTYGIA LLGRMIGGIC AGILWPMVAS
FGMKLTDKAH KGFAVAFIMS GTTFGMSLGL PIFTAIGRNI SWRAEFFAVS LVIFLIGILI
YFILPEVSGE IRDRTNSPFT LIRNKGVLIV MLLTSLAAIA NYGVYTYTTN LIRAIDYTRG
IGFAQVLFGL GSIISVIIAA KVIDRHIRSL TIFMFGSALL SLIIFAFFAS YSLFCDMAFL
LRGIGFGALV SLFQTAVARQ VRENASAVAT SLQSASFNFS IMLASSLAGS LLTNYSVKFM
LSFMCLVLVV GIFVAFLSKK TLY
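
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. It assumes the standard codon assignments (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons) and that the first codon is a valid start codon, which is therefore read as M. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
# Codon table built from the NCBI compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is translated as M,
    even when it is an alternative start such as GTG.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "GTGGAAAAAG AAAAAATATA TTGGCCGAAT"
print(translate(prefix))  # MEKEKIYWPN
print(gc_percent(prefix))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and GC content.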