Gene Lferr_1968 details

Gene Information

Locus tag: Lferr_1968
Symbol:
ID: 6877955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1963714
End bp: 1965030
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 52%
IMG OID: 642789836
Product: major facilitator superfamily MFS_1
Protein accession: YP_002220392
Protein GI: 198284071
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
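
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1963714
end_bp = 1965030
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
expected_protein_length = span_bp // 3 - 1

print(span_bp, expected_protein_length)  # 1317 438
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.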


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.878316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAC AATCCCTGGT TCCCAGTCCA CCGCCCTCTG AATCTTCCGC AAAAAATCAT 
ATTCGTCACG GCACCCGCTC ATTTCGAAAT ACGAGCCTGG CGCTTTTCGC AGCAGGATTC
GCCACATTCA GCATGATTTA TTGTGTACAG CCCCTGATGC CTGCCTTCAG CAGAGAGTAT
GGAGTGGCCG CCACGAGTAG TGCTCTTTCC CTGTCTTTGA CTACGGGGAT TCTGGCTTTC
ACGATGCTCC TGGTCGGGAA CTGGTCGGAT CGTCTGGGAC GTAAGCCAAT CATGGTCTGG
TCACTGTTCA TGTCCGCATT TCTGGTGCTC GCCACGGGAT TCGCTCCAAA TTGGGATGTT
TTTCTGCTGG CGCGGGCCTT GCTGGGCATC AGCATCAGTG GTTTGCCGGC AGTAGCTATG
ACGTATCTCA ATGAGGAGGT ACACGCGGAC TCTATTGGCA TAGGTATGGG GTTATATATC
AGTGGAAGTG CAGTGGGTGG CATGTCAGGT CGGCTGGTTG CTGGCGTTTT GGCCAATTAT
TGGGGTTGGC ATGTTGCCAT TGTCAGCATC GGCGTCATCA GCTTGATCGC TGCCGTTCTG
TTTGTTCGCA GCTTGCCAGA TTCGCTGCAC TTTTCCCCAC AGCATGTGTC CTGGAACGGC
AGAGTCCAGC AAATACGTAT GTTGTTCAAA GATTCGGGGC TCCCTTGGTT GTTTGCGGAA
GGCTTTTTCC TGATGGGCAT TTTCGTGACC TTTTATAATT ATCTGACTTA TCGCCTGGTG
ACTTCTCCTT ACGACTTCAG TCAGGCTCAG GTGGGACTGA TCTTCAGCGT CTACGGGGTT
GGTATTTTCA GCTCTCCAGT GATGGGACAT ATTGCCGGAT GGGTGGGGCG TCGAAAAGTG
CTGTGGATGG CTTTCGCGTT GGTGATTACT GGAGTCCTTC TCAGTTTTGC GGGGGCCGTA
TGGGCGATTA TGCTCGGCAC CATATTGTTG ACTTTTGGTT TTTTTGGAGG ACATTCCATT
GTCAGCAGTT GGGTGGGTCG GAGAGCGCAT GGAGCCAAGG CGCAGGCAGC ATCATTGTAT
CTCTTTTTTT ATTATCTGGG TTCGGCGGTA CTGGGAAGTT CCGGAGGATA TTTCTATTCT
GGCTGGGGTT GGGATGGTGT GGCCGGGCTG CTGACTTTTC TGGCAGCATC TGGACTGCTC
ATCGCCTGGA AACTGCGTGC GCTGCCGCCT CTGACTTCAA TATCGTCAAT AACTGTCGGA
GAAGCGGTGA ATGGTCCCTA CAACGGTAAA CCACATGGAT ATCAGAACGC GCGCTAG
 
Protein sequence
MRKQSLVPSP PPSESSAKNH IRHGTRSFRN TSLALFAAGF ATFSMIYCVQ PLMPAFSREY 
GVAATSSALS LSLTTGILAF TMLLVGNWSD RLGRKPIMVW SLFMSAFLVL ATGFAPNWDV
FLLARALLGI SISGLPAVAM TYLNEEVHAD SIGIGMGLYI SGSAVGGMSG RLVAGVLANY
WGWHVAIVSI GVISLIAAVL FVRSLPDSLH FSPQHVSWNG RVQQIRMLFK DSGLPWLFAE
GFFLMGIFVT FYNYLTYRLV TSPYDFSQAQ VGLIFSVYGV GIFSSPVMGH IAGWVGRRKV
LWMAFALVIT GVLLSFAGAV WAIMLGTILL TFGFFGGHSI VSSWVGRRAH GAKAQAASLY
LFFYYLGSAV LGSSGGYFYS GWGWDGVAGL LTFLAASGLL IAWKLRALPP LTSISSITVG
EAVNGPYNGK PHGYQNAR
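
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record: it hard-codes the amino-acid string of NCBI translation table 11 (the table named in the Gene Information fields), strips the 10-base block spacing used in the listing, and stops at the first stop codon. It ignores table 11's alternative start codons, which is harmless here only because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
gene_start = "ATGCGAAAAC AATCCCTGGT TCCCAGTCCA"
print(translate(gene_start))  # MRKQSLVPSP
```

Passing the full gene sequence to `translate` and `gc_fraction` would give values to compare against the listed protein sequence and the GC content field.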