Gene Acry_1047 details


Gene Information

Locus tag: Acry_1047
Symbol:
ID: 5159495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1167951
End bp: 1169168
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 72%
IMG OID: 640552965
Product: major facilitator transporter
Protein accession: YP_001234182
Protein GI: 148260055
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
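
The coordinate and length fields above are related by simple arithmetic, which can be used as a consistency check on the record. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1167951
end_bp = 1169168
gene_length_bp = 1218
protein_length_aa = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1169168 - 1167951 + 1 = 1218 bp, and 1218 / 3 - 1 = 405 aa, matching the listed lengths.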


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATCGA TCGCCGCCCC TCCCACCGCC ACGACCGCGA CCGACCGCGA GGCCGCGAAG 
GTCATGGCGG CGGGCATCGC CAGCATGGTC CTCACCGTCG GCCTCGCCCG CTTCCTCTAC
ACGCCGCTGC TGCCGGTGAT GCAGGAGCAG GCGCATCTCT CCGTCACCGG CGGCGGCTGG
CTCGCCACCA TCAACTACGC CGGCTACATG ACGGGCACGC TGCTGATCGC CGCGATCGGC
GACTTGCGCA CCCGTTTCCT GTTCTACCGC GCCCTGCTGG TCATCGCCGT CATCACCACG
GCGGCGATGG GCCTGACCAC CAGCCTCGTC GCCTGGGCGG TGCTGCGCTT CTTCGCCGGA
ATGACGGCGG TGGGCGCGCT GCTGCTCGGC ACCGGGCTGA TGCTCGCCTG GCTGCGCCAT
CACGGCAGGC GGCTCGAACT CGGCGTGCCC TTCGGCGGCC TCGGCCTCGG CATCGTCGTC
TCCGGCCTGC TGCCGATGGC GATGGCCTTC CGGCTCGACT GGTCCGGCGA GTGGATCGCC
TCCGGCCTGT TCGGCATCGT CTTCCTGATC CCCGCCTGGA TCTGGATGCC GATGCCCCCG
CAAGCCGACG CCGCCGGCCC CGCCGCCCTG GCGGACACGC CGCCGGTCGC CCGCTGGATG
GCGCTGTTCG TCGCCGCCTA TTTCTGCGCC GGCGTCGGCT ATGTCGTCAG CGCCACCTTC
ATCGTCGCCC TGGTGGCGCA TCTGCCGGCG CTGCGCGGCA TGGGGAACCT GGTCTGGGTC
GTGGTCGGCC TCGGCGCGAT CCCGACCGCT CCGCTCTGGG ACCGCGTCGC CCGCCGCACC
GGCGACATCA GGGCGCTGCT CGCCGCCTTC GCCCTGCTCA CGCTCAGCAT CGCAATCGGC
GCGACCACGC GCGGCGTCGC CCTCTCGCTG GTCGGTGCCG CCCTCTACGG ATGTTCGTTC
AACGGCATCA CCAGCATGAC GCTGACGATC ATCGGCCGGC TCTACCCGCG CAACCCCTCG
AAGGCGATGG CGCGGATGAC GATCAGCTTC GGCGCCGCGC AGATCATCGC GCCCGCCGTC
TCGGGCTATA TCGCCGCGCT CACCGGCAGC TACGACGGCG CGCTGTGGAT GGCCGCCGCG
GTGATGGTCA CCGGCATGGC CTGCCTGCTG CTGCTCCCGC GCCGGCGCAC CCAGGCGGCA
GCCGAAGCCG CCGCCTGA
 
Protein sequence
MSSIAAPPTA TTATDREAAK VMAAGIASMV LTVGLARFLY TPLLPVMQEQ AHLSVTGGGW 
LATINYAGYM TGTLLIAAIG DLRTRFLFYR ALLVIAVITT AAMGLTTSLV AWAVLRFFAG
MTAVGALLLG TGLMLAWLRH HGRRLELGVP FGGLGLGIVV SGLLPMAMAF RLDWSGEWIA
SGLFGIVFLI PAWIWMPMPP QADAAGPAAL ADTPPVARWM ALFVAAYFCA GVGYVVSATF
IVALVAHLPA LRGMGNLVWV VVGLGAIPTA PLWDRVARRT GDIRALLAAF ALLTLSIAIG
ATTRGVALSL VGAALYGCSF NGITSMTLTI IGRLYPRNPS KAMARMTISF GAAQIIAPAV
SGYIAALTGS YDGALWMAAA VMVTGMACLL LLPRRRTQAA AEAAA
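
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition figure can be recomputed. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGTCATCGA TCGCCGCCCC TCCCACCGCC ACGACCGCGA CCGACCGCGA GGCCGCGAAG"
seq = clean_sequence(first_line)
print(len(seq))         # number of bases after joining the blocks
print(gc_percent(seq))  # GC percentage of this fragment only
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.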