Gene Acry_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1598 
Symbol 
ID5162219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1765299 
End bp1766669 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content71% 
IMG OID640553513 
Productgeneral substrate transporter 
Protein accessionYP_001234723 
Protein GI148260596 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0615669 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT CCTCCCTCGC CATCCTCGAC GACCAGTCCT TCGGCACCTT CCACTGGCGC 
GCCGTGTTCA CCACCGGCAT GGGCGTGCTC GCCGACGGCT ACGACCTGTC CTCGATCGGC
ATCGTCCTGC CGACCGTGCT CGCCTCCTTC GGCCTGACGA AGATCGACAG CCTCGAATCC
GGCCTGCTCG CCGGCTCCGC CCTGGTCGGC GCCGCCCTCG GCGCGCTGAT CTTCGGCTTT
CTCGGCCAGC GCGGCCGCAA GACCTTCTAC GGGCTCGACG TCGCGCTGAT GGCGATCGCG
GCGGTCGCCC AGGCCTTCGC CCCCAACCTC GCCTGGCTGA TCGCGATCCG CTTCATCCTC
GGCATCGGCG TTGGCGCCGA CTACGTGCTC TCCCCCACCA TCATGGCCGA GCACGCCAAC
CGGCGCGACC GCGGCCGCGC CCTCGGCGTC GGCTTCTGCC TCACCTGGTG GCTCGGCGCG
GCCCTCGCCG GCCTGCTCGC CCTCGTCCTG CACGCGCTCG GCGTGGCGCC GGACATGGTC
TGGCGCATCG TCCTCTCCGC CGGCGCGCTG CCCGCGCTCT CGGTGCTCTG GCTGCGCCGC
CGGATGCCGG AGACCGCGCG CTACCTCGCC CGCGTCGCCG GCGACCAGGA TGCCGCCGAA
ACCGTCATCC GCGGCATCAC CGGCACCGGC CACGCCGCCC CCGAGGCCGA CCGCCGCGAG
GTGATGGCGG TGCTCCGCCG CCATGCCGGG CAGATCTTCG CCGCCGCCCT GCTCTGGTTC
ATCTTCGACA TCGTGATCTA CTCGACCGTC CTGTTCGGCC CCTCGCTGAT CGCCCACGGC
CTTGGCCTGA GCCCGACGAT GTTCTCCCTG CTGATGACCT TCGCCTTCAT CATGCCGGCG
GTGCTGATCG GCGCCTTCGC CCTGCTCGAC CGCTTCGGCC GCAAGGCGGT GCAGATCGGC
GGCTATGCCG GCGCCGCCCT TCTCCTCGTC ATCTTCGCGC TGATCCACAA GGATATCGGC
CAGAACCCGG TGCTCGGCCT CGTCGTCTAC GGCCTGTTCA ACGTGATGAT CATGGGCCCC
AGCATGGTCA GCGGCGCGGC GATGCTCGGC GTCGAGCTGA GCCCGACCCG CATCCGCACC
ATGGCGCAGA GCTTCACCGT GGTCGGCGGC CGCCTCGGCG CCTCGCTCAG CGCCTTCGTC
TTCCCGCTGG TCTTCGCCAA GCTCGGCGAG GTGGCGGCGA TCGGCGTGCT CGCCGGCCTC
TCGGTCCTCG GCGCCATCCT CACCTACGCG CTGATCCCCG AGACCGCCGG CCGCTCGCTC
GAAGACCTGA ACGACGAGGC CGAGGCCCTC GCCCCCGCGG CCGCGCAATG A
 
Protein sequence
MQRSSLAILD DQSFGTFHWR AVFTTGMGVL ADGYDLSSIG IVLPTVLASF GLTKIDSLES 
GLLAGSALVG AALGALIFGF LGQRGRKTFY GLDVALMAIA AVAQAFAPNL AWLIAIRFIL
GIGVGADYVL SPTIMAEHAN RRDRGRALGV GFCLTWWLGA ALAGLLALVL HALGVAPDMV
WRIVLSAGAL PALSVLWLRR RMPETARYLA RVAGDQDAAE TVIRGITGTG HAAPEADRRE
VMAVLRRHAG QIFAAALLWF IFDIVIYSTV LFGPSLIAHG LGLSPTMFSL LMTFAFIMPA
VLIGAFALLD RFGRKAVQIG GYAGAALLLV IFALIHKDIG QNPVLGLVVY GLFNVMIMGP
SMVSGAAMLG VELSPTRIRT MAQSFTVVGG RLGASLSAFV FPLVFAKLGE VAAIGVLAGL
SVLGAILTYA LIPETAGRSL EDLNDEAEAL APAAAQ