Gene Acry_3608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_3608 
Symbol 
ID5159410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009471 
Strand
Start bp24319 
End bp25728 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content57% 
IMG OID640538910 
Productgeneral substrate transporter 
Protein accessionYP_001220343 
Protein GI148244107 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones121 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAT CAGCAGCACT TCCGCCCGTA GAAAATACCG TCTCCGAAGC GCTCAACCAG 
GTTGTATTTA GTCCGTTCCA TTTGCGAGCC ATTTTTGCGG CGGGAATGGG ATTTTTTGCC
TCTGCCTACG ACCTGTTCAT CATCGGCACG GCTCTGACCC TGATCAAGGG CGAATGGCAT
TTGTCGCCCG GATCGGTTGC GCTGATCGGC TCGATTTCGC TGGCCGCCAC CTTTGTCGGC
GCCTTCATGT TCGGGCGTAC CGCGGATATT TTTGGTCGCA AGTCGATCTA TGGCCTGGAA
GCGCTCCTGA TGACGGCGGG CGCTCTCCTC TCGGCGTTCG CACCCAATGT CACCATTCTT
TTGATTGCGC GAGTGATCCT CGGATTCGGT ATCGGTGGTG ACTATCCGCT TTCCGCCGTT
TTGATGAGCG AGTATTCCAA CACGAAATCA CGCGGCCGCA TGGTCAGCCT CGTGTTTTCC
ACCCAGGCTG CCGGCCTGGT GGTTGGTCCG GCGATCGCGC TCACTCTGCT AGCAGCCGGC
ATCGATCACG ACATCGCATG GCGCATCATG CTCGGCCTCG GCGCTCTACC GGCAGCAATG
GTGATCTATA TTCGACGCAC CCTGCCAGAA AGCCCGCGAT GGCTTGCCAG GGTGAAAGGG
GACGGTACGC GCGCTGCGCG GGAACTCGCT TCGTTCAGCT TGGGAACTGC GACGTCCGCA
GGGCGCGACA AGGTGGTGAA ACAGCCATTC AGCCGCTATT TGCTCGTTTT TCTGGGAACG
GCCGGCACGT GGTTCGTGTT CGACTATGCC TATTATGGCA ACACGATTTC GACGCCCATG
ATCATGCAGC AGATCGCACC GCATGCAGAC CTGCTGATGA GCACGGCAAT GAGCCTGATT
ATCTTCGCGG CGGCGGCGGT GCCCGGTTAC ATCCTCGCGA TCCTGACGGT TGATCGGATT
GGCCATAAGC GACTGCAACT GATTGGTTTC GCGGGCATGG GATTGATGTT TCTGATCATC
GGACTGTTTC CGATGCTGAT TTCAACCGTT GGACTATTCC TGATTATCTA TGGTCTGTCC
TATTTTTTCG CCGAATTCGG TCCGAATACG ACTACATTTT TGCTCTCCAG CGAGGTCTTT
CCGGTCAACA TACGGACCAC CGGCCACGGC GCATCGGCGG GAGTAGCGAA AGTGGGTGCG
TTCATCGGCG CATTTATCTT CCCGATCCTG ATCACAGATT TTGGGCTTTA TGGCACGTTG
CGCATCACTT TTCTCTTCTC GATGGTAGGA CTGGTGTTGA CGGCCGTATG CCTGCGCGAG
CCGGCCGGTC TCAGTCTGGA GGCTGTGAGT AATGAGACCA CGGACGAGGT CGCCATGTCA
CCGATGATGG CGAGGGCATC ATCGACGTGA
 
Protein sequence
MTTSAALPPV ENTVSEALNQ VVFSPFHLRA IFAAGMGFFA SAYDLFIIGT ALTLIKGEWH 
LSPGSVALIG SISLAATFVG AFMFGRTADI FGRKSIYGLE ALLMTAGALL SAFAPNVTIL
LIARVILGFG IGGDYPLSAV LMSEYSNTKS RGRMVSLVFS TQAAGLVVGP AIALTLLAAG
IDHDIAWRIM LGLGALPAAM VIYIRRTLPE SPRWLARVKG DGTRAARELA SFSLGTATSA
GRDKVVKQPF SRYLLVFLGT AGTWFVFDYA YYGNTISTPM IMQQIAPHAD LLMSTAMSLI
IFAAAAVPGY ILAILTVDRI GHKRLQLIGF AGMGLMFLII GLFPMLISTV GLFLIIYGLS
YFFAEFGPNT TTFLLSSEVF PVNIRTTGHG ASAGVAKVGA FIGAFIFPIL ITDFGLYGTL
RITFLFSMVG LVLTAVCLRE PAGLSLEAVS NETTDEVAMS PMMARASST