Gene HY04AAS1_0694 details

Gene Information

Locus tag: HY04AAS1_0694
Symbol:
ID: 6743498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 627302
End bp: 628681
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 39%
IMG OID: 642750493
Product: General substrate transporter
Protein accession: YP_002121359
Protein GI: 195953069
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
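
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(627302, 628681)
print(length)                     # 1380
print(protein_length_aa(length))  # 459
```

The two printed values match the Gene Length and Protein Length fields of this record.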


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAG TTGTCAAAGG TAGTATAGCA GAAGCTTTAA ACACTTCGCA GTTTAGTCTT 
TTTCACATCA AGGCTATGTT TGCTTCAGCC ATGGGATTTT TTACATCGGC TTACGACTTG
TTCATAATAG GTACTGCCCT TGTGCTAATA AAAGATGAAT GGCACCTAAG CGCCCAACAA
GTAGGTCTTA TAGGTTCTAT ATCTCTTATA GCTACTTTCT TTGGTGCTTT TATCTTTGGA
AATCTCGCAG ACAGGCTTGG TAGAAAATCT GTTTACGGCA TAGAAGCTAT TTTGATGGTT
TTAGGCGCTT TAATGTCCGC TTTTTCTTTT AACGTAGGCT TTTTACTTTT ATCTCGTTTT
ATATTGGGCT TAGGAGTAGG GGGAGATTAT CCTCTTTCTG CGGTTATAAT GAGCGAGTAT
GCAAACACTA CCACAAGAGG TAGAATGGTT ACATTGGTAT TTAGCGCTCA GGCACTGGGT
TTGATAGCTG GACCTATGGT GGCGCTCACT CTTTTGGCGG CTGGAGTAGA CAAAGATTTA
GCCTGGAGAA TAATGCTTGG GCTAGGTGCC TTACCAGCAG CTACGGTTAT ATATTTAAGA
AGAAGGTTAC CAGAATCCCC AAGATGGCTT GCAAGAGTTA AAGGTGAGAA AGAAGTGGCG
GCAAAAGATT TAGCATCGTT TTCACTTGGT GATATAGTCA TAGAAGAAGT CAAAGACCAG
ATAGTCAAAA AACCTTTATC AAAATATTGG TTACAGCTTT TAGGTACCGC TGGTACATGG
TTTTTGTTTG ACTACGCTTA CTATGGCAAC ACTATATCAA CACCTTTAGT ACTAAAACAT
ATAGCAACCC ATGCAAATTT AATACAAAGT ACAGCTATAA GCTTTCTAAT ATTTGTGGTA
TTTGCTGTAC CTGGTTATTT CATAGCCGCT GCCACTATAG ACAAAATAGG ACATAAATTT
TTGCAAATGC TTGGATTTTT CATGATGGGT CTTATGTTTT TCATAATAGG AATGTTTCCT
TCAATAGTAC ACAATTTTCC TCTTTTCGTA ACGTTGTATG GTTTGTCTTA CTTCTTTGCA
GAGTTTGGCC CAAATACCAC CACCTTTGTA TTGCCAGCGG AGGTATTTCC AGTAAACGTT
AGAACCACAG CCCATGGTAT ATCTGCTGGC GTAGCTAAGA TAGGCGCATT CATAGGAGCT
TACTTCTTCC CAATCTTGTT AAAGTCGTTG GGATTAAGCC ATACCTTGCT TTTGACATTT
GTATTCTCGT TAGCTGGACT AATACTTACT TATATAGCAA TCCCAGAACC AAAAGGAAAA
TCTTTGGAAG AAGTATCTCA AGAAGATACG ACCCTTTCAA AGCCCTCACC TGCTACCTAA
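
The GC content field reported for this gene can be recomputed from a sequence listing like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases and that the spaces and line breaks in the listing are layout only. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks of the listing)
    is ignored; case is normalised to upper case.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence above, as a small check.
print(gc_percent("ATGGCAAAAG TTGTCAAAGG"))  # 40.0
```

Passing the full 1380 bp listing to the same function gives a value to compare against the GC content field.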
 
Protein sequence
MAKVVKGSIA EALNTSQFSL FHIKAMFASA MGFFTSAYDL FIIGTALVLI KDEWHLSAQQ 
VGLIGSISLI ATFFGAFIFG NLADRLGRKS VYGIEAILMV LGALMSAFSF NVGFLLLSRF
ILGLGVGGDY PLSAVIMSEY ANTTTRGRMV TLVFSAQALG LIAGPMVALT LLAAGVDKDL
AWRIMLGLGA LPAATVIYLR RRLPESPRWL ARVKGEKEVA AKDLASFSLG DIVIEEVKDQ
IVKKPLSKYW LQLLGTAGTW FLFDYAYYGN TISTPLVLKH IATHANLIQS TAISFLIFVV
FAVPGYFIAA ATIDKIGHKF LQMLGFFMMG LMFFIIGMFP SIVHNFPLFV TLYGLSYFFA
EFGPNTTTFV LPAEVFPVNV RTTAHGISAG VAKIGAFIGA YFFPILLKSL GLSHTLLLTF
VFSLAGLILT YIAIPEPKGK SLEEVSQEDT TLSKPSPAT
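
The protein sequence can be checked against the gene sequence by translating the codons. The following is a minimal sketch, assuming the listing is the coding strand in frame from its first base. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which is not relevant here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str, strip_stop: bool = True) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T are reported as 'X'. A trailing stop ('*') is removed
    when strip_stop is True.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    if strip_stop and protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First and last lines of the gene sequence listing.
first_line = "ATGGCAAAAG TTGTCAAAGG TAGTATAGCA GAAGCTTTAA ACACTTCGCA GTTTAGTCTT"
last_line = "TCTTTGGAAG AAGTATCTCA AGAAGATACG ACCCTTTCAA AGCCCTCACC TGCTACCTAA"
print(translate(first_line))  # MAKVVKGSIAEALNTSQFSL
print(translate(last_line))   # SLEEVSQEDTTLSKPSPAT
```

Both outputs match the start and end of the protein sequence listed above, which supports the reading frame assumed here.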