Gene Hhal_0393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0393 
Symbol 
ID4711409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp458195 
End bp459460 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content68% 
IMG OID639854856 
Productmajor facilitator transporter 
Protein accessionYP_001001989 
Protein GI121997202 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.464327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGCTG TCAATGCCAC CGGTTTCGCC ATCCTCGGCG CGGGGCTCAT CGCCATCGCC 
TACGGGCTTG CGCGTTATGC GTACGGGTTG TTCGTCCCGT CGATTCGCTC GGAGCTCGGG
TTGTCCGCCG ACGCGGTCGG GGTCGTCGGA TCCATGGCCT TTATCAGCTT CTGCCTGGCC
AGCGTCGTGG CGCCACTGAT TGTCGACCGG CTCGGCGCAC GCTACTCGGC GGTGCTCTCC
GGTCTGTTTG CTTTGGCCGG GCTGACCCTG ATCAGCCAGG CCGGTGACGC GATCACCCTC
GGCGCCGGGG TGTTTGCCTG CGGGATCTGT ACCGGCCTGA TGATGCCGGC CCTGTCCTCC
GGCGTGCAGA CGAACATTCG TCCGGACCTC CGCGGCCGCG TCAACGCCGT CATGAATGCC
GGTACCAGTG CCGGCCTGAT CCTCTGCGTG CCCGCCGTGC TTCTGCTCAG CGGCGCGTGG
CGTATGGCCT ACGGCTCTTT CGCGGTGCTC GCGGCCCTGG GCATTGTCGC GGCCCTCCTC
CTGCTCCCTT CGGCCTCGAA GGTCGGTGGC GGAAAGGCCA AGCCGGCTCC CTTGCCGCTG
GATACTCAGC GGTGGCTGAC TGTCGGGCGG CTGACGGTGT TCTGCTTCGC CATGGGGGTG
GCCGGTTCGG CCTACTGGAT CTTCGCACCC GATTTGGTGG TCGAGATCGG CGGGCTGTCC
GAGCGGTTGA CCGGAATGCT CTGGCTCGTG GTCGGCATTG CCGGGCTCGC CGGCGCCTGG
GCGAGCGACC TCGGCGATCG GCTGGGGGCG CCTGCCACCC AGGCCATCGC GTTGGTGGCT
CTGGGGGCGG CGACGGCGCT GGTCGCCGCG GCACCGGGTG ACGTGTGGAT GGCACTGGTG
TCGGCCGCCG TGTTCGGTTG GGCCTTCATG ACCCTGACCG GGCTGTATCT GGTCACCGGC
ATCCGGCTGT TGCGCGAGCG CCCGTCCATG GGGCCGGTCG TGCCGTTCCT GGCCATCACC
GTCGGGCAGG CTGTCGGATC GCCCTTGGTC GGGTGGGCCA TCGGCAACGC GGGCTATGTG
GAGGCGTTCC TGATGTTCGC AACGCTGGCG GTTCTGATTG CGGCCTTTTC GTTCCTGTTC
CCCCGTCCTG CCAGCGACGC AGCGGATGAA GGGGGGGAGG GCGCGGCCGA GTCGCGGATG
GCCGCGGCCC CTGCCACGTC GAAGCGGGCT ATGCAGGATG GACAGTCCGA AACGGAATCC
ATCTGA
 
Protein sequence
MRAVNATGFA ILGAGLIAIA YGLARYAYGL FVPSIRSELG LSADAVGVVG SMAFISFCLA 
SVVAPLIVDR LGARYSAVLS GLFALAGLTL ISQAGDAITL GAGVFACGIC TGLMMPALSS
GVQTNIRPDL RGRVNAVMNA GTSAGLILCV PAVLLLSGAW RMAYGSFAVL AALGIVAALL
LLPSASKVGG GKAKPAPLPL DTQRWLTVGR LTVFCFAMGV AGSAYWIFAP DLVVEIGGLS
ERLTGMLWLV VGIAGLAGAW ASDLGDRLGA PATQAIALVA LGAATALVAA APGDVWMALV
SAAVFGWAFM TLTGLYLVTG IRLLRERPSM GPVVPFLAIT VGQAVGSPLV GWAIGNAGYV
EAFLMFATLA VLIAAFSFLF PRPASDAADE GGEGAAESRM AAAPATSKRA MQDGQSETES
I