Gene Hhal_2295 details

Gene Information

Locus tag: Hhal_2295
Symbol:
ID: 4709114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2522796
End bp: 2523962
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 69%
IMG OID: 639856770
Product: major facilitator transporter
Protein accession: YP_001003860
Protein GI: 121999073
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
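
The start and end coordinates, the gene length, and the protein length in this record are arithmetically linked. The sketch below shows that relationship, assuming 1-based inclusive coordinates and a stop codon that is counted in the gene length but not in the protein. The function name `cds_lengths` is hypothetical and is not part of any database tooling.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon that
    contributes to the gene length but not to the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(2522796, 2523962)
print(gene_len, protein_len)  # 1167 388
```

The result agrees with the Gene Length (1167 bp) and Protein Length (388 aa) fields listed above.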


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.855468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATTG CCCAGCTGCT GATCACCATC TACTGCACGG TCCTGGCCTT CTCGGCCATC 
CATGCCCCGC AGCCGCTGCT ACCGACGCTG CAGGCGGCTT TCTCGGTGAG CGAGCCGAAG
GCCTCGCTGC TGCTCACCGC TACCCTGCTG CCCCTGGCGG TCGCGCCCAT CGCCTACGGC
TTCGTCCTCC AGCGCGTCTC CTCGCGGCAG ATGCTGGTGG TGGCCTCGGG GCTACTGGCG
CTCACCCAGC TAGCGGTGGC CATCGCACCC ACCTTCGAGA TCCTGCTGGG GCTGCGCCTG
GTACAAGGCC TGCTGATCCC GGCCATCCTC ACCGCCCTGA TGACCTACCT GGCCGCCAGC
GCCGCGCCGG GGCGGACCAC GCGGGTGATG GCCGGCTATG TAGCGGCCAC GGTCATGGGC
GGTTTCCTGG GCCGGGCCAT CGCCGGCGCC ATGACCACCG CCGCCAGCTG GGAGGCGGCC
TTCCTGCTCT TCGGGATCGC CCAGCTGCTC TGCACCGCTC TGTTGCTGCG CCTGGACGCC
GACCCGCAGG CCGGATTCGG CCGCCTGGAC CGCCGCGCCG TCGGCCAGAT CCTGCGCCAG
CCGCGGGCCC TACGGGTCTA CGGCGCCATC TTCTGCGCCT TCTTCGTCTT CCTCTCGCTG
CTCACCTTCC TGCCCTTCCG TCTGGTGGAG TTGGAAACCG GACTGAGCGA TCTGGGCATC
TCGCTGATGT ACACCGGTTA CCTGATGGGT GTGGTCACCG CGCTCAGCGC CCTGCGCGTG
GCTGATCGCA TCGGCGGCGT GGTCAACACC ATGCTGTTGG GCATCGCCAT CTTTGCTGCC
TCGCTGGCGA TGTTCCTCGG CCCCTGGCTG GCGGTGATCT TCGTCGGGAT GTTCGTCTTC
TGCGCCGGGA TGTTCCTGCT CCACGCCCTG GCGCCCGGGT TCCTGAACCA GGAGGTGGAC
GGCGATATCG GCGTGGTCAA CGGCCTCTAC ATCGCCTTCT ACTACGCCGG CGGCGCGGTG
GGCTCCTGGC TACCGGGCTA CCTCTACCAC GGCCTGGGCT GGGAGGCCTA CGTGGCCTCC
CTGGCGGCCA TGCTCGGCCT GGCCGGGTAC TGGATCTGGG GGCTGCGGTC CGCCCCACGC
GCTGAACGGG GCACCTACTC CGGCTGA
 
Protein sequence
MPIAQLLITI YCTVLAFSAI HAPQPLLPTL QAAFSVSEPK ASLLLTATLL PLAVAPIAYG 
FVLQRVSSRQ MLVVASGLLA LTQLAVAIAP TFEILLGLRL VQGLLIPAIL TALMTYLAAS
AAPGRTTRVM AGYVAATVMG GFLGRAIAGA MTTAASWEAA FLLFGIAQLL CTALLLRLDA
DPQAGFGRLD RRAVGQILRQ PRALRVYGAI FCAFFVFLSL LTFLPFRLVE LETGLSDLGI
SLMYTGYLMG VVTALSALRV ADRIGGVVNT MLLGIAIFAA SLAMFLGPWL AVIFVGMFVF
CAGMFLLHAL APGFLNQEVD GDIGVVNGLY IAFYYAGGAV GSWLPGYLYH GLGWEAYVAS
LAAMLGLAGY WIWGLRSAPR AERGTYSG
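
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the DNA. It assumes the sense strand is given 5' to 3' and uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as starts). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same
# for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGCCCATTG CCCAGCTGCT GATCACCATC TACTGCACGG TCCTGGCCTT CTCGGCCATC"
print(translate(first_line))  # MPIAQLLITIYCTVLAFSAI
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field in the Gene Information section.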