Gene Hhal_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2014 
Symbol 
ID4710291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2217420 
End bp2218700 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID639856487 
Productmajor facilitator transporter 
Protein accessionYP_001003580 
Protein GI121998793 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.273868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGATT TCGCCAACCA GGCCTACACC CTGCTGATCA TCACCGTCAT CTACGGCGAT 
CTGTTCACCC GGGTCATCGT CGGCGACGCC GGTGACGACT ACCGCCTGGG CAACCTGCTC
TGGAGCACGG CCCTGGCGCT GAGCTACCTG GGGGTGGTGG CCACCGCCCC GGTGTTCGGT
GCGGTCATGG ACTACGCGGC GGCCAAGCGT CGCTTCCTGT TCCTGAGCTA CGTCACCACG
GTGGCCGCCA CCGCCGCGCT CTACTGGGTG GAGCCCGGCT ACGTGGTGCT CGGCTTCGTG
CTGATTGTCC TCTCCAGCTA CGCCTACTCC ATGGGCGAGG CGTTCATCGC CGGTTTCCTG
CCGGATATCG CCGGGCCGCA GGAGATGGGG CGGGTCTCCG GGTTGGGCTG GTCCTTGGGC
TACCTCGGTG GGCTGTTCGC CACCGCGTTC ACGGTGCTGC TGCTCGGTGA GGTCAGCAAG
GAGAACTTCG ACCGCATCCG CTGGGTGGGC CCCTTTGCTG CCGCCTTTTT CCTGCTCTGT
GCCATCCCCA CCTTCCTGTG GCTGCGCGAG CGCGGGCAGC CCCGGCGCCT GCCGGCCGGA
CAGGGCTACG TTCGCCTCGG TATCCAGCGG GTGGGGCAGA CCGTGCGCAG CCTCGGTTAT
CTGCGTGACC TGGGGGTGTT CATGATCTCG CTGCTGATGG CCATGTCGGG GCTGGCCATC
GTCATCGCCT ACGCCTTCAT CTACGGTGCA CAGGTCATCG GCTGGGATGA GCGGGCGCGC
CTGATCATGT TCGTGGTCAC CCAGTTCTCC GCGGCGGCCG GCGCCATCGG CTTCGGGGTG
ATCCAGGACC GCTTCGGTGC CCTGCGCACC TACATGGTGA CCCTGGTGAT GTGGGTAGCG
GCGATCCTGC TGATCTGGGT GACCCCGGAG CTGACCGAGT GGCTCAATCG TTGGCTGGGC
ACGGAGTGGC AGGCGCAGTA CGTCTTCCTG ACCGCCGGGG TGGCCGCGGG GCTGGCCCTG
GGGTCGTGCC AGTCGGCCGG GCGGACGCTG GTGGGGCTGT TCGCTCCGCC GGGACGTGCG
GCGGAGTTCT TCGGGTTCTG GGGGCTGGCG ACCAAGCTGG CTGCCGCCTT CGGCCTGGTG
GCAGTGGGGG CGCTGCAGGC GGCCGTGGGT CTGCAGTCGG CGATCCTGCT CTGCGCCGTG
CTGTTTGCCG GCGCGCTGGT GGTGGCCTGG GGGGTCGACG AGGCGCGGGG TCGGGCGCGC
GGGCAGGCTT TGGTAGAGTG A
 
Protein sequence
MFDFANQAYT LLIITVIYGD LFTRVIVGDA GDDYRLGNLL WSTALALSYL GVVATAPVFG 
AVMDYAAAKR RFLFLSYVTT VAATAALYWV EPGYVVLGFV LIVLSSYAYS MGEAFIAGFL
PDIAGPQEMG RVSGLGWSLG YLGGLFATAF TVLLLGEVSK ENFDRIRWVG PFAAAFFLLC
AIPTFLWLRE RGQPRRLPAG QGYVRLGIQR VGQTVRSLGY LRDLGVFMIS LLMAMSGLAI
VIAYAFIYGA QVIGWDERAR LIMFVVTQFS AAAGAIGFGV IQDRFGALRT YMVTLVMWVA
AILLIWVTPE LTEWLNRWLG TEWQAQYVFL TAGVAAGLAL GSCQSAGRTL VGLFAPPGRA
AEFFGFWGLA TKLAAAFGLV AVGALQAAVG LQSAILLCAV LFAGALVVAW GVDEARGRAR
GQALVE