Gene Hlac_0450 details

Gene Information

Locus tag: Hlac_0450
Symbol:
ID: 7401068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 466948
End bp: 468186
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 72%
IMG OID: 643707514
Product: major facilitator superfamily MFS_1
Protein accession: YP_002565122
Protein GI: 222478885
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
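
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(466948, 468186)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```

Both results match the Gene Length and Protein Length fields reported in the record.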


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.122733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.430828
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCG GCCGGATCAG GGAGTACGAC GTACTCGTGC TCACGTCGCT GATCTGGTTT 
CTCGGGAAGT TCGTTCGGTA CGCGTTCCCG CCGCTGTTCG AGCCCCTGCA GGCGAGCTAC
GGGGTGAGCA ACGCCGCCGT CGGAGCCGCG TTCTCGGGTT TCATGGCCGT CTATGCCCTC
CTGCAGTTCC CGAGCGGAGC GATCGCCGAC CGCGTCGGGG CCGTTCGGGT GATCGCGTTG
GGCGCGGTCG TGGCCGGTGT CGGTTCCCTC GCGTTGCTCT TTGACACCCC CTTCGCGGTG
CTGGCGGGGG CGATGCTCGT GATCGGCGCG GGGACGGGCG CGCACAAAAC GGTCGCAATA
CGGCTGCTCT CGCGGGTGTA CCCGGTTCGG ACCGGGCGCG CGCTCGGCGC CCACGACACC
GTCGGCGCGC TCGGCGGGGT GGCCGCGCCC GCGGCGGTCA CCCTCTTCGT CGCCGCGCCG
CCCGCGCTCG CCGGGTTCCT CTCTCGGCTC CCGGGCGCTG ACTGGCGCGG CCTGTTCGTC
GTCACGGGCG TCATCGCGCT CGCGCTTGCG GGAGCCTTTG CCCTCCGCGT TCCGGGTCGG
CTCCCGGCTG ACGCCGACCG CGGCCCGGAG CGAGAGGGGT CGGAGCCGGG CGCGAGCGAC
TACCTCGCGT TGTTCGAGGA CCGACGACTC GCGGCGTTCG TGATCGTGAC GATCGCCTTC
TCGTTCGCGT ACAACGGGGC CGTCGCCTTC CTCCCGCTGT ATCTCTCGCA AGCGGCCGGG
CTGTCGACCG CGACCGCAAA TCTGCTGTAC TCCGCGCTGT TCGCGGTCAC CTTCGTCCAA
CTCGTCTCCG GCGACCTCTC CGACCGATTC GGCCGCTTCC CCGTGATGGT TGCTGCGCTC
GCGCTCGCGG CGGCGGCTCT CGTCGGCGTC GTCGCGCTCG CAAGAGGGGA AACAGGGGCG
GGACCGATCG TTCTCGGCGC TCTCGTCGTC GCGTTCGGGC TCGGTTCTCA CGGGTTCCGC
CCGGTCCGCG GCGTGTACCT GATCGAGGCG CTCCCCGAGC GACTCGCTGG TGGCGGGCTC
GGCGTCGTCC GCACCCTGTT GATGGGTGCC GGCGCGCTTG CTCCCGCGAC GGTGGGGGCG
ATCGCCGACG CGTCCGGGTT CCGACCCGCG TTCGGGCTGC TCGCCGGGGC GCTCGCGCTG
GCGGCGGTCG CGGCGGCGGG GCTGTGGGCG ACCGAGTGA
 
Protein sequence
MTLGRIREYD VLVLTSLIWF LGKFVRYAFP PLFEPLQASY GVSNAAVGAA FSGFMAVYAL 
LQFPSGAIAD RVGAVRVIAL GAVVAGVGSL ALLFDTPFAV LAGAMLVIGA GTGAHKTVAI
RLLSRVYPVR TGRALGAHDT VGALGGVAAP AAVTLFVAAP PALAGFLSRL PGADWRGLFV
VTGVIALALA GAFALRVPGR LPADADRGPE REGSEPGASD YLALFEDRRL AAFVIVTIAF
SFAYNGAVAF LPLYLSQAAG LSTATANLLY SALFAVTFVQ LVSGDLSDRF GRFPVMVAAL
ALAAAALVGV VALARGETGA GPIVLGALVV AFGLGSHGFR PVRGVYLIEA LPERLAGGGL
GVVRTLLMGA GALAPATVGA IADASGFRPA FGLLAGALAL AAVAAAGLWA TE
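
The sequence blocks above are printed in space-separated groups of ten, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. All function names are hypothetical. It assumes an uppercase or lowercase A/C/G/T sequence with no ambiguity codes, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons; the sketch does not handle them, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table built in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence shown above.
print(translate("ATGACCCTCG GCCGGATCAG GGAGTACGAC"))  # MTLGRIREYD
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence block and the GC content field of the record.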