Gene Hlac_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0331 
Symbol 
ID7399721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp353992 
End bp355167 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID643707393 
Productphosphate transporter 
Protein accessionYP_002565005 
Protein GI222478768 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0306] Phosphate/sulphate permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.799438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0469398 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGTCG TAACTGTCGC CACCCTCGGT GTCGCCGCCG CCGCCAGCCT CTTTATGGCG 
TGGTCGATCG GCGCCGGATC CTCCGGATCG ACACCGTTCT CTCCGGCGGT CGGTGCCAAC
GCCATCTCCG TGATGCGCGC GGGGCTCGTG GTCGGTGTGT TGGGGTTAAT GGGCGCGATC
CTCCAGGGCG CGAACGTGAC AGAGGCGGTG GGAACCGAGC TGATCGGCGG CGTCACCCTC
ACCGCCGGCG CGGCCATCGT CGCGCTGCTC ACTGCGGCCG CGCTCGTCGC GATCGGCGTG
TTCGCGGGGT ACCCGATCGC GACCGCCTTT ACCGTCACCG GCGCGGTCGT CGGCGTCGGG
CTCGCGATGG GCGGCGACCC GGCGTGGCCG AAGTACACCG AGATCCTCAC GCTGTGGATC
CTCACCCCGT TCGTCGGCGG CGGCGTCGCC TACGGCGTCG CGCGGATGCT CATCGGCGAG
CGGCTTCCCG AGCGAGCGCT CACCGCGGCG CTCGCCGGGC TGGTCGGCGC GATCGTCGCG
AACGTCGGGT TCGCGCTGCT CGGGCCGGCG GGCCAGCAGG CGTCGCTGTC GGAGGCGTTC
GGTTCCGGGC TCGGGATCGG CGCGATCGGC ACGCCTCTGG TCACGGTCGC GGTGGCGGCG
GTCGTCGCGC TCGCGGTGTA CGCCGACCTC GGTCGCGACC GCGAGGGCGC CCAGCGCCGA
TTCCTCCTCG CGATGGGCGG ACTGGTCGCG TTCTCGGCCG GCGGCTCGCA GGTCGGGCTC
GCGATCGGCC CGCTCGTCCC GATCTTCAGC GATGTCGGGG TCCCGCTGTG GGCGCTGCTC
GTCGGCGGCG GCGTGGGACT CCTCGTCGGA TCGTGGACCG GCGCGCCGCG GATGATCAAA
GCGATCTCGC AGGACTACGC CTCGATGGGG CCGCGGCGGT CGATCTCGGC GCTCATCCCG
TCGTTCGCGA TCGCGCAGAT CGCGGTCGCG TTCGGGATCC CCGTCTCGTT CAACGAGATC
ATCGTCTCCG CCATCGTCGG CGCGGGTTAC GCCGCGGGCG ACGCGGGCGT GAGCCGGTCG
AAGATGGGGT ACACCGTGTT CGCGTGGATC GCGTCGCTCG TCGGGTCGCT GGCGCTCGGG
TTCGGCGTAT ACTCCGCCGT GCAGTTCGTG CTCTGA
 
Protein sequence
MVVVTVATLG VAAAASLFMA WSIGAGSSGS TPFSPAVGAN AISVMRAGLV VGVLGLMGAI 
LQGANVTEAV GTELIGGVTL TAGAAIVALL TAAALVAIGV FAGYPIATAF TVTGAVVGVG
LAMGGDPAWP KYTEILTLWI LTPFVGGGVA YGVARMLIGE RLPERALTAA LAGLVGAIVA
NVGFALLGPA GQQASLSEAF GSGLGIGAIG TPLVTVAVAA VVALAVYADL GRDREGAQRR
FLLAMGGLVA FSAGGSQVGL AIGPLVPIFS DVGVPLWALL VGGGVGLLVG SWTGAPRMIK
AISQDYASMG PRRSISALIP SFAIAQIAVA FGIPVSFNEI IVSAIVGAGY AAGDAGVSRS
KMGYTVFAWI ASLVGSLALG FGVYSAVQFV L