Gene Hlac_2521 details

Gene Information

Locus tag: Hlac_2521
Symbol:
ID: 7401573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2499609
End bp: 2500655
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 62%
IMG OID: 643709593
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_002567164
Protein GI: 222480927
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
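
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2499609, 2500655)
print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length (1047 bp) and protein length (348 aa).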


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.519073
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGATT TTTACCGTCG ATTGCACCGG ACGATCAGTC CCTCCTCGAG GGACGACGAC 
AGCGAGGTCC GTACCGACGG CGGGGTCGTG AGCGACGGCC CTGGCGCCGA CCCGGCGCCG
GAACCGGACC GGGACCTGAC GACCCGTCTC AAGGCGACCT TGGACCAGCG CTTCGGGAGC
GACTTCATCG AGTCGTCCGT CTTCTGGCTC CCTCCGTTCC TGCTGATGGG GCTGTTCGTC
TACGGTGCGA TCATCTGGAA CCTGCTGATC TCGCTGACCG ACTACCAGCG CTTCGAGAAC
GCGCCGGACT ACTCGAACCT CGACTTCGAG ATGTACACGC GTGCGCTCGC AGACACCGGG
TTCATCGACG CCGCGATCAA CACGCTCATC CTGCTTATCG CGTTCACGGC GGGGACGCTC
GTGGTCGGCC TCGTGCTGGC TATCCTAATC GATAGAGGGA TCCGGTTCGA GAACACGTTC
CGGACGATCT ATCTCCTGCC GATGAGCCTC TCGTTCGTGG TGACCGCCCA GTTCTGGCTG
TGGATCTACA ACTACAACAA CGGGATCGCC AACAACGTCA TCGGCACTGT CGGTCTCGGC
CCAGTGAGCT GGCTCGGCAA CCAGGACATC GTCCTCTACG CGGTCATCTT CGCGTTGATG
TGGCAGTTCT CGGGGTACGC GATGGTCGTG TACCTCGCTG GGCTCCGAGC CATTCCGACA
GAGCACTACG AGGCGGCCAC GGTCGACGGC GCGTCGACCC TGAAGATGTA CTGGCGCGTT
ATCATCCCCC AGTTGAAGGG CGCGACGATC AGCGCCGCCG TAGTGCTGAT GGTGTTCGGG
ATGAAGGCCT TCGACTTCCT CTACTCGCTG TCAGGGGGAT ACCGGCCGCC GAACGGCGCC
GATATCTTAG CGACGAAGAT GGTTCGTGAG GCGTACGCGA ATCTCAACTG GGCGTACGGG
TCGGCGATCG CGATCGTCCT GTTCGGAATG GCGCTCAGCG TCATCGGCCC CTACCTTGTG
TACGAATACC GGAGGGACAA CCTATGA
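
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of removing the whitespace and computing a GC fraction. How the source database computed its GC content value is not stated, so this is only one plausible way to calculate such a figure; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demonstrated on the first two blocks only; paste the full gene
# sequence here to compute the value for the whole gene.
first_blocks = "ATGCTCGATT TTTACCGTCG"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```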
 
Protein sequence
MLDFYRRLHR TISPSSRDDD SEVRTDGGVV SDGPGADPAP EPDRDLTTRL KATLDQRFGS 
DFIESSVFWL PPFLLMGLFV YGAIIWNLLI SLTDYQRFEN APDYSNLDFE MYTRALADTG
FIDAAINTLI LLIAFTAGTL VVGLVLAILI DRGIRFENTF RTIYLLPMSL SFVVTAQFWL
WIYNYNNGIA NNVIGTVGLG PVSWLGNQDI VLYAVIFALM WQFSGYAMVV YLAGLRAIPT
EHYEAATVDG ASTLKMYWRV IIPQLKGATI SAAVVLMVFG MKAFDFLYSL SGGYRPPNGA
DILATKMVRE AYANLNWAYG SAIAIVLFGM ALSVIGPYLV YEYRRDNL
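
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration of that translation. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in the start codons they permit, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGCTCGATT TTTACCGTCG ATTGCACCGG"))
print(translate("TACGAATACC GGAGGGACAA CCTATGA"))
```

The two fragments translate to the first ten residues (MLDFYRRLHR) and the last eight residues (YEYRRDNL) of the protein sequence listed above.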