Gene Hlac_0631 details


Gene Information

Locus tag: Hlac_0631
Symbol:
ID: 7401766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 650748
End bp: 651743
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 64%
IMG OID: 643707697
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_002565303
Protein GI: 222479066
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
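
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that does not appear in the protein; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(650748, 651743)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```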


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.40027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCTT CGCAAATCGG CACCGACGTC AGCGGGCTGT CGCGCCTCCG CGCAGCGCTG 
CCCTTCTCCA GTCGCGACTG GGGGCTCCTG CTCGTCGCCC CCGGCGTGCT CCTCTTCTCC
AGTTTCATGC TGTACCCGAT CTTCTATCTC TTCTACATCT CGCTCACCGA CGCAACGTTC
GCCGGGTCGG TGATCGGGGG CGGTGCCGAG CTGATCGGGC TGGCCAACTA CGTCCAGCTG
ATCGGCGACT CGCAGTTCTG GACATCGATG ACGACGACGT GGCTGTTCGT CGCCGTCTCG
CTCGTTCTCA AGGTGTTCCT CGCCGTGGGG ATCGCCCTCC TGTTGAACCA CGTGCGCGTC
GCCGGCAAGC GGTACATGCG CGCGGCGGTG ATCGTCCCGC TGGGATTCCC CGGCATCTTC
ACGATCACCG TCTGGCGTGG GATGTTCAGC GACGCACGGT ACGGCGTGTT CAACACAATC
TTGGGCCGCT ACAACGAGTT TATGTCGTCG CTGTCGGCCC CCGAACTCCT CCTCTTTGAC
GTGCCGATCG GCTTCCTCAG CGGTCGCTGG GAGGCGTTCT TCGCGTACGT CACCACAGAG
GTATGGCTGG CGTACCCGTT CATGGTGATC ATCATCGTGA GCGCACTGCA GGACGTGCCC
CGATCGCTCC ACGAGGCCGC GATGGTCGAC GGCGCCGGGG CGCTCCAGCG GTTCCGAACC
GTGACGCTGC CCGCGATCAA GGGGCCGGTG TTGTTCGCAT CCATCCTCAC CGCTGCGACG
TCGTTCCAGC AGTTCCTGAT CCCGTGGGTG TTCAATCAGG GCGGCCCGTC GCGGCAGAAC
GAGCTGATCA TCGTGTACGG CTACCGCGAG GCGATCACGT TCAATCAGTT CGGCTTATCG
GCCGCAATCT TGATCGTCGG CATCGTGTTC GTCGGGCTGT TCATGTACGC CGCCGTCCGC
TACGGCGGCC TTGCCGAGGG GGTGGGTGAC GAATGA
 
Protein sequence
MASSQIGTDV SGLSRLRAAL PFSSRDWGLL LVAPGVLLFS SFMLYPIFYL FYISLTDATF 
AGSVIGGGAE LIGLANYVQL IGDSQFWTSM TTTWLFVAVS LVLKVFLAVG IALLLNHVRV
AGKRYMRAAV IVPLGFPGIF TITVWRGMFS DARYGVFNTI LGRYNEFMSS LSAPELLLFD
VPIGFLSGRW EAFFAYVTTE VWLAYPFMVI IIVSALQDVP RSLHEAAMVD GAGALQRFRT
VTLPAIKGPV LFASILTAAT SFQQFLIPWV FNQGGPSRQN ELIIVYGYRE AITFNQFGLS
AAILIVGIVF VGLFMYAAVR YGGLAEGVGD E
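
The sequences above are printed in blocks of ten residues separated by spaces and line breaks, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the formatting, estimate GC content, and translate the gene. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch ignores); the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence, copied with their original spacing.
start_of_gene = clean_sequence("ATGGCCTCTT CGCAAATCGG C")
print(translate(start_of_gene))  # MASSQIG
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the listed protein sequence, and `gc_fraction` can be compared against the GC content field.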