Gene Hlac_3531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3531 
Symbol 
ID7402374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp278114 
End bp279313 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content58% 
IMG OID643710069 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002567635 
Protein GI222481399 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCCA GCGATAGACA GATAACCAGT TTCACGACGC TCGGTCACGC ACTGTTTCAC 
ACGTACGAGT TGTCGATTCC GCTCTTCATC GGGCTGTGGA TAACCGAATT CGGTCTCTCT
GCAGCACTGA CCGGCCTCGT AGTTGGGGCA GGATACGCGC TTATTGGAAT TGGAGCACCA
GTCAGTGGCG TCCTCTCAGA TTATTTCGGG TCTCGGCGGC TGATACTTCT ATCCGTGCTC
GGAATGGGTG GTGGCTTCGC CCTGCTTGGA GCAGCACAGG GGCCCCTGTC GCTGGCAGCC
TGTGTCGTAC TCTGGGGAGC ATTTGCAAGT TTGTACCATC CTGCAGGACT CTCGTTGATT
AGCCGGGGGG CCTCCGAACG AGGAACGGTG TTTGCCTATC ACGGTGCTGG CGGCAATATC
GGGACGGCAG CTGGACCGCT CTGTACAGCT CTCCTGCTAT CGGTGTTTCA CTGGCGCATA
GCGGCGGTAG TTCTATTCGT CCCAGCGGCT GTCGCCGCCT TCGTCGGAAT GCGGATCTCG
TTCGACGATA TCAAATCGAA GGACGGCGAT AACCCGGACT CGATGCGAGG TGCGTTCACA
GAGACACTTG TCGATTCACG TCGGCTGTTC ACAGTCGGAT TCAGCATCGC ATTCATCACT
GTACTGCTGT ACGGAACCTA CTACCGTGGT CTCCTGACGT TCTTGCCGGA CATACTGGGT
AATTCCTCGC TGGACGACCT GACAATTCTG AGCTACTCGT TGGGACCCGC GGAGTATATC
TACACCAGCA TGTTGACCTT CGGAATCGCG GGACAGTACG CCGGCGGAAA ACTCACCGAC
CGTATCCCGA GTCGGACGGC GTTTCTCGGT GCGTTGAGTT CGCTCGTTGT GCTTGCCCTC
CTCTTTATTC TCGTCCAAGG ACAGGGCTTC GTGCCGCTGG TTCTGGTCAG TCTGGCGCTC
GGATTTTTCG TCTACGCGAC GGCACCCATC TATCAGGTCG TCATCGCCGA GCACGTTCCG
AGCGAGAGTC ACGGCCTCTC CTATGGCTTC ACCTACCTGG CCATGTTCGG CATCGGGGCC
CTCGGGGCAA CGATTGCCGG CACGCTGCTG ACGTACGCGA CGACAACGAT ACTGTTCGTC
GCACTGGCTA TGCTGGCGGC GACCGGATGT CTCTGCCTCC TTGTCCTCCG GTGGCTCTGA
 
Protein sequence
MQASDRQITS FTTLGHALFH TYELSIPLFI GLWITEFGLS AALTGLVVGA GYALIGIGAP 
VSGVLSDYFG SRRLILLSVL GMGGGFALLG AAQGPLSLAA CVVLWGAFAS LYHPAGLSLI
SRGASERGTV FAYHGAGGNI GTAAGPLCTA LLLSVFHWRI AAVVLFVPAA VAAFVGMRIS
FDDIKSKDGD NPDSMRGAFT ETLVDSRRLF TVGFSIAFIT VLLYGTYYRG LLTFLPDILG
NSSLDDLTIL SYSLGPAEYI YTSMLTFGIA GQYAGGKLTD RIPSRTAFLG ALSSLVVLAL
LFILVQGQGF VPLVLVSLAL GFFVYATAPI YQVVIAEHVP SESHGLSYGF TYLAMFGIGA
LGATIAGTLL TYATTTILFV ALAMLAATGC LCLLVLRWL