Gene Hlac_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2098 
Symbol 
ID7400618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2088483 
End bp2089604 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID643709168 
Productinner-membrane translocator 
Protein accessionYP_002566745 
Protein GI222480508 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0559] Branched-chain amino acid ABC-type transport system, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0236105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACAA CTGAATCGTC GATCGTCGAC TCGGCGCGGG CCCGGCCGGG GCTTCTCCTC 
GTCGTTCTGC TCGGCGGCCT GTTGCTCGTC GACCTCGCCG CGAAGCTCGC CGGTTTCTCG
ATTCTCCCGA TCGGCGAGGC GATCTCGATC GATCGGCTCG GCTCGAACCT CTGGAACGGG
GTCGTGATCG GGCTCGTAAT CGGGCTCGCC GGAATCGGCC TCTCGATGAC GTACAGCATC
CTCTCGTTCG CGAACTTCTC GCACGGCGAC CTGCTTAGCA CCGGGGCGTT CACCGGCTGG
GGCGTCGCGT TCCTGATCGC CGGATTCGGC GATATCCCGG TTCGGGCGCT GCTGACCGTT
GGGGACGCCG GGAGCGCGAC CCCCGGCGAC ATCGGGGCGC ACATCCTCTC GACGCCGGTC
GCGATACTCG TCGGGCTGCT CGTGGCCTTT GCGGCCACCG CCGCCGTCGC GCTGGCGCTC
GACCGCGCGT TCTACAAGCC GATGCGGGAC CGCGACGGGA TCTCGATCCT CATCGCGTCG
ATCGGCGCTG CGCTGATCGT CCGGTACGTG ATCCAGTTCG TCTACGGCTC CGACCGGCGC
GGCGTCACGG CGGCCATCGA CGCCTCGAAC CTGGCGTTCG ACCCGCTCGG GCTCTCCGTC
AACGCTCACG AGCTGACCAT CGTCGTCGCC GCGATCGGGC TCATGCTCGC GATGCACTTC
ATGCTCCAGC GCACGAAGCT CGGCACCGCG ATGCGGGCGA TGGCCGACAA CAAGGACCTC
GCCCTCGTCA CCGGCATTCC GGCCGAGCGC GTCGTCACTG CCACGTGGAT CATCGGCGGC
GGGCTGGCGG GCGCCTCGGG GTACCTCTAC GTGCTGCTCC GCGGGACGAT CCAGTTCGAC
TTCGGCTGGC TGCTGCTCCT CTTAATCTTC GCGGCCGTGA TCCTCGGCGG GATCGGCTCG
GTGTACGGCG CGATCGCCGG CGGGCTCGTC ATCGGGATCG TCTTCACCAC CTCGACGGTC
TGGATCCCGT CCGACTTCAA CCAGGCCGCC GCGTTCGCCG TGATGATCAC CATGCTCCTG
TTGCGCCCCG AGGGGCTCTT CGGAGGTGTT TCGACCGCAT GA
 
Protein sequence
MGTTESSIVD SARARPGLLL VVLLGGLLLV DLAAKLAGFS ILPIGEAISI DRLGSNLWNG 
VVIGLVIGLA GIGLSMTYSI LSFANFSHGD LLSTGAFTGW GVAFLIAGFG DIPVRALLTV
GDAGSATPGD IGAHILSTPV AILVGLLVAF AATAAVALAL DRAFYKPMRD RDGISILIAS
IGAALIVRYV IQFVYGSDRR GVTAAIDASN LAFDPLGLSV NAHELTIVVA AIGLMLAMHF
MLQRTKLGTA MRAMADNKDL ALVTGIPAER VVTATWIIGG GLAGASGYLY VLLRGTIQFD
FGWLLLLLIF AAVILGGIGS VYGAIAGGLV IGIVFTTSTV WIPSDFNQAA AFAVMITMLL
LRPEGLFGGV STA