Gene Hlac_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1093 
Symbol 
ID7400165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1097152 
End bp1098819 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content68% 
IMG OID643708159 
ProductABC transporter related 
Protein accessionYP_002565758 
Protein GI222479521 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.403079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCGCC CGATCTTAGA GACCGAGGGG CTGACTAAAC GGTTCGGCTC GCTCGTCGCA 
AACGACAGGC TCTCCGTAAC CGTCGAGGAG GACACGATCC ACGGCATCAT GGGGCCGAAC
GGCTCCGGGA AGTCGACGTT TTTCAACACC GTCACCGGGT TCTACCGCCC CGACGGCGGC
ACGGTCCGGT TCGACGGAGA GGACGTGACC GGGTGGAAAC CCGACGAAAT CGCCCGACGC
GGGCTCGCGC GGACCTTCCA GATCCCGTCG CCGTTCGAGG ACCTCACGGT CAAGGAGAAC
ATGCTCGCCG TGTTCACTGG CGGGCTTCGC TCCGGGATGC GGATCTCGGA GGCGAAACGC
GCCCGCGCCG ACGAACTGCT CGAACTCCTC GAGATCGACC ACGTCGCCGA TCAGGAGGCC
GGCGGGGTCT CGGGCGGGCA GGAGAAACTG CTCGAACTCG GGCGGATCTT GATGCTCGAA
CCGGCCTGCG TGATGCTCGA CGAGCCCACA GCGGGGGTCA ATCCCTCGCT CCGGAATCGC
CTGCTCGAAC ACTTAGAGAC GCTCAACGAC CGCGGCACGA CGTTCGTGAT CATCGAACAC
GACATGCGCG TCATCGCGGA CGTGTGTGAC CGCGTAACCG TGTTCAATCA GGGACAGGTC
CTCGTCGAAG GCGACTTCGA GTCGGTGACG AGCGACGAGC GCGTCCGCGA CGCGTACCTC
GGCGGCGCCG CCGAGCACGA CGCGTCGCTG GAGACGCTCA TCGGGGAGGA GGCGGATGCC
CCGGTCTCCG AGGAGCGTGG CACACCCGCC GACGCCGCGA CGGAACCGAC GCCGACCGCC
GGCGGTGGCG GAGCGGTGCC GGCCGACGGC GCGGGCGAGG CGGCCGGGTC CGCGGCGACC
GGGGGTGCGG CGAGCGCTAC GTCCGCGCTC TCCGGGATCG GCTCGGAGTC CGACGGGGAG
CCGTGGCTCG TCGGCGAGGA CCTCATCAGC GGCTACGGGA ACCACCGGGT CGTCGACGGA
ATCTCGATGG AGAGTCGGGA CGGCGTGACC TGCATCTTCG GGCCGAACGG GTCGGGGAAG
TCGACGCTTT TGAAGACGCT CGCCGGGGTC GTCCCGGCGT GGGAGGGCCG AGTCACCCAC
CGCGGCACCG ACGTGACCCA CAACAGGCCG GCCGAGAACG TCCACCGCGG CGTCACGATG
CTCCCGCAGG ACGGCGGGAT TTTCGGCGGC CTCACCGTCC GTGAGAACCT CCTCCTCGGC
GGGTACACCG TCGGTGAGGG GGCGGTCCGC GAGGAGCGAC TCGACGAGGT GTTGTCGTCG
TTCCCGGAGC TCAAGGACAA ACTGGACGAT CGGGGGCGGT CGCTGTCGGG CGGCCAACAG
ATGATGCTCA GCTACGGTCG CGCGATGATG ACCGGCGCGG AGGTGTACCT CCTCGACGAG
CCGTCGAGCG GGCTGGCCCC CTCCCTCATC GATCAGGTGT TCGAGATGAC GCGACGGCTC
GTCGCCAGCG GCGCGCAGGT GATCCTCATC GAGCAGAACG TCCGCGAGGC GCTTCGGATC
GCCGATTACG TCTACATCCT GGCGCAGGGC CAGCTCCAGT TCGAGGGGAC CCCGGCAGAC
TTGACCGACG AGGACGACCT CGTCGAACTG TACCTCGGAC TCGACTAA
 
Protein sequence
MTRPILETEG LTKRFGSLVA NDRLSVTVEE DTIHGIMGPN GSGKSTFFNT VTGFYRPDGG 
TVRFDGEDVT GWKPDEIARR GLARTFQIPS PFEDLTVKEN MLAVFTGGLR SGMRISEAKR
ARADELLELL EIDHVADQEA GGVSGGQEKL LELGRILMLE PACVMLDEPT AGVNPSLRNR
LLEHLETLND RGTTFVIIEH DMRVIADVCD RVTVFNQGQV LVEGDFESVT SDERVRDAYL
GGAAEHDASL ETLIGEEADA PVSEERGTPA DAATEPTPTA GGGGAVPADG AGEAAGSAAT
GGAASATSAL SGIGSESDGE PWLVGEDLIS GYGNHRVVDG ISMESRDGVT CIFGPNGSGK
STLLKTLAGV VPAWEGRVTH RGTDVTHNRP AENVHRGVTM LPQDGGIFGG LTVRENLLLG
GYTVGEGAVR EERLDEVLSS FPELKDKLDD RGRSLSGGQQ MMLSYGRAMM TGAEVYLLDE
PSSGLAPSLI DQVFEMTRRL VASGAQVILI EQNVREALRI ADYVYILAQG QLQFEGTPAD
LTDEDDLVEL YLGLD