Gene Hlac_1055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1055 
Symbol 
ID7400127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1050516 
End bp1052213 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content63% 
IMG OID643708123 
ProductBCCT transporter 
Protein accessionYP_002565722 
Protein GI222479485 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC CCGGATCGAG CATGATCGAC GAGTTCCGCG AAGAGATCGA TCCGATCGTG 
TTCGCGTTCG GCGCCCTGTT GACGGTGGGT GTGATCGCCG CGTTCTTCAT CAGCCCGTCG
GCCGTCGAAA ACGGTATTTC GTCGCTGAAC AACCAACTGC TCGGCGCGTT CAACTGGGCG
ATGATGTTGA TCGTGTTCCT GATCGTGCTG TTCCTGCTCT TTCTGATCGT GGGTCCGTGG
GGCGGAATCA AGTTCGGTGA CGAGCCCCCC GAGTACAGTT TTCTCTCCTT TTTCGCGATG
CTGTACTCCG CGGGGTTCGC GGCCGGCGTC GTGTTCTGGG GGCCGACCGA GGCGCTGTTC
TACTACAACA ACCCCTCACC GCTCTTCAGT AACATCGAGG GGGGAACGGC CGAGGCGATG
ACGATTGCCG TCCAGCAGAC GCTGTTCCAC TGGGCGCTGC CACAGCTCGC GGTGTTCACC
ATTATGGGCA TCGCGATCGG CTACTTCGCG TACAACTACG ACAACGTCCC GCTGCGCGTC
TCTTCGGCGC TCACGCCGAT CCTCGGCGCC GACAACCTCG ACGGTCCGGC CGCGAAGGTC
GTCGACATCC TCGCCGTCTT CGCGACGATC GGCGGCGTCG CCACGTCGCT CGGCTTTATC
GGGAGCCAGT TCGTCACCGG CCTCGACTAC CAGTGGGGGA TCAACATGGG CGATCTGGGC
ATTCTGCTCG TCGTCACCAT GATGACCCTG CTGTTTACGA CCTCGATGGT ACTCGGGGTC
GACAAGGGGA TCCGTCGGCT CTCGAACTTC AACATGATCC TGTTCGTCGT GCTCATGTTC
GCGACGTTCA TCGTGGGCCC GACGCTGTTC CTCGTGCTGC TCGGCACGCA GGCGTTCGGC
GGGATGGTCG CCGATTTCGT CTCGATGAGC CTCTTTACCG GTGCCGGGCC GATGGGTGCG
GGCGACGGGA GTGCGACCGG CTGGATAAAC GCGTGGACGG TGTTCTACTG GGCGTGGGCG
CTCTCGTGGT CCCCGTTCGC GGGGCTGTTT ATCGCCCGGA TATCCAAGGG ACGGACCGTC
CGCGAGGTCG CGTTCACGGG GATCGTCGCG ACCTCCGGCG CCACAATCCC GTGGTTCACG
TTCGTCGGCG GCACCGCGGT CTGGGCACAG CACAACGGTA TCGCCGACTT CAGCGCGGTG
ATCGCCGGCA ACGCGGGCCC GGAGGTGTCC GGATTCATCC TCTTCGAGGC ACTGAACTTC
ACGCTGAATC TCGGGGGGGC ATCGATGACG ATCCCGATCG GGTCCGTACT GATCTACGCG
TTCATGATCC TCGTGACGAC GTTCTTCGTC ACGTCGGCGG ACTCCTCGAC GCTGGCCGTC
TCGATGATGA CCACCGGCGG AAAGGCCAAA CCGTCGAACA TCAACCGCAT CTTCTGGGGC
GTCGTCCTCG GGTTGACCGC AGCGATCCTG ATGATCCTCG GCGGTAGCGG CAGCGCGAAC
ACGCTCCAAC AGGCGGCGAT CATCACCGGG ACGCCGTTCG CCTTCGTCTG CTTCCTCGCG
ATGCTGTCGC TGATCAAGGA CTTCGGGGAG AACTACGATC AGGTGCTCTT CCAGAACGAA
ACCGTGCTGT GGGGATCGGG GAAGGACGCA GACTCCCCGT CCAGTTCGGT TGAATCTGCC
GGGTCAGACG ACGACTAA
 
Protein sequence
MSDPGSSMID EFREEIDPIV FAFGALLTVG VIAAFFISPS AVENGISSLN NQLLGAFNWA 
MMLIVFLIVL FLLFLIVGPW GGIKFGDEPP EYSFLSFFAM LYSAGFAAGV VFWGPTEALF
YYNNPSPLFS NIEGGTAEAM TIAVQQTLFH WALPQLAVFT IMGIAIGYFA YNYDNVPLRV
SSALTPILGA DNLDGPAAKV VDILAVFATI GGVATSLGFI GSQFVTGLDY QWGINMGDLG
ILLVVTMMTL LFTTSMVLGV DKGIRRLSNF NMILFVVLMF ATFIVGPTLF LVLLGTQAFG
GMVADFVSMS LFTGAGPMGA GDGSATGWIN AWTVFYWAWA LSWSPFAGLF IARISKGRTV
REVAFTGIVA TSGATIPWFT FVGGTAVWAQ HNGIADFSAV IAGNAGPEVS GFILFEALNF
TLNLGGASMT IPIGSVLIYA FMILVTTFFV TSADSSTLAV SMMTTGGKAK PSNINRIFWG
VVLGLTAAIL MILGGSGSAN TLQQAAIITG TPFAFVCFLA MLSLIKDFGE NYDQVLFQNE
TVLWGSGKDA DSPSSSVESA GSDDD