Gene Information |
Locus tag | Hlac_2091 |
Symbol | |
ID | 7400611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2081075 |
End bp | 2082172 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643709162 |
Product | branched-chain/neutral amino acids amide ABC transporter periplasmic substrate-binding protein 1 |
Protein accession | YP_002566739 |
Protein GI | 222480502 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
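The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Hlac_2091.
length = gene_length(2081075, 2082172)
print(length, protein_length(length))  # 1098 365
```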
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.649007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0283766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAA GAATACAGCG ACGCGACGTA CTCAGAGGCG CCGGCGCGGC CGGCATTGCG GCAATCGCCG GCTGCTCGAC CGAGAGCGGT GACGGCGGCG ACGGAAGCGA CGGTTCCGAC GGTTCCGACG GCTCCGACGG AAACGACAGC GGCGACGGAA GCGACGGAAG CGACGGAAGC GATGGCAGCA GCGCGCCCGA CGCCGTGATG GTCGTCGGCT TCCCGGAATC CGGCATCCAG CTGTTCCGCG ACTTCTACTC CGGCTTCGCC AGCGACGTGC CCGAACTCGA CATCATCGTG CCCGACGGTC TCATCGACTC CGACCTCCCC GGTGAAGTCG ACAACGACAT GAACAACGTC ATCGGGACTG CCCCGTCCGC GGGCGGTCCG GGAGCGGAGT TCTTCACCAG TGCGTACGAG GAGGAGTACG ACACCGCGCC TGGCGTGTTC ACCGCGCAGG CGTACGACGC GATGGCGGTC GAGATCCTCG CGGCGACCGC CGCGGGCGAG AACAACGGGG AAGCCATCCG CGACCGGGTC CGGACCGTCG CGAACCCCGG CGGCGAGGAG TTCGGTCCGG AAAACCTCCC CGAAGCGGTC GAGACTGTCG CCGCCGGCGA CCCCGTCCAC TACGTCGGAG CGTCGTCGAG CGTCAACTTC GACGCCAACG GCGACATCGC CACCGCGGCG TACGACGTGC AGGACTTCCA GGACGGAGAG ATCGTCACGC TCGACACGCT GGAGTTCGGC AACGAACTCT CGGAGGAGGA CCTGAACGCG ACGGCGGCGG ACCCCGTCGG AATCGACGGC GAGTTCGAGG CGCAGATCGG CGTGCTGATG CCCGAAACGG GTGACCTCGG CTCGCTCGGC GGTCCGATCC GCGACGGCGC GCTGCTCGCC GCGATACAGG TCAACGACGC CGACCTGAAC GTCACGGTGA ACACCCGCGT GGAGGACACT CAGACCGATC CGCAGGCGGG TATCTCGGGC GCGAATGCGC TCGTCGACGC CGGCTACGGC GCCGTCGTCG GACCGGCCGC CTCGAACGTG AACCTCCAGG TCGCGGACCA AGTGTCAGTA CTTCACAAAG CCGCTTAG
|
Protein sequence | MTRRIQRRDV LRGAGAAGIA AIAGCSTESG DGGDGSDGSD GSDGSDGNDS GDGSDGSDGS DGSSAPDAVM VVGFPESGIQ LFRDFYSGFA SDVPELDIIV PDGLIDSDLP GEVDNDMNNV IGTAPSAGGP GAEFFTSAYE EEYDTAPGVF TAQAYDAMAV EILAATAAGE NNGEAIRDRV RTVANPGGEE FGPENLPEAV ETVAAGDPVH YVGASSSVNF DANGDIATAA YDVQDFQDGE IVTLDTLEFG NELSEEDLNA TAADPVGIDG EFEAQIGVLM PETGDLGSLG GPIRDGALLA AIQVNDADLN VTVNTRVEDT QTDPQAGISG ANALVDAGYG AVVGPAASNV NLQVADQVSV LHKAA
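The relationship between the gene sequence and the protein sequence can be reproduced with a short sketch. It assumes the sequences are pasted as shown here, in space-separated blocks of ten, and uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard table. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers written for illustration. Only the first three blocks of the gene sequence are used in the example.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with each codon position cycling through the bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Assumption: the CDS starts with ATG, as this gene does. Table 11 also
    allows alternative start codons, which this sketch does not handle.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
first_blocks = "ATGACGCGAA GAATACAGCG ACGCGACGTA"
print(translate(first_blocks))  # MTRRIQRRDV
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the protein sequence and GC content fields of the record.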