Gene Information |
Locus tag | Hlac_0243 |
Symbol | |
ID | 7401169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 260893 |
End bp | 263604 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643707306 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_002564918 |
Protein GI | 222478681 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain; [TIGR02323] phosphonate C-P lyase system protein PhnK |
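The length fields above follow from the coordinates. Assuming Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein, the arithmetic can be sketched as follows. The function names are hypothetical and not part of the database.

```python
# Minimal sketch: derive the length fields of the record from its coordinates.
# Assumptions (not stated in the record itself): coordinates are 1-based and
# inclusive, and the final codon of the CDS is a stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon terminates translation and adds no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(260893, 263604)
print(length, protein_length_aa(length))
```

With the values in this record, the result agrees with the listed Gene Length (2712 bp) and Protein Length (903 aa).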
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.351825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCTC TACTTTCGCT CTCTGACGTC CGTACCCAGT TCTCGACGGA ACGGGGACAG GTGAAGGCGG TTGACGGAAT CGATCTGGAG ATCCGCGAAG GCGAGACCGT TGGACTCGTC GGCGAATCGG GCTCCGGGAA AAGCGTCACG GCGCTGTCGA CGATGGGGCT CGTCGACGAC CCCGGCGAGA TCGTCGGCGG AACCGTCGAG CTGACGGACG CCCGAGTTGC GGACGAGCTC CGCGACCGGT ACGACGCCGC TGAATTCGTC GACGGCGACA CGATCGATCT CACGGCCGCC CCCGAGGAGG CGCTTCGGTT CGTCCGCGGC CGGGAGATAA GCATGATCTT CCAAGACCCG ATGACGTCGC TGAACCCTTC AGTGACGGTC GGCGAGCAGG TCGCAGAGAG TCTCAAACTC CATCAGTACG GCGGGCGTCG CAAGGACTCG TGGTTCAACG CGGTGCGCGA GATCCTCCCG AAGATCAGCC GCGACGTGGA CGACGAAGTC CGCGAAGAGA CCATTGAGGT GCTTGAAGAG GTCGGCATCC CGGAACCGGG CGCGCGGATC GACGAGTACC CGCACGAGTT CTCCGGCGGG ATGCGCCAGC GGGTGCTCAT CGCGATCGCG TTAGCCTGTC AGCCGGGGCT GCTCGTGGCC GACGAGCCGA CGACCGCGCT GGACGTGACC ATTCAGGCGC AGATCCTCGA CCTGATAGAC GACCTGCAGG CCGACCTGGG GATGTCGGTG CTGATGATCA CTCACGACCT CGGCGTCGTG GCCGAGACGT GCGACCGCGT CGCGGTGATG TACGCCGGCG AGATCGTCGA AGAGGGCCCC GTCGAGGAAA TCTTCGGGAA CCCGTCGCAC CCGTACACGT ACACGCTCCT CGAGTCCCTC CCGAGCGAGG AGAAAGAGCG CCTCACGCCG ATCGAGGGGA ACGTCCCCGA CCTCATCGAT ATGCCGTCGG GGTGTCACTT CGCGCCGCGG TGCCCGTGGG CGACCGAGGA GTGTACGAGC GGCGAGATCC CGTACCTCCA GCACGGCGCC GAGGATGTCG ACCATCGGTC GAAGTGCATC ATGGAGTCGT TCGACAAGAA CGAGTACGGC GACGACGCTG TCGGCCCCGG GCGCGACCGG TCGATCGGCG AGCCCCTCGT CGAGATCGAC GGACTCCGGA AGTACTACGA ACAGACGGAC AGCGTGCTCG ACCGGGTTCT CGGTGCCGAC GACCGCAGCG TGAAGGCGGT CGACGGGATC GACTTCACGA TCAACCGCGG CGAAACGCTG GGACTCGTCG GCGAATCCGG TTGCGGGAAG TCGACGGCCG GGCGAGCGCT GTTACACCTC ACCAAACCGA CCGACGGCCG GGTGGTGTTC GCCGGGACCG ACCTCACGGA ACTCGACGGA TCGGCGCTGC GAGAGCAGCG GAAGAACCTC CAGATGATCT TCCAAGACCC GCTCTCCTCG CTCGATCCGC GACAGACGGT CGGCCAGACG ATCCGCGAGC CGCTGTCGAT CCACGACTTG CCCGAGAGCG ACCCGAGCGT GGCGACCGAG GCCGAGGTGA CCGTCTCCGG GATCGCCCGC GACAGGGTCG GCGTGACGGT CGACGACGAG ATCGACGCCG TCGTCGGCTC CGGGAGCGGC GTGGCGACGG CCCACGTCGA CGTAACGGTC GCGGACGGCG AGGTTGACGT CGACGTGCGC GAACACCTCG GGGTCGAGGG GACGGTCGAG CGCACTGACG CCGGCGACGT GGAACGCGTC TCGGTGACGG TCTCTGCCGG CGACACCGAC 
CGGCTCAGGC GGCGGCGTCG CGTCCGACAA CTGCTGGAGG CAGTCGGTCT CGAGGTGGGC CAGTACGACC GGTACCCCCA CGAGCTGTCC GGCGGCCAGC GCCAGCGCGT CGGCATCGCT CGGGCACTCG CGGTCGATCC GGAGTTCATC GTCGCCGACG AGCCGGTGTC GGCGCTCGAC GTGAGCGTGC AGGCGCAGAT CCTCAACCTG ATGGAGGATC TTCAGGACCG GTTCGACCTG ACGTACCTGT TCATCGCGCA CGATCTCTCG GTGGTGCGCC ACATCAGCGA CCGCGTCGCC GTGATGTACC TCGGCGAGAT CGTCGAGGTG GCGGCGACCG ACGAGCTGTT CGCGGACCCG CAACACCCGT ACACGAAGGC GTTGCTCTCC GCGATCCCCG CGCCGGACCC GACGGTGGAC ACCGACGATC GCGTCATCCT CGAAGGCGAC GTGCCGTCGC CGATCGACCC GCCATCGGGG TGTCACTTCC GGACCCGGTG TCCGTCGGTG ATCCCCCCCG CCGATCTCGA CATCAAACAG GAGACGTACC GTGAGGTGAT GAACTACCGA CAGCGGGTCG ATCGACAGGC GATCGACGTC GAGACGATCC TCGAGGCGGC CGACGAATCG CCCGGACAGG TAGCGGCCGA CGGCGGCACC GCGTCCGCTG CCCCGTCCGG GGGTGACGCG ATTCCGCCCG GTGCGGTCGA GGCGGTCCGC GACGTCCAGT TCGATCAGTA CCCGGACGGA CGCGCCGGCG AGGTCGTCGA TCGCTCCCTC AGACTGGTGA TTGCCGGCGA GTGGGAGGAG GCGGCCGATA TCCTCGAGGA GACGTTCGCC AGCGTCTGCG AGCGCGAGGA GCCCACGCTC CCCGACGGCG ACCATCCGGC TGCGTGCCAC CTGATCGACT GA
|
Protein sequence | MPPLLSLSDV RTQFSTERGQ VKAVDGIDLE IREGETVGLV GESGSGKSVT ALSTMGLVDD PGEIVGGTVE LTDARVADEL RDRYDAAEFV DGDTIDLTAA PEEALRFVRG REISMIFQDP MTSLNPSVTV GEQVAESLKL HQYGGRRKDS WFNAVREILP KISRDVDDEV REETIEVLEE VGIPEPGARI DEYPHEFSGG MRQRVLIAIA LACQPGLLVA DEPTTALDVT IQAQILDLID DLQADLGMSV LMITHDLGVV AETCDRVAVM YAGEIVEEGP VEEIFGNPSH PYTYTLLESL PSEEKERLTP IEGNVPDLID MPSGCHFAPR CPWATEECTS GEIPYLQHGA EDVDHRSKCI MESFDKNEYG DDAVGPGRDR SIGEPLVEID GLRKYYEQTD SVLDRVLGAD DRSVKAVDGI DFTINRGETL GLVGESGCGK STAGRALLHL TKPTDGRVVF AGTDLTELDG SALREQRKNL QMIFQDPLSS LDPRQTVGQT IREPLSIHDL PESDPSVATE AEVTVSGIAR DRVGVTVDDE IDAVVGSGSG VATAHVDVTV ADGEVDVDVR EHLGVEGTVE RTDAGDVERV SVTVSAGDTD RLRRRRRVRQ LLEAVGLEVG QYDRYPHELS GGQRQRVGIA RALAVDPEFI VADEPVSALD VSVQAQILNL MEDLQDRFDL TYLFIAHDLS VVRHISDRVA VMYLGEIVEV AATDELFADP QHPYTKALLS AIPAPDPTVD TDDRVILEGD VPSPIDPPSG CHFRTRCPSV IPPADLDIKQ ETYREVMNYR QRVDRQAIDV ETILEAADES PGQVAADGGT ASAAPSGGDA IPPGAVEAVR DVQFDQYPDG RAGEVVDRSL RLVIAGEWEE AADILEETFA SVCEREEPTL PDGDHPAACH LID
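The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned up, checked for GC content, and translated is given below. It uses only the Python standard library; the function names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon layout is used here.

```python
# Minimal sketch: normalise a block-formatted sequence field, compute its GC
# fraction, and translate it. Only unambiguous bases (A, C, G, T) are handled;
# any other symbol raises KeyError during translation.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCTCCTC TACTTTCGCT CTCTGACGTC"))
```

Applied to the first three blocks of the gene sequence, this yields MPPLLSLSDV, the first block of the protein sequence. Passing the full Gene sequence field to gc_fraction gives a value that can be compared with the GC content listed in the record.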