Gene Htur_4091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4091 
Symbol 
ID8744719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp349297 
End bp351540 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content61% 
IMG OID646514651 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003405598 
Protein GI284167320 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0481022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGAG AGTCGGCTGA CCAAGACGTG ATTTTGGAGG TTCGAAACGC CTCGGTCGAG 
TTCGATATGG GCCGCGGGAC GTCGCGGGTC CTCGACGACG TCTCGATGGA TATCCGCCAG
AGCGAAATCC TCGGCGTCGT CGGCGAGTCG GGCTCCGGGA AGTCGATGTT CGCTTCGGCG
CTCCTCGACG CGGTCGTCGA TCCGGGTCAA CTCACCGGGG AAGTGATCTA TCACCCCGAG
GACGGCGATC CGGTCGACAT TGCGAATGCC GATCGTGATC TGCTGAAACG ATACCGTTGG
AAGGAAATTT CGATGGTGTT TCAGGGCGCG ATGAGTTCGT TCAATCCGAC CCAGCGGATC
AGGGATCACT TCACGGAGAC GCTGAAGGCA CACGACCATA CCCTCAAGTC AGGGATGGAC
CACGCGCGTA AGCTGTTGCG AGAACTCCAC CTCGATCCCG ATCGGGTGCT CGATGCGTAC
CCCCACGAAC TGAGTGGCGG AATGCAGCAG CGAGCACTCA TCGCGCTCTC GCTAGTGTTG
AAACCGCGAG TGCTGGTGAT GGATGAGCCG ACCGCGGCGC TCGACCTGTT GATGCAGCGA
TCGATCCTCG GCCTTCTGGA GAACCTCCAG GAGCGGTACG ATCTCACCGT CGTGTTCATC
ACTCACGACC TCCCACTGGT GGCTGGCCTA GCGGACCGGA TCGGCGTCCT GTACGCCTTC
CAGATGGTGG AGGTCGGCCC GACCGACGAA ATCCTCCGCG ATCCCGCCCA CCCATACACG
AGGGACCTGC TCAACGCCGT CCCGAACCTC GAGACACCAC TAGACTCGAT GACACCGATC
GAGGGACAGG CACCCGATCC GGTCAACACG GCGACCGGCT GCCGGTACGT GTCTCGGTGT
CCGCTGGCAA CCGGCGAGTG TCGATCCGAA GACCCGCCGT TCTTTGATGT CAGTGCTGAC
CACGAGACGG CCTGTCACCA CTGGGAGAGA GCGCGCGAGG AGATCCCGTT CGATCAGACC
GAAAGCCCCA AACAGGTCGA AACAGACGTC GTGACAGGTC GCCAGTCAGA TGATCCGGTA
CTCTCGCTCA ACGACGTGGA CGTCCACTTC GAACAGAGCT CGGGGCTTTG GTCGAAACTC
ACTGGCGACA GTGAGACTGT CTACGCTGTC AATGACGTTA CCCTCGATAT CTACGAGAAC
GACGTGGTCG CGCTCGTCGG TGAATCTGGC TGTGGCAAGA CGACGCTTGG AAAGACCGCT
ATCGGTGTCC AACGACCGAC CAGTGGGAGG GTTTCACATC GGGGAGTCGA CGTCTGGGAG
GCCCGCGACG GCGGCGACGA CGCCTACGAC GAGATCCGCT CGTCGCTACA GATTATCCAC
CAGGACCCGG GAAGTTCGCT AAATCCCAAC AGGAGCGTTC AGGAGATCCT CGAGACACCG
CTCAAGCAGG CTCAAAACGA CCTCAGTTTC AAGGATCGAC GGGAACGGAT CATCGCGATG
CTCGAATACG TCGGCCTCTC TCCGGCACGT GACTACGCCG AACGTTACCC ACACCAGCTA
TCCGGTGGCG AGAAACAGCG CGTTGCGCTC GTGCGCGCAC TGTTCATGAA TCCGGACCTT
ATCCTGGCTG ACGAAGCGGT GAGCGCACTG GACGTATCGC TCCGCGTCGA AATGATGGAT
CTGATGCTCG AACTGCAACA GCGGTTCGAT ACGTCGTACC TGTTCATCTC GCACAACTTC
GAGAACGCGC GGTATCTCGC TGGAAGGGTC GACGGCCGGA TCGGCGTGAT GTACCTCGGA
AACATCGTCG AAATCGGTCC CGCCGAGGAA ATCATCCAGA ATCCGCGTCA TCCCTACACG
AAGATTCTTC GCTGGTCGAC GGCGGACATC GACCCGGACG ATAGCAGTGG GGAACTACCG
GTGCGGAACA TCGATATCCC GGATCCCGTC GATCCCCCGA GCGGCTGTTC GTTCCAGACG
CGGTGTCCGA AAGCCCGCGA ACACTGTACC GAGGAGTGTC CGTCACTTGA TCCGGTGAGC
GGGGGTTCGG ACGATCACGC GATCGCCTGC TTCCGCGAGT GCGATAGTGA CCATCCCTAC
TGGGACAGCG AACCGCTCGA CAGTGTCGAT GGCGATGAGT CCATCTTCGG ACCCGACTGT
CCGCCCGAAG AACGCGCGGC ACCGGACGGC GGCGAGCGGA CCTCGGACGA TCACTCCGAG
GACGTACGGA GAGAGGATCG ATGA
 
Protein sequence
MNGESADQDV ILEVRNASVE FDMGRGTSRV LDDVSMDIRQ SEILGVVGES GSGKSMFASA 
LLDAVVDPGQ LTGEVIYHPE DGDPVDIANA DRDLLKRYRW KEISMVFQGA MSSFNPTQRI
RDHFTETLKA HDHTLKSGMD HARKLLRELH LDPDRVLDAY PHELSGGMQQ RALIALSLVL
KPRVLVMDEP TAALDLLMQR SILGLLENLQ ERYDLTVVFI THDLPLVAGL ADRIGVLYAF
QMVEVGPTDE ILRDPAHPYT RDLLNAVPNL ETPLDSMTPI EGQAPDPVNT ATGCRYVSRC
PLATGECRSE DPPFFDVSAD HETACHHWER AREEIPFDQT ESPKQVETDV VTGRQSDDPV
LSLNDVDVHF EQSSGLWSKL TGDSETVYAV NDVTLDIYEN DVVALVGESG CGKTTLGKTA
IGVQRPTSGR VSHRGVDVWE ARDGGDDAYD EIRSSLQIIH QDPGSSLNPN RSVQEILETP
LKQAQNDLSF KDRRERIIAM LEYVGLSPAR DYAERYPHQL SGGEKQRVAL VRALFMNPDL
ILADEAVSAL DVSLRVEMMD LMLELQQRFD TSYLFISHNF ENARYLAGRV DGRIGVMYLG
NIVEIGPAEE IIQNPRHPYT KILRWSTADI DPDDSSGELP VRNIDIPDPV DPPSGCSFQT
RCPKAREHCT EECPSLDPVS GGSDDHAIAC FRECDSDHPY WDSEPLDSVD GDESIFGPDC
PPEERAAPDG GERTSDDHSE DVRREDR