Gene Htur_2084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2084 
Symbol 
ID8742684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2150406 
End bp2152718 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content68% 
IMG OID646512666 
Productamino acid permease-associated region 
Protein accessionYP_003403640 
Protein GI284165361 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGGA GCGACGAGGA ACTCGCCAAG GACCTCGGAC CGCTCGCCGC GCTGACGATC 
GGCGTCGGGA CGATGATCGG CGCGGGGATC TTCGTTCTCC CCGGGGAGGC CGTCGCGGAT
CTCGGCCCGC TGGCGTCGCT GGCGTTCGTC ATCGGCGGCG TGATCGCCCT GTTCACGGCG
CTCTCGGCGT CCGAACTCGG CACGGCGATG CCCGTCTCCG GCGGCGCCTA CTACTACGTC
AATCAGGGAC TCGGGCCGCT GTTCGGCTCG ATCGCCGGCT GGGGGAACTG GATGGGGCTG
GCGTTCGCCT CCGCGTTCTA CATGTACGGG TTCGGCGAGT ACGTCAACCA GTTCGTGACC
GTCCCGGCGT TGACGCTCGG TCCCGTCGGC CTCGAGTCGG CCCAGTTGAT CGGGCTGGTC
GGCGCCGCCT TCTTCATCAC CGTCAACTAC GTCGGCGCGA AGGAGACCGG CCGGTTACAG
AACATCATCG TCGTCACGTT GGTCGGTATT CTGGGCGTCT TCACGCTGTT CGGTCTCATG
AACGCCGATC TGGAGACGCT CCGCCCCGTC GATCCCTTCG GCTGGGCGCC GCTCCTCCCG
GTGACGGGGC TCGTCTTCGT CTCCTACCTC GGGTTCGTCC AGATCACGTC CGTCGGCGAG
GAGATTCAGA ACCCCGGCCG GAACTTACCG CGGGCGGTCA TCGGGAGCGT CGTCATCGTG
ACCGTGATGT ACGCGCTGAT CCTCCTGACG GTGCTCGCCG CCGTCGAGAC CGAGGTCGTC
GCGAACAACG AGACCGCCGT CGTCGACGTC GCGCAGATGC TGATGGGGCC GATCGGCGCC
GCCGCGCTGC TGTTCGGCGG GCTGTTGGCG ACCGCCTCGT CGGCGAACGC CTCGATCCTC
GCGTCCTCGC GGATCAACTT CGCGATGGGT CGGGACAAGC TCGTCTCGCC GAAGATCAAC
GAGATCCACC CGCGGTTCGC GACGCCGTAT CGCGCGATCG CGATCACGGG CGCGCTCATC
CTGCTGTTCA TCGCGCTGGG TAACCTCGAG ATGCTGGCGT CGGCCGGCAG CGTCCTCCAC
CTCATCGTCT ACGGGCTCCT CAATATCGCG CTGATCGTCT TCCGCGAGGC GGAACCCGCC
GGCTACGACC CCGATTTCGA AGTCCCATTG TATCCCATCA CGCCGATCCT GGGAGCGGTG
CTCTCGCTCG CGCTGATCGG ATTCATGGAA CCGACCGTCA TCCTCCTCTC GATGGCGTTC
GTCGTGTTCG GTCTCGTCTG GTACCTCGGC TACGCCCGGT CGGAGATCGA ATCGCAGGGC
GTGCTCGCCG ACTACGTCCT CGAGCGATCC GACGAACTCC CCGACGCGGC GGTGTCGGCG
ACGACGGCGG TCAAACCGGA GGGCGGCGAC TACCGCGTTA TGGTCCCGCT CGCGAACCCC
GCCCACGAGA AACACCTCAT CACCCTCGCC TCCGCCATTG CGGACCAGAA CGACGGGACG
GTCGTCGCCG TCAACGTCGA GAAGGTCCCC GACCAGACGT CGCTGACGGC CGCACGCGAC
CAGCGGGACC ACGAGGCGGC CGAACACCTG GTCGAGCAGG CGCGAGCCGA CGCCGAGACG
TACGGCGTCG ACGTGGAAAC CCACGTGGTT CTCTCGCACC GCGGCGTCGA GGAAGTGTTC
GACGCCGCCA CGCGCTACGA TGCCGACGTC TGCGTGATGG GCTGGGGTCC GGACTCGCTG
GGATCGTCGG GCCGCGTGGA GAGCAGGACC GACGAACTCG CCCACTCGCT GCCGTGTGAC
TTCCTCGTGT TTCGGGACCG CGGGTTCGAC CCGTCCCGGA TCCTCCTCCC GACCGCGGGC
GGTCCGGACT CCGACCTCGC GGCGGCCGTC GCGCGCTGTC TGCGCGATCA GTACGACGCC
GAGGTGACCT TACTCCACGT CGCGGACGAT CCCCAGCAGG GACGCGCGTT CCTCGAGTCG
TGGGCCGACG AACGCGACCT GTCGGACGCG ACGCTCACCG TCGAGACGGG CGACGTCCAG
CGCAGCATCG GCGACGCGGC AGCGGACGCG ACGCTGCTGG TCATCGGCGC GACGGAGAAG
GGCCTGCTCT CGCGGGTCGT TCGCGGCTCG CTCGTGCTCG ACGTCCTCGA GGACGTCGAC
TGTTCGGTAC TGCTCGCCGA GAAGCGCCAC AAACGCTCGG TTCGCGAGCG GATATTCGGA
TCCGGAAGCG GCAATCGAAG CGACGCGGAG ACGGGCGTGA CGACGGAGCC GTCGACGCCG
GATCCCGAGT CCGAGACGAC GAGAACGGAC TGA
 
Protein sequence
MSGSDEELAK DLGPLAALTI GVGTMIGAGI FVLPGEAVAD LGPLASLAFV IGGVIALFTA 
LSASELGTAM PVSGGAYYYV NQGLGPLFGS IAGWGNWMGL AFASAFYMYG FGEYVNQFVT
VPALTLGPVG LESAQLIGLV GAAFFITVNY VGAKETGRLQ NIIVVTLVGI LGVFTLFGLM
NADLETLRPV DPFGWAPLLP VTGLVFVSYL GFVQITSVGE EIQNPGRNLP RAVIGSVVIV
TVMYALILLT VLAAVETEVV ANNETAVVDV AQMLMGPIGA AALLFGGLLA TASSANASIL
ASSRINFAMG RDKLVSPKIN EIHPRFATPY RAIAITGALI LLFIALGNLE MLASAGSVLH
LIVYGLLNIA LIVFREAEPA GYDPDFEVPL YPITPILGAV LSLALIGFME PTVILLSMAF
VVFGLVWYLG YARSEIESQG VLADYVLERS DELPDAAVSA TTAVKPEGGD YRVMVPLANP
AHEKHLITLA SAIADQNDGT VVAVNVEKVP DQTSLTAARD QRDHEAAEHL VEQARADAET
YGVDVETHVV LSHRGVEEVF DAATRYDADV CVMGWGPDSL GSSGRVESRT DELAHSLPCD
FLVFRDRGFD PSRILLPTAG GPDSDLAAAV ARCLRDQYDA EVTLLHVADD PQQGRAFLES
WADERDLSDA TLTVETGDVQ RSIGDAAADA TLLVIGATEK GLLSRVVRGS LVLDVLEDVD
CSVLLAEKRH KRSVRERIFG SGSGNRSDAE TGVTTEPSTP DPESETTRTD