Gene Htur_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3333 
Symbol 
ID8743953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3434885 
End bp3436153 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content66% 
IMG OID646513916 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003404870 
Protein GI284166591 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGTTC AATTGAATCG TCGAAAACTA CTGTCCGGTC TCTGTACCGC CGGACTCGGC 
GGCCTCGCCG GCTGTCTCAG TTCGGTTCCC GGGCTCGACT CGACCGACGT CGAGGACGGG
AGCGACGTCG GAGGCACCGA TCGAACGCTC AGACTGGGAA TCATGCAACC GCTCAGCGGC
GACCTCGAGA ACGTCGGACT CCCGATTCGG GACGCCGCAA CACTTCCTAT CAAGCAGATC
GAAGACGAGA TCTCGCTCGA TATCGAGTAC GAAGTCGTCG ACACCGAGAC ATCGCCGTCG
GCGGGCGTCC AGGGTGCGGC CGCCCTGGTC GACGAAGGAT ACCCGATGGT CAACGGCCCG
GCTGCCTCGG ACGTGACGCT GCAGGCGACA CAGCAGGTCC TCATTCCGTA CCGGACCGTC
TGCTGTTCGC CCGGTGCGAC CACGCCGACG ATCACCTCGC TCAACGACGC CGGGCTGGTC
TTCCGGACCG CGGTCTCCGA CTCGCTGCAG GCGGTCGTGC TCGCCGACCG GGCGGCGAAC
GATCTCGGCC ACGACAGCGC CGCGACCCTC TACGCGAACA ACGACTACGG CTGGCAGTTG
AGCCAGGCGT TCGCGCGCTC GTTCCGGAGC GATCACGGCG GGACGGTCTC GGCGCAGATC
CCGCTAACGG AAGGACAGGA CACCTACGAA GCGGCTCTCG AGCGGGCCAG GGAGGACGAC
CCCGAACTGC TCGTCGTGGT CGGCTACCCG GAAACGGGAG GACAGGTTCT CAGAGATCTC
GGAGCCGACG CCCCGGAAGA CGTCCTCGTC ACCGACGGCC TGCGGGACGG AGACCTCCAC
GACGAGATCG ACTACTCGCT CGACGGCATC CGCGGCACCG CGCCGCTGGT CGACGGGCCG
GGGACCGAAA CGTTCACCGA GCTGTTCGAG GACGCCTACG ACGCCGAGCC GAGCGTCTTC
ACGCCCCACT CGTACGACGC GAGCGCCGTC CTGTTGCTCG CGAACGCCTA CGCCGGGCAA
AACGACGGCA CCGCTATCAG AAACGCCATG CAGGCGGTCA CCACAGGCGA CGGCGAGGAG
ATCACACCGG AGACCCTCGC CGAGGGGGTC GACCTCGCGG CCCGGGGTGA CTCCGTCACC
TACCAGGGGG CCTCGAGTTC GGTTGACTTC GACGAGAACG GTGACATCGT CGGCGCGGTC
TTCGAGTACT GGGAGTTCGA CGAAAGCGCG GACGGCGGAA TCACCGAGAT AGAACGGGTG
AGCTCCTAA
 
Protein sequence
MSVQLNRRKL LSGLCTAGLG GLAGCLSSVP GLDSTDVEDG SDVGGTDRTL RLGIMQPLSG 
DLENVGLPIR DAATLPIKQI EDEISLDIEY EVVDTETSPS AGVQGAAALV DEGYPMVNGP
AASDVTLQAT QQVLIPYRTV CCSPGATTPT ITSLNDAGLV FRTAVSDSLQ AVVLADRAAN
DLGHDSAATL YANNDYGWQL SQAFARSFRS DHGGTVSAQI PLTEGQDTYE AALERAREDD
PELLVVVGYP ETGGQVLRDL GADAPEDVLV TDGLRDGDLH DEIDYSLDGI RGTAPLVDGP
GTETFTELFE DAYDAEPSVF TPHSYDASAV LLLANAYAGQ NDGTAIRNAM QAVTTGDGEE
ITPETLAEGV DLAARGDSVT YQGASSSVDF DENGDIVGAV FEYWEFDESA DGGITEIERV
SS