Gene Htur_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1588 
Symbol 
ID8742180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1645383 
End bp1646714 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content67% 
IMG OID646512165 
Productprotein of unknown function DUF21 
Protein accessionYP_003403147 
Protein GI284164868 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGACC TCGTCGTCGA CCTCGCGCGG CTGTTGGGAG CGTTCGTACT GGTCGCCCTG 
AACGGCTTCT TCGTCGCCGC GGAGTTCGCC TACGTCCGCG TTCGCTCGTC GGCCGTCGAA
CGGGCGGTCG CGGAGGGGAA GACCGGGGCG ACGCTCCTGC AGGAGGCCGT CGACAACTTA
GACGACTACC TCGCGACCAC CCAGCTGGGA ATTACCCTCG CCTCGCTGGG GCTGGGGTGG
CTGGGCGAAC CGGCCGTGGC GGCCCTCCTC GAGCCGGTGC TCGCGCCGGT TCTCCCGGCG
AGCCTGCTCC ACCTCGTCGC CTTCATCATC GGATTCAGCT TCATCACGTT CTTGCATGTC
GTCTTCGGCG AACTCGCGCC GAAGACGATC TCGATCGCCC AGGCCGAACG CGTCGCACTG
CTGGTCTCGC CGCTGATGAA GTTCTTCTAC TTCATCTTCA TCCCCGGGAT CATCGTCTTC
AACGGGACGG CCAACGCCTT CACGCGACTG ATCGGGATTC CGCCGGCCTC GGAGACCGAC
GAGACGCTCT CCGAGGAGGA GATCCTGACC GTCCTGAGCA GGTCCGGGAA CGAGGGCCAG
ATCGACGCGG AGGAGGTCGA GATGATCGAA CAGGTCTTCG AACTCGACGA CACGACCGTC
CAGGAGGTGA TGGTGCCCCG GCCCGACGCC GTGACGATCA CCGACGACCT TCCGCTCTCT
GACCTTCGCA CGCTGATCCT CGAGGAGGGA CACACTCGCT ATCCGGTGCT CGACCCCGAC
GGGGACGATC AGGTGATCGG CTTCGTCGAC GCCAAGGACG TGCTGCGAGC GGGCGAGTCC
GCGGGCGACC TCGCGGACGT GACCGTCGGC GAGGTCACCC GCGAGATGCC CGTCGTGCCG
GAGACGACGC CCGTTACCGA TCTCCTGGAG CAGTTTCAGG GAGACCGGGC ACAGATGGCC
GCGGTGATCG ACGAGTGGGG CGTTTTCGAG GGAATCGTCA CGGTCGAGGA CCTCGTCGAG
CAGATCGTCG GCGACCTCCG CGACGGGTTC GACGCCGACG AGCCCTCGAT CGACCGGCGG
GGCGACGGTT CCTACGTCGT CGACGGTGCC GTCACCGTCT CGGACGTCAA CGAGCGACTC
GACGCCGACT TCGAGTCCGA CGAGTTCGGC ACGATCGGCG GGCTCGTGCT GGATCGGCTG
GGTCGGGCCC CCGACGTCGG CGACCACGTC GAGGTCGACG GCTACGCGCT CGAGGTCGCC
GCCGTCGAGG GCGCACGAAT CTCCTCGCTG GTCGTTCGCG AGAACCCGAA GGCGGAGGAG
ACGGCCGAAT GA
 
Protein sequence
MVDLVVDLAR LLGAFVLVAL NGFFVAAEFA YVRVRSSAVE RAVAEGKTGA TLLQEAVDNL 
DDYLATTQLG ITLASLGLGW LGEPAVAALL EPVLAPVLPA SLLHLVAFII GFSFITFLHV
VFGELAPKTI SIAQAERVAL LVSPLMKFFY FIFIPGIIVF NGTANAFTRL IGIPPASETD
ETLSEEEILT VLSRSGNEGQ IDAEEVEMIE QVFELDDTTV QEVMVPRPDA VTITDDLPLS
DLRTLILEEG HTRYPVLDPD GDDQVIGFVD AKDVLRAGES AGDLADVTVG EVTREMPVVP
ETTPVTDLLE QFQGDRAQMA AVIDEWGVFE GIVTVEDLVE QIVGDLRDGF DADEPSIDRR
GDGSYVVDGA VTVSDVNERL DADFESDEFG TIGGLVLDRL GRAPDVGDHV EVDGYALEVA
AVEGARISSL VVRENPKAEE TAE