Gene Htur_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1239 
Symbol 
ID8741829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1300161 
End bp1301390 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content55% 
IMG OID646511819 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003402803 
Protein GI284164524 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACTA CAGCGAATAA GACGCTCGAA GCGACGCTTG TCTCGCCGAC AGCCCACAAA 
GAGGAGAAGT TACAAGATAC TCTCAAAACA TACCGTGAGG CGCTGCAAGA CGCGTTCGAC
TCTGGTGCGG ATACGATGAA CGGTGTCTCT GAAGTAGTGA CGCCGTTCGA TCTTCCATAC
CAAGCGAAGG CTGCACTATG CAGCTACATC CCGAAGCTCC GGAAAACATA TAACGCCCGT
GAGTTAGACG ATGAACACCC GCTCCGGCTC ACAAATCAGG CCGCGAGGTT CGACTACTCG
AGCGAACGTG AGCACGAATT CACGTGGTGG GCACCGCGAC CGGGACGAGG GACGAACTTC
TGGATTCCGC TTCGGATTAA CCCGGAGCAA GAAGACCTCT GGCACGATCT CCTCAACGAG
GACGTCAAGG CTGGACAGAT TCAACTCCAG AAGAACCGGA AGAACTGGGC ACTTCACGTT
ACCGTCGAGT ACCCGGTTGA AGAACCGACG GTAGACGGTG ACACCACACC AGTCGGGCTT
GATATCGGTG AGACTGCGCT GATCACGGCC TGTGGCCTTA AGCGCGGTAC ACCGACAAGA
CCCGTTCTCT GGAGTGGTAA GCGTACAAAA CACCTCCGAA AGGAAATGTC GACCACGCTT
CAGCGACTAC AAGAACGTGA TGCTGAATGG CGCATTGATG AACGGTTCGA CTACTACCAA
AACGCGCTTA CGGATATCCT CGAGAAGGCC AGTTGCGAGG TCGTCGAATA CGCTGGCACT
TTCGAGAACC CGATGATCGT GATGGAGAAT CTGACGTACA TCCGTGAGAA CTTGGACTAC
GGGAAGTACA TGAACCGGCG ACTCCACGCG TGGGCCTTTG CACGGCTTCA GGGCCGTGTT
GAGGACAAAG CGAGAGACGT CGGTATCCCG GTCGAATACG TGAGTCCGCG TTACACGTCT
CAGACGTGCC ACGAGTGTAG TCACATCGGA AAGCGAAGTA CGCAAGCAGA ACTTCGGTGT
ACGAACGACC ACTGTCGCGT CTCGACGTTC CAAGCGGATA TCAGTGCAGC TGCAAGCATC
GCTCAGAGGG TTGACCCGTG GGGAGAGAGC GTTCCTTGGA AATCGGAACG CAATGACTCG
CCTCGGGATG GGAGCGGTAG TGACACCGCC GTAAGACCAC CCAAGCCGAG CACACCTACG
CAAATGACGC TTGGAGATGA TCGGTCTTAA
 
Protein sequence
MSTTANKTLE ATLVSPTAHK EEKLQDTLKT YREALQDAFD SGADTMNGVS EVVTPFDLPY 
QAKAALCSYI PKLRKTYNAR ELDDEHPLRL TNQAARFDYS SEREHEFTWW APRPGRGTNF
WIPLRINPEQ EDLWHDLLNE DVKAGQIQLQ KNRKNWALHV TVEYPVEEPT VDGDTTPVGL
DIGETALITA CGLKRGTPTR PVLWSGKRTK HLRKEMSTTL QRLQERDAEW RIDERFDYYQ
NALTDILEKA SCEVVEYAGT FENPMIVMEN LTYIRENLDY GKYMNRRLHA WAFARLQGRV
EDKARDVGIP VEYVSPRYTS QTCHECSHIG KRSTQAELRC TNDHCRVSTF QADISAAASI
AQRVDPWGES VPWKSERNDS PRDGSGSDTA VRPPKPSTPT QMTLGDDRS