Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3900 |
Symbol | |
ID | 8744528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 142737 |
End bp | 144932 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514484 |
Product | hypothetical protein |
Protein accession | YP_003405431 |
Protein GI | 284167153 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.020807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTACGAG ATACCGAACC GCAAGGTAGT GGCAACGCGA GCGCACACGA GTCCGGCCTG ACGTTCGACC GGCGAACGAT CCTCGGCCTG TTGGGCGTCG GCGGTCTCGC CGCGGCGTAC GGGAGCGGAA CGGCGCGAGC CGACGGCGGC CGGGCCGACG GACGACACGG CGAGTCGGGC CCGACACGCA AGTGGAACCA GGACATCGAT GCGCAGGGAC ACGACCTCTC GAATCTGGGG TCGCTCGAGG TCGACCACGT CTACACGGCA GCGCGGAACG CGGACGTGAT CGTCTGGAAG GACGACGACG GCGTCTTCCA CGCCGACGGG ACGGACGGAC ACGTCGCGAG CGGCGAGGAC GTGATCGAGG TCACGCAGGC GGCGGTCGAC AGCCTGACCG ACGGGCGCGA CTGGAAGGAG ACGGTCGCGG TGGTCTCGCC GAGCACCGTC GGTCCGGTCG AGGGAAGCGG CGACGTACCG ACCTACGAGG GCTCGGACGA CATCGAGAGC ATCGAACTGC CGAGTTACAC CGTGCTGGAT ATGCCCGCGA CGATGCATGT CGAGGACGAG GGCGATCAGG CGCTCGTCGT TCCGGTCGCG GCCTACGACG CCGAACACAT CGAGATTCCG AACTTCAGAG TCGTCGGAAA TCCTCGGTTC GGAATGTTCC TCCGGAGCGT TCGGAACCTC CGGCTCGGGA ACGTGAACGT CGAGATGACC GGCGAGAGCC CCCGCGGCGG CATCGGCGTC CGCATCGACG GCTTCGCCCA CGGCCGCGGC GAGGACACCG TCCGCTGTAC GGATATCCAG GTCGATTCGG TCTACGTCGA GAACTCGGGC GGCCACGCCT TCGAGACGTA CGCGGTCGAC CGCCTCCAGG TCGGCCAGGT CATCGCGAAC GGCGTCGAGT CTGGCTGTGG CGTCCTGCTC AACGAGACCA CCGACGCGAC GGTCGGCTCC GTCGTCGGGC GGGAGATCGA CCCCGGCGGC GGCTACGCCG GATTCCGCGT GGCCAACGGC ACGCACGACG TCACCTGCGA TCAGGTGGTC GTCCGCGGCG GCGCCCGCGG GATCTTCGGC GTCTCGGGCT CCCACGACAT CACCATCGGA GAGGTGAACA TCTCGGAGAT GGGCGGCGGC GTCTTCATCG AGGACAACCA GAACTTCACC ATCGAGGGCG GCGTCGTCAA GAACTGCGAC TGGGAGGCCG TGCGCATCCA CTCGCGGTCG GACTACCAGC ACGATCCCAC GAACGGCGTG ACGATACAGA ACCTCCGCAC CTACGACGAC CGGCCGGAGG AAGACCGCGA ACAGAGTTAC GGCATCTACG TCTCCGGCGG GCAGACCTCG AACGTCCGGA TCATCGACTG CGACGTGCGC GGCGGCGGCA CCGATCAGAA CATCCGGGTC GATGCCGACG AGACGATCCT CCGGGGCAAC CACGGCGGCG GACTCGCGAA GGGAACCGTC ACCCTCGAGT CCGGCGCCGA CCCCGCCGCG ACCGTCGGGG GCGTCAGTCC GTTCGGCTAC CAACAGCCGT CGCTGCGGGC CGATCCCGTC GAGGCGACGG ACGCGACGTT CGCCTTCGAT CACTACTTCG TGTGGAACGC CGACGCGGAG GCGTGGGATC TCCACCTCGA GTGGAAGCGC GATCCCGGCC AGGACGTGGA CGTCCAGTAC GTCGTCGACA ATCCGCGGGC GAACCTCGGT GCCCGCGAGT CGATGGGCGG CGGTGGCGAG CTCACGGAAC TCGAGGCGGG GACCTACCGG CTCACCTCGG CGCTCAACGG CGACGACATC GTGATGAGCG TCGACGGCGA CCTCGCCGAC GGCGCGAACG TCTACAACGA TACCTGGGGC GAGGCGAGCG GCCAGGTGTG GGACGTGACG GAACTCGAGG ACGGCGTCTT CCGCATCAGC CCGGCCGACG CCGGCGGGCT CGCGCTCGAG ACGGCCGACG GCGGGACCGA CACCGGCACG AACCTCGAAC TGGGCGCGTG GGAGGACGCC GACCACCAGA AGTTCGAGGC GAATCCGATC GCGCCCGATC GCTACTCGCT CGAGCCGACC CACGCGGACG ACCTCGCGGT CGACGTCTGG GAGGTCGACC CCGAACCGGG CGCCGACCTG CGCCACTGGA ACGTGACGAA CAGCAGCAAC CAGCTCTGGA AGTTCCAGGA TCCCGAGGAC GGATAG
|
Protein sequence | MVRDTEPQGS GNASAHESGL TFDRRTILGL LGVGGLAAAY GSGTARADGG RADGRHGESG PTRKWNQDID AQGHDLSNLG SLEVDHVYTA ARNADVIVWK DDDGVFHADG TDGHVASGED VIEVTQAAVD SLTDGRDWKE TVAVVSPSTV GPVEGSGDVP TYEGSDDIES IELPSYTVLD MPATMHVEDE GDQALVVPVA AYDAEHIEIP NFRVVGNPRF GMFLRSVRNL RLGNVNVEMT GESPRGGIGV RIDGFAHGRG EDTVRCTDIQ VDSVYVENSG GHAFETYAVD RLQVGQVIAN GVESGCGVLL NETTDATVGS VVGREIDPGG GYAGFRVANG THDVTCDQVV VRGGARGIFG VSGSHDITIG EVNISEMGGG VFIEDNQNFT IEGGVVKNCD WEAVRIHSRS DYQHDPTNGV TIQNLRTYDD RPEEDREQSY GIYVSGGQTS NVRIIDCDVR GGGTDQNIRV DADETILRGN HGGGLAKGTV TLESGADPAA TVGGVSPFGY QQPSLRADPV EATDATFAFD HYFVWNADAE AWDLHLEWKR DPGQDVDVQY VVDNPRANLG ARESMGGGGE LTELEAGTYR LTSALNGDDI VMSVDGDLAD GANVYNDTWG EASGQVWDVT ELEDGVFRIS PADAGGLALE TADGGTDTGT NLELGAWEDA DHQKFEANPI APDRYSLEPT HADDLAVDVW EVDPEPGADL RHWNVTNSSN QLWKFQDPED G
|
| |