Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3824 |
Symbol | |
ID | 8744452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 48388 |
End bp | 50574 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514410 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003405357 |
Protein GI | 284167079 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.676113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCTG TAGCCCAATC TACCGTGGCG AGGGAGACGT ACTGGGGAAT CACGAGCGTC GAGTACGCCG TGTTCTACTT CCTCGCGTTC ACGGCGGTCG CCGTCTTCGC GTACGGTGTC TACCGGCGGT TCGCGCGCTA CTCCCGCGGG GACGATGAGT CGTCCCCGCG GGTCGACGAT CTGTCGAAGC GCATCGTCAG CGCGTCCAAG ATCGCCCTCT CGAACGAGAA GCAGTTCAAC CGGGACCTCT ACGGCGGTCT GATGCACGCG TTTATCCTCT GGGGATTCCT GACGCTGTTC ATCGCGACGA CGATCCTGAT GATCGACGAG TACGCCGCTC AGGCGGTGTT GCACACGACG TTCTGGGAGG GCGACTTCTA CCTCGCCTAC CAGTTCGCGG TCGACGCAAT GGGGCTACTC TTCGTCGTCG GCCTCGGAAT GGCGATCTAC CGGCGCTACT GGGTCCGGAA TCACCGCCTC TGGGGTCGTC ACACCTCAAG CGAGGACGAC CTCTTCGTCT GGACGCTGTT CGGCCTCGGC GTCGGCGGCT TCCTCCTCGA GGGGTTGCGC GTCTACAGCG CCGGGATCCC GGACCACGAG GTCGTCAGCT TTGTCGCCTA CGGGATGGCG CTCGGCTTCG AAGCCGTCGG ACTCCCGACG CTCGGGCCCG AGCAGGCCGG CCTCAACGCG TCTGGGCTGA ACGTCGAGAA CCTCCACTGG CTCGCGTGGT GGTCGCACTC GCTGCTCGCG TTCTTCTTCA TCGCGTGGAT CCCTTACGCG AAGCCGTTTC ACATGCTCTC GTCGTTCGCG AACGTCGTCA CGCGCGACGA GAAGGCCGGT CGGCGGCTTC CGAACGTCCC CTCGGATCTG GACGCGACCA ACGCCGAGTC CATCGACGAC TTCACCTGGA AGGAACTGCT CGATCAGGAC GCCTGCACCA AGTGCGGTCG CTGCTCGTCG GTCTGCCCCG CGAAGGCCTC CGACCGTCCG CTCGATCCGC GAGACGTCAT CCTCGATCTG AAGTCCTATC GGGAGGGCCT CGAGTCCGGT AGTGAAGAAC AACCGATTAT CGCCGATGGC GGCACCTCGG TGATCGATGC CGAGACGATG GAGTCTTGCA TGGCCTGTAT GGCCTGTATG GACGCCTGTC CCGTCGAGAT CGAACATCTC AAGAGCTTCA CCCGACTCAA CCGCCAGCTG ACCGATCAGG GTGACATCGA CTCGAGCATG CAGGACGTCT TCCAGAACGT CATGCAGAAC GGTAACACGT TCGGCGACTC GCCGCGCAAT CGGGGCGACT GGAGCGAGGA TCTCGCGTTC GACGTGCCCG ACGCTCGCGA GGAGGAAGTC GACTACCTCT GGTACGTCGG GGACTTCCCG AGCTACGACG AGCGCAACAA GCAGGTCGCG CGCTCGCTAG CGACGATCCT CGAAGAGGCG GACGTCAGCT TCGGCATCCT CTTCGACGAC GAGAAGTACG ACGGCAACGA CATCCGCCGG GTCGGCGAGG AGTTCCTCTA CATCGAACTC GCCGGCCACC ACGTCGAGAC CTGGGAGGAT TGTGAGTTCG ACACGATCGT CTGTACAGAT CCGCACTCCT ACAACACGTT CAAGAACGAG TATCCGGAGG TCGACTTCGA CGAGTTCGCC GACGACCCGA TGATGCCCTT CGAGTACGAC GAACGGTGGA ACGAAGACAG CGAAATCGAG GTCTACCACT GGACCCAAGC CGTCGAAGAA CTGGTCAACG AGGGCAAACT CGAGCTGAAC GGCTCCGAAC TCGACTACAC GGTCACCTAC CACGACCCGT GTCATCTCGG CCGGTACAAC GACGAGTACG AAGCCCCGCG CGCACTCATC GAGGCGACGG GCTGCGAACT CGACGAGATG CCCCGCAACC GCAGCAATTC CTTCTGCTGT GGCGGGGGTG GCGGCGGCCT CTGGATGGAT TTCGAGGAAG AGCCCAAACC CAGCGAGGAA CGCATCCGCG AAGCCCTCGA GGACACGGAC GCCGGGAGCG GGGTCGAGAA GTTCGTCGTC GCCTGTCCGA TGTGCATGAC GATGTACGAG GACGGGCGCA AGACCGGTGG CTACGAGGAC GAGATCGAGA TCGTCGACGT CGCCGAACTC ATCGTCGAAG CGATCGGGAA AGAAGACGAA GCGAGGGTCG AAGCCGCCGC TGACTAG
|
Protein sequence | MDAVAQSTVA RETYWGITSV EYAVFYFLAF TAVAVFAYGV YRRFARYSRG DDESSPRVDD LSKRIVSASK IALSNEKQFN RDLYGGLMHA FILWGFLTLF IATTILMIDE YAAQAVLHTT FWEGDFYLAY QFAVDAMGLL FVVGLGMAIY RRYWVRNHRL WGRHTSSEDD LFVWTLFGLG VGGFLLEGLR VYSAGIPDHE VVSFVAYGMA LGFEAVGLPT LGPEQAGLNA SGLNVENLHW LAWWSHSLLA FFFIAWIPYA KPFHMLSSFA NVVTRDEKAG RRLPNVPSDL DATNAESIDD FTWKELLDQD ACTKCGRCSS VCPAKASDRP LDPRDVILDL KSYREGLESG SEEQPIIADG GTSVIDAETM ESCMACMACM DACPVEIEHL KSFTRLNRQL TDQGDIDSSM QDVFQNVMQN GNTFGDSPRN RGDWSEDLAF DVPDAREEEV DYLWYVGDFP SYDERNKQVA RSLATILEEA DVSFGILFDD EKYDGNDIRR VGEEFLYIEL AGHHVETWED CEFDTIVCTD PHSYNTFKNE YPEVDFDEFA DDPMMPFEYD ERWNEDSEIE VYHWTQAVEE LVNEGKLELN GSELDYTVTY HDPCHLGRYN DEYEAPRALI EATGCELDEM PRNRSNSFCC GGGGGGLWMD FEEEPKPSEE RIREALEDTD AGSGVEKFVV ACPMCMTMYE DGRKTGGYED EIEIVDVAEL IVEAIGKEDE ARVEAAAD
|
| |