Gene Information |
Locus tag | Htur_3068 |
Symbol | |
ID | 8743688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3146847 |
End bp | 3148265 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646513652 |
Product | LVIVD repeat protein |
Protein accession | YP_003404606 |
Protein GI | 284166327 |
COG category | [S] Function unknown |
COG ID | [COG5276] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
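The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only and do not come from any database API.

```python
# Values copied from the record above.
START_BP = 3146847
END_BP = 3148265


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1419, matching "Gene Length"
print(protein_length(length_bp))  # 472, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.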
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGGA GAGCATTCCT CCGAGCGGGT GGGGCCGCCG GCGCCGTCCT GGCCGTCTCG GGAGCGGCGG TCGGGACGTC CACAGCGACG CCGCGCGCCA GCCAGGAGGT CCCCGACTCG TTCGAACCGC TCGGGCAGCT CGAGCTACCC GACAGCGACC CCGCCGAAGT GGTCATCGAC GACCACGGCG AGACGGCGTA CCTCGCCACG ACGTACGGGT TCGCGACCGT CGACCTCGGC GACCCGACCG CGCCGGAGCT GCTCGCCGAA CGAACTCGGC TCGAGGCCGA CGGCCAGGAG TTCACGCGGA TTTTCGACGT CAAAGTCGAC GGCGACCGAC TCGCGGTCGT CGGGCCCGCA GACGAGGGGT TCGGCGAGTT CAACGGGTTC GAACTCTACG ACGTCAGCGA CCCCGCGGAC CCCGCGGTCG TCGACCGCTA CGAGACCGGC TTTCACATCC ACAACTGCTT TTTCGCGGAC GAGCTGCTGT ACGTCGTCAA CAACGGCCCC GACGATACCG CGCTCGTCAT CTACGACACG AGCGACGACG ACACCGAGGC GGTCGGCCGC TGGTCGCTAC TCGACCACGA CCCCGAGTGG GAGGACGTCT ACTGGTACGT CCATTATCTC CACGACGTCA CCGTCCACGG CGACCTCGCC GTCTTCCCGT TCTGGGACGC CGGCACCTAT CTGGTCGACG TCAGCGACCC GAGCGATCCA AAGTATGTCT CACACGTGCG CGACCCCGAC GTCAGCAGGG ACCGAAGCTA CGGGGAGAGG GAGGCCGTCT ACGGCCTGCC GGGCAACGAC CACTACGCGA CGGTCGACGA CGCCGGCGAG CTCCTCGCGG TCGGTCGCGA GGCCTGGACG ACCGGCGGCT CGGCGCCGGA CGGGCCGGGC GGGATCGACC TCTACGACGT CACCGATCCG TCGGCGCCGG AGCCGCTGGC GTCGATCGAG CCGCCCGAAA GCGACGACGC GTCCCGCAGG GGCGGCGAGT GGACGACCGC CCACAACTTC GAGTTACGCG ACGGGCGCCT CTACTCGGCG TGGTACCAGG GCGGCATCAA AATACACGAT GTGAGCGACC CCGCCGCTCC CGAGGGACTC GCCCACTGGC GGGCGACCGA CGACGCCGCG CTCTGGACGG CACGCGTCGC CAACGACGGC GCGACGGTCG TCGCGAGCAG CACGTCGCGG CTCCCTGCCA CGGACATCGA CGGCGCGCTG TACACCTTTC CGACCGGACT CGAGAGTGAC GGATTCGAGA CGGGCGGCGA CGACAACGGC TCCGATGGCG ACGGGAACGA TTCCCTCAGC GACCGGGTTC CCGGCTTCGG AGGACTCAGC ACTGGGATCG GACTCGCCGG CAGTGCGGCC GCCCTCGAGT GGGTCCGTCG ACGCGGCGAC GATCGGTGA
|
Protein sequence | MQRRAFLRAG GAAGAVLAVS GAAVGTSTAT PRASQEVPDS FEPLGQLELP DSDPAEVVID DHGETAYLAT TYGFATVDLG DPTAPELLAE RTRLEADGQE FTRIFDVKVD GDRLAVVGPA DEGFGEFNGF ELYDVSDPAD PAVVDRYETG FHIHNCFFAD ELLYVVNNGP DDTALVIYDT SDDDTEAVGR WSLLDHDPEW EDVYWYVHYL HDVTVHGDLA VFPFWDAGTY LVDVSDPSDP KYVSHVRDPD VSRDRSYGER EAVYGLPGND HYATVDDAGE LLAVGREAWT TGGSAPDGPG GIDLYDVTDP SAPEPLASIE PPESDDASRR GGEWTTAHNF ELRDGRLYSA WYQGGIKIHD VSDPAAPEGL AHWRATDDAA LWTARVANDG ATVVASSTSR LPATDIDGAL YTFPTGLESD GFETGGDDNG SDGDGNDSLS DRVPGFGGLS TGIGLAGSAA ALEWVRRRGD DR
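The gene sequence above is printed in blocks of ten bases, starting with ATG and ending with TGA. A minimal sketch for sanity-checking a sequence pasted in that form is given below. It strips the spacing, computes GC content, and checks for a plausible start and stop codon. The helper names are hypothetical, and the start codon set is limited to the three most common initiators (ATG, GTG, TTG), which is a simplifying assumption rather than the full set allowed by translation table 11.

```python
# Stop codons of the standard/bacterial-archaeal code.
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the three most common start codons are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(sequence: str) -> str:
    """Remove whitespace between blocks and upper-case the bases."""
    return "".join(sequence.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    bases = clean(sequence)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


def looks_like_complete_cds(sequence: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    bases = clean(sequence)
    return (
        len(bases) >= 6
        and len(bases) % 3 == 0
        and bases[:3] in START_CODONS
        and bases[-3:] in STOP_CODONS
    )


# Toy input in the same ten-base block layout (not the record's sequence).
example = "ATGGCCGGCG CCTGA"
print(looks_like_complete_cds(example))
print(round(gc_percent(example), 1))
```

To check the record itself, the full "Gene sequence" string would be passed in place of `example`; the result can then be compared with the GC content field listed above.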