Gene Information |
Locus tag | Htur_5040 |
Symbol | |
ID | 8745846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 29622 |
End bp | 30662 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646515654 |
Product | hypothetical protein |
Protein accession | YP_003406601 |
Protein GI | 284176325 |
COG category | |
COG ID | |
TIGRFAM ID | |
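The coordinate and length fields above are consistent with each other, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the record: Start bp 29622, End bp 30662.
gene_len, protein_len = cds_lengths(29622, 30662)
print(gene_len, protein_len)  # 1041 346
```

Both values match the Gene Length (1041 bp) and Protein Length (346 aa) fields of the record.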
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCACAGC TCAATCCAAC ACAGATAGCG AACCGTATCG GACGTATTGC CCTTGCAACC ATCCTCGCCG GAGTCTTGTG CGCGCTCGTG AACGTGCCCA GCGAGGTCCG GTCGGGAACG TTCCCCGATC TGACCGACCC GATCGATATC TACCTCTCCC TCTTGGTGGA GGTGTGGACG TACGGGATAG TCGCTGCAGT ACTCGTCCCG GTCGTACTGG GCGTGTTGTG GGGATTCTCG GAACGGTTCT CCCGTGTCCG TGTCTGGCAG GCGATTCTCG TCGTTACGGT CGGACTCGCG GTCGCGATGT CCCTCGAGTC GTGGGGCCAC CCGGCCGCGT TCGTGCTCGC AACAGTCGGC CGGCTCGCCT CTGGCGGCGC CGTGCTGGCG GCAGTCGTGA TTGTCGAGTT GCTGTTCGCG TGGGATATCG CCTCGCAAAC CTCGCTCGAG GGGTTCGTCA CGGCCTCGGG TGGAACGCTC GCGGTCGTTG TTGCCCTGGT GCTCGTCGTC GCAACGATCT CGACTGGCGT GCTCGCCGTG GCCGGTACCG CGGGCGTGTC GTTGCCTGAG GAGAGCCACG ACCACGGGTT TGGGTCGGCG TCGGCAGACG AACTCGAGAC CGAGTACAGC CACTATCTGG ACGTCAAATC GGGCGACGAA CTCACCTGCG AGCCGGCGAC CGTCGAACGC GAGAACGTCC CGGCGGCCGC ACAGACCCAC GAGAACGACC TGAACGACTT CGAAGTTAAC GCGACGGTCT ACGAGGGGAT GGGTGCAAGT ATCATCTACG AGTGGACCTA CACCGGTGAG GGAACGCTCG AGACCCAGCG CAGTGGTACC GTCGAGAACG GTGCAGTGGA ACTCGACGGC TACTGGGATC GGCCAGTCAA CGAGGGGAAC CACGAAATGC CCGTTATCGA CGGGACGGCC GTCGCCGACG GTGGCGATTC GATCAACGGC ACCTACGTCG AGTTCGACGT CGTGAACGAC GACGGCGAGT TGATTCGGTA CACCGGGACG CTGTGCGACA AGAATCTTTG A
Protein sequence | MPQLNPTQIA NRIGRIALAT ILAGVLCALV NVPSEVRSGT FPDLTDPIDI YLSLLVEVWT YGIVAAVLVP VVLGVLWGFS ERFSRVRVWQ AILVVTVGLA VAMSLESWGH PAAFVLATVG RLASGGAVLA AVVIVELLFA WDIASQTSLE GFVTASGGTL AVVVALVLVV ATISTGVLAV AGTAGVSLPE ESHDHGFGSA SADELETEYS HYLDVKSGDE LTCEPATVER ENVPAAAQTH ENDLNDFEVN ATVYEGMGAS IIYEWTYTGE GTLETQRSGT VENGAVELDG YWDRPVNEGN HEMPVIDGTA VADGGDSING TYVEFDVVND DGELIRYTGT LCDKNL
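The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline used to produce the record. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons), and it ignores the 10-base and 10-residue spacing used for display. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 18 bases of the gene sequence, copied from the record with spacing kept.
print(translate("ATGCCACAGC TCAATCCA"))  # MPQLNP
```

The printed residues match the first six residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared in the same way.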