Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3343 |
Symbol | |
ID | 8743963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3452217 |
End bp | 3454577 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646513926 |
Product | hypothetical protein |
Protein accession | YP_003404880 |
Protein GI | 284166601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGCA ATAACGAACC ATCATATCGC GAAAAGGGAC GCGCAGTCGT CCTGGCCGCG CTTATGGTGA TGTCGGTTTT CGCCATGTCC GCAGCGTTTG CGGGCGGGGC GGCCGCGGCA GAAGATGAAC CGGATGCGAA CTACACTTCC CTTAACGACC TTGACAACTC ACTCGTTTAC GCTGGTCAGA CCGTTCAGGT CGACCTCTCT GGTGAAAACG TCACAGAGGG TGACGATATC CAGATTCGCG TCAAGGGCGG TAGTCCGGTT GTCCTTGAGA CGGCCGACGC AAACTCTACC ATCACGTTCG ACACGGCAGA CCTCGACACC GGTGAATACG AACTGACCGG TGACAATATC TCGACGGATA ACTCCTTCGA GGTTGTCGTC CAGAACGTCG ATGCTAGCTT CAGCGGAGAC ACCGTCACGA ACGGTGACAG CGACTCGGTT GACCTGAGTA TTGAGTCCAG CCTCCGCAAC CAGTACAACG TCACGGTCAA CGCTGATGGT CTCACCCACA GCGAACTCGC GACGCTCTTC AACGTCTCTG AGGACCCTGA CGCTAGCGAT GACGACCCGG TCGTCATCCC GGATGCAGAG AACACTGAGG CGCTTCAGGA CCTCACTTTC GAGAGCGTCG AGACCGGAGA CTACACGTTC GACATCAATG TTGTCGACAC GACCGCATCG GACACCGCTA CGGTGTCTGT CGCGGAGAAG GGCTCCGTTA ACATCGACTT CGAGAACTCC GTCACTTCGG TCAACGAAGG CGATAAAGTT GACCTCAAGA TCAACTTCCA GAACACCGCT GACGACGCAG AGCTCGTCCT CGGTGGCGAG GACGTTGGTT TCGAAGTGAA CGCAACTGTT ATCAACCCCG AAAACAGCAA CACCGCCACC GTGACGGTCG ATACGTCCGC ACCTGGTTCG AACACCAGCG AGGTTCTCAG CGGTGATGTT GAGAACGTCA ATGTGAGTCA GGACATCGCC CGTGACAAAC TCGCGACGGG CGAGTACGGC ATGGAACTCG ACCTCACTAC CGGCATCGCG TCGGCTATCG GCACGCTCGA CGTGAATTCT CACGAAGAGC CTTCGCTTGA AGGTACGTAC GTAACTGGCG ACACCTTCAC CGCTGGTAAC ACCAACCCTG ACGACGTCCG AACGACGAGC GGAAACGTGG TCGCGGAAGG CGACACGGTT ATCGTCATCT ACGAGGGCGT TGGCTTTAAC GGCGATTACG CCGACAATGA TTCGAACGAT TCCGACAATG GTTGGAGCGC AATCATTGAG GAGAGCGAAG CGGCCCTGAA CCAGGAGCCC ATGACGCTGG ACGAGAGTGA CAGCGGCGTT AGTATGGCCT GGGACGAGGA CGCAGGTACG TACACGGTCG CTATCGACAC GTCTAACGCA AACGTCAGTG TCGACGAGGC CTACGACGTT TCGCTCCTCC TCGACGCTAA CGAGAACAAG TACTTCGACG CCGAGGACTA CGAGGACGAC TACGCCTTCG ATGTCGGAAC GCAGTTCGAA GTGACTGACC GCACGGTCGA CTTCGCTCAC GAAGCGAACG CGGACGGTCA ATTCGAGATC GAACAGCAGG ACGACGGTAC GGTCGCCATC ATGGGCGAAG GTACCGCCGC AGACGGTGCG GAGTACACTG TTTCCCTCCG TAGCGACGAA GCCAACGAGC TCCTGACGCA GCAGGTCGAA CTCGAGAACG GCGAATTCGC CGCTGAGTTC AACCTGAGCG AGTACGAGCC TGGCTTCGAA TTCACGCTCG ACATTCAAGG TAGCGACTAC GACGGCGACC GTCGGAACGC AGTCCTCGTC GCAGGCGACG AGCCTGCAGA GGCTTCGTTC GAAACCACGG ACGTCAGCGC ACCTGACAAC GTCACCGTTG GCGACGACGC CACCCTCGAC GTGACCGTCG AGAACACCGG CGACGTCGAA GGTACCACCA CGGTCACCGC GACGATCAAC GGTGAGGAAC GCACCCAGAA CGTCACGCTC GACGCTGGCG CCTCTGACAC TGTCTCCTTC GACCTCCCCA CCGACGAAGC GGGCGACGTC GCGTGGTCCG TTGGTGACGA GTCCGGTACG CTGACCGTCA ACGAGCAGAC TGACGACTCG GACGACTCTG ACGACTCGGA CGACTCTGAC GACTCGGACG ACTCCGACGA CTCGGACAAC TCCGACGACT CCGACAGCTC CGACAGCTCG GACGACTCTG ACAGCTCCGA CAGCTCGGAC GACTCCGACG ACTCCGAGAG CTCGGACGAC GGCACGCCCG GCTTCGGCGT CGCTGTCGCC GCCATCGCGC TGCTCGCCGC CGCCATGCTC GCACTCCGCC GCGAGAACTA A
|
Protein sequence | MTRNNEPSYR EKGRAVVLAA LMVMSVFAMS AAFAGGAAAA EDEPDANYTS LNDLDNSLVY AGQTVQVDLS GENVTEGDDI QIRVKGGSPV VLETADANST ITFDTADLDT GEYELTGDNI STDNSFEVVV QNVDASFSGD TVTNGDSDSV DLSIESSLRN QYNVTVNADG LTHSELATLF NVSEDPDASD DDPVVIPDAE NTEALQDLTF ESVETGDYTF DINVVDTTAS DTATVSVAEK GSVNIDFENS VTSVNEGDKV DLKINFQNTA DDAELVLGGE DVGFEVNATV INPENSNTAT VTVDTSAPGS NTSEVLSGDV ENVNVSQDIA RDKLATGEYG MELDLTTGIA SAIGTLDVNS HEEPSLEGTY VTGDTFTAGN TNPDDVRTTS GNVVAEGDTV IVIYEGVGFN GDYADNDSND SDNGWSAIIE ESEAALNQEP MTLDESDSGV SMAWDEDAGT YTVAIDTSNA NVSVDEAYDV SLLLDANENK YFDAEDYEDD YAFDVGTQFE VTDRTVDFAH EANADGQFEI EQQDDGTVAI MGEGTAADGA EYTVSLRSDE ANELLTQQVE LENGEFAAEF NLSEYEPGFE FTLDIQGSDY DGDRRNAVLV AGDEPAEASF ETTDVSAPDN VTVGDDATLD VTVENTGDVE GTTTVTATIN GEERTQNVTL DAGASDTVSF DLPTDEAGDV AWSVGDESGT LTVNEQTDDS DDSDDSDDSD DSDDSDDSDN SDDSDSSDSS DDSDSSDSSD DSDDSESSDD GTPGFGVAVA AIALLAAAML ALRREN
|
| |