Gene Information |
Locus tag | Htur_4778 |
Symbol | |
ID | 8745368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 392143 |
End bp | 393093 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515276 |
Product | hypothetical protein |
Protein accession | YP_003406223 |
Protein GI | 284172841 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
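The coordinate and length fields in this table are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 392143
END_BP = 393093


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 951 316
```

Both results match the Gene Length (951 bp) and Protein Length (316 aa) rows.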
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.041833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCTC CTGAGAGAGA CGACACTCGA GACGCGATCC TCGATTGCCT CGGTTCGTTC CCCGACCGAC CCGATCCGGC CGTCGAAACC ATCTCGGTCA TGACCGCCGA CGGCTTCGAG CGCCGACTCC TCGAGTACGC CGTCGAACCG GACGAACGGA GTCGAGCGTA TCTACTCGTC CCCGACGATA TCGACGGGCA GCGGCCCGGA ATTCTCGCCG TGCACCCGCA CGCGGGCGAG TTCGACGTCG GGAAATCGGA TCCCGCCGGG CTGAGCGAGA CGGAGCCGTA CCACTACGGC GCCGAACTCT GCCGGCGCGG GTACGTCGTC TGCTGTCCGG ACCTGCTCGC CTTCGAGGAC CGACGGCCGT CCGAGCGCGA ACGAGCGGCC GGGACGGCGC CCACGGGTGC CGACTACGAG AAGTTCGTCG CGATGGATCG CCTGCTACGC GGCTCTTCGC TCCAGACGAA GTACCTCTCG GATCTCGTCG CCACGCTCGA CGTCCTCACG GCCCACGACC AGGTCACCTC CGAGGCGTTG GGCGTGATCG GCCACTCGCT GGGCGGTCAG GAAGCCGCCT GGCTCTCGTG GTTCGATGAT CGCATCGACG CCGCGGTCGT CTCGAGCGGT GTGGCCAAAC TCGCCGCCGT CCAGCGCGAG CAAATCACCC ACAACTTCGC GCTCTACGTG CCCGACCTCC TGACGGTTGG TGACATGAAA GACGTCTTAG CGGACATCGC GCCGCGCTCA CTGCTGGTTA CCCACGGCAC CGACGATCAC ATCTTCCCGC CCGAATCGGT CCGCGACCTC GCCGACGTCG TCTCCGAGGC CTACGCCGAT GCTGATGCCC TGGAGCGATT CGAGACGCTG TTTTTCGACG GCGGTCACGA GTTCTCCACG GAAGTCCGCT CGAGTTCCTA CGACTGGTTG GATCAGCAAC TCGGTCGGTA G
|
Protein sequence | MDAPERDDTR DAILDCLGSF PDRPDPAVET ISVMTADGFE RRLLEYAVEP DERSRAYLLV PDDIDGQRPG ILAVHPHAGE FDVGKSDPAG LSETEPYHYG AELCRRGYVV CCPDLLAFED RRPSERERAA GTAPTGADYE KFVAMDRLLR GSSLQTKYLS DLVATLDVLT AHDQVTSEAL GVIGHSLGGQ EAAWLSWFDD RIDAAVVSSG VAKLAAVQRE QITHNFALYV PDLLTVGDMK DVLADIAPRS LLVTHGTDDH IFPPESVRDL ADVVSEAYAD ADALERFETL FFDGGHEFST EVRSSSYDWL DQQLGR
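As a rough illustration of how the gene and protein sequences relate, the sketch below strips the 10-base grouping, computes a GC fraction, and translates codon by codon. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons), and the helper names are hypothetical. Only the first 30 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
sample = "ATGGACGCTC CTGAGAGAGA CGACACTCGA"
print(translate(sample))  # MDAPERDDTR
```

The sample translates to the first ten residues of the listed protein sequence. Because the gene sequence already begins with ATG and ends with TAG, it appears to be given in the coding orientation even though the gene lies on the minus strand, so no reverse complement is applied here.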