Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1366 |
Symbol | |
ID | 8741956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1417611 |
End bp | 1419224 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646511943 |
Product | para-aminobenzoate synthase component I |
Protein accession | YP_003402927 |
Protein GI | 284164648 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01824] aminodeoxychorismate synthase, component I, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC CGCGCGTCGT TACCTCGCTC GCGTCGTTTC GAGCCGCCGC CCATGAGCTG CTCGAGGGTG ACGACGCCAC ACCGACCGAT AATGCGCCGG CTGACAACGT GACGACCGAC GTCGCGACGT CACGAGAACC CGACGTTCGA ATTCCAATCG AAGTCCGCGT CGCCGTCGAC GATCCGTTTC TCGCCTATCG ACGGGCGCGC GATGCCGACG CGGGCGGCGC CTTCCTCGAG ACGACCGGCG GCCAGCCCGG CTGGGGCTAC TTCGGCGTCG ACCCCGTCGA CCGGCTGACG GTCGGGCCCG ACGCGGTCGC GCGAACTGAC GACGAGGATT CTCCGACGCT GGCGGCCCTC GAGGGGCTCC TCGAGCAGGA CCAGCTGGTT CGCGGCGACT GTTCGGTCCC CTACCCCTGC GGGGCGATCG GCTGGCTCTC CTACGACGTC GCCCGCGAAC TCGAGTCCCT TCCCGAGTCG GCCGTCGACG ATCGGGGGCT TCCCCGCCTC GAGATCGGCG TCTACGACCG GCTGGCGGCC TGGGAAGCGC CGACCGACGA CGGTGAGGTG ACGCTGCGGG TGACGGCCTG TCCGCGAATC GCGGTCGGCG ACGGCCGCTC CGACGAGACG CTCGAGGCGG CCTACGAACG CGGCCGCGAC CGGGCGCTCG AGCTCGCGCG GGCCGCCCTC GAGGGCGATC CCGCGGTCGA CGAGCCGCCA GTCGCGACGT CCGAAGCGAC GTTCGAGAGC GACTGCGGCC GCGAGGCGTT CGCCGAGCGC GTCCGTCGAG TCAAGGAGTA CGTCCGTGAC GGCGACACCT TTCAGGCGAA CGTCTCCCAG CGGCTGGTCG CCCCCGCGGC GGTCCACCCC GTCGCGGCCT ACGACGCCCT CCGACGGGTC AACCCCGCGC CGTACTCGGG GCTCCTCGAG TTTCGTGCGG CCGATCTGGT GAGCGCGAGT CCCGAGCTAT TACTGGAACG AAATGGCGAC TTCGTCCGGA CGGAACCCAT CGCGGGCACG CGACCGCGCG GCGAGACGGC CGAAGACGAC CGAGAACTCG AGGAGGACCT CCTGACCGAC GAGAAGGAAC GCGCCGAACA CGCAATGTTG GTCGATCTGG AACGTAACGA CCTCGGGAAG GTCTGCGAGT ACGGCTCCGT GACGGTCGAC GAGTACCGGC GGATCGACCG CTACTCGGAG GTGATGCACC TCGTCTCGAA CGTGACCGGA CGACTGCGCG ACGACGAGTC GCTGGCCGAC GCTATCGCGG CGGTCTTCCC GGGCGGTACG ATCACCGGCG CGCCGAAGCC GCGGACGATG GAAATCATCG ACGAACTTGA GGCGACCCGT CGGGGCCCCT ACACGGGCAG CGTCGGAATC TTCGGTTTCG ACGGGCGGGC GACGCTGAAC ATCGTCATCC GGACGCTCGT CCGCCACGCC GAGGAGTACC ACCTCCGCGT CGGCGCCGGG ATCGTCCACG ACTCCGATCC CTACCGCGAG TACGACGAGA CCCTCGACAA GGCCCGCGCG CTGATCGCGG CCGTCGACGA GGCACTGGGC GAGCGGGCCG GAATGGCGCT CGAGGCTGAA GGCAGAGGTG AGCAGCGTGA GTGA
|
Protein sequence | MSDPRVVTSL ASFRAAAHEL LEGDDATPTD NAPADNVTTD VATSREPDVR IPIEVRVAVD DPFLAYRRAR DADAGGAFLE TTGGQPGWGY FGVDPVDRLT VGPDAVARTD DEDSPTLAAL EGLLEQDQLV RGDCSVPYPC GAIGWLSYDV ARELESLPES AVDDRGLPRL EIGVYDRLAA WEAPTDDGEV TLRVTACPRI AVGDGRSDET LEAAYERGRD RALELARAAL EGDPAVDEPP VATSEATFES DCGREAFAER VRRVKEYVRD GDTFQANVSQ RLVAPAAVHP VAAYDALRRV NPAPYSGLLE FRAADLVSAS PELLLERNGD FVRTEPIAGT RPRGETAEDD RELEEDLLTD EKERAEHAML VDLERNDLGK VCEYGSVTVD EYRRIDRYSE VMHLVSNVTG RLRDDESLAD AIAAVFPGGT ITGAPKPRTM EIIDELEATR RGPYTGSVGI FGFDGRATLN IVIRTLVRHA EEYHLRVGAG IVHDSDPYRE YDETLDKARA LIAAVDEALG ERAGMALEAE GRGEQRE
|
| |