Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4701 |
Symbol | |
ID | 8745297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 292795 |
End bp | 294306 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646515205 |
Product | para-aminobenzoate synthase component I |
Protein accession | YP_003406152 |
Protein GI | 284172770 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01824] aminodeoxychorismate synthase, component I, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGA TCACGGTTAT AACCGATAGC GAATCGTTCG CGGAGACTGC CGAGAGCGCA CCGGACGGCG CTCGCGTCCC GGTCGAAGTA CGCGTCTCCG TCGCCGATCC GTTTGACGCT TATTGCAGGG CCCGGACCGA CGAGGCCGAC GGGTTCTACC TCGAGACGAC CGGCGGGCAG TCTGGCTGGG GCTACTTCGG TGTTGATCCC GTCGAACGGA TTCGAGTAAC TTCGAACGCG GTCGATCGGG ACGGTGGAAG TCCGACGATC CGAACGATCG ACGCGCTCCT CGAGCGCGAG CGGTTGGTAC GCGGAGACTG CTCGGTTCCG TATCCCTGCG GTGCGTTCGG CTGGCTCTCC TACGACGTCG CCAGGGAACT CGAGTCGTTG CCGTCGACGA CGAGCGACGA ACGCGGGTTG CCGCGGCTTC AACTGGGCGT CTTCGACCGG GTCGCCGCGT GGACCGAACC ACGCGACAGC GAAACGGAAC TACGAGTGAC CGCCTGTCCC GTGGTCGACG ACGATCCCGC CGACGCCTAC GAGACCGGTC GAGCGGCGGC TCAATCCCTC GCGCAGGCGG CAATCGACGG CAGCGAAGCG GACCGCAGTC CGCCGGTCGA CGCGACCCGG GCCGCGTTCG AGAGCGAGTG CGGTCGAGCG GCGTTCGCCG ACCGCGTTCG GCGAGTCAAG CGATACATTC GCGACGGCGA TACGTTTCAA GCGAACGTCT CCCACCGACT CGTCGCCCCG GCGGCCGCCC ATCCGGTCGA CGTCTTCGAC GCGGTCCGAC GCGTGAACCC GGCCCCATAC TCGGGACTGC TCGAGTTCCC GGGCGTCGAC CTCGTCAGCG CGAGTCCGGA ACTGCTGCTC GAGGTCCGCG ACGGTTCGCT CGTCACCGAA CCGATCGCCG GAACCAGGCC CCGCGGGCGG ACGCCGGCCG AAGACGAGCG ATTGGAGGCC GATCTCCGCG ACGACGAGAA GGAACGCGCC GAACACGCGA TGCTCGTCGA CTTAGAGCGC AACGACCTCG GGAAGGTCAG CGAGTACGGT TCCGTGTCGG TGACGGACTA TCGCCGCGTG GACCGGTACT CGGAAGTCAT GCACCTCGTC TCCCTCGTCG AGGGAACCTT GCGAGACGAC GCGAGCATCG CCGACGCCGT CGCGGCGGTG TTTCCGGGCG GCACGATCAC GGGCGCGCCG AAGCCGCGGA CGATGGAAAT CATCGACGAA CTCGAGGCGA CCCGCCGCGG CCCCTACACC GGGAGCATCG GGATCTTCGG CTTCGACGAT CGGGCGACGC TGAACATCGT CATCCGAACG CTGGTCCGCC ACGCCGACGA GTACCATCTG CGCGTCGGCG CCGGCGTCGT CCACGATTCC GTTCCCGATC GGGAGTACGA CGAGACGCTC GACAAGGCAC GGGCGCTCGT GACGGCAGTC GACGAGGCCT TGGGTGAGCG GGCGTCGTTC GCCCTCGAGA CCGCAACGGA CGCGGTCGGT GATGCGGGAT GA
|
Protein sequence | MSTITVITDS ESFAETAESA PDGARVPVEV RVSVADPFDA YCRARTDEAD GFYLETTGGQ SGWGYFGVDP VERIRVTSNA VDRDGGSPTI RTIDALLERE RLVRGDCSVP YPCGAFGWLS YDVARELESL PSTTSDERGL PRLQLGVFDR VAAWTEPRDS ETELRVTACP VVDDDPADAY ETGRAAAQSL AQAAIDGSEA DRSPPVDATR AAFESECGRA AFADRVRRVK RYIRDGDTFQ ANVSHRLVAP AAAHPVDVFD AVRRVNPAPY SGLLEFPGVD LVSASPELLL EVRDGSLVTE PIAGTRPRGR TPAEDERLEA DLRDDEKERA EHAMLVDLER NDLGKVSEYG SVSVTDYRRV DRYSEVMHLV SLVEGTLRDD ASIADAVAAV FPGGTITGAP KPRTMEIIDE LEATRRGPYT GSIGIFGFDD RATLNIVIRT LVRHADEYHL RVGAGVVHDS VPDREYDETL DKARALVTAV DEALGERASF ALETATDAVG DAG
|
| |