Gene Information |
Locus tag | Hlac_0459 |
Symbol | |
ID | 7400339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 476421 |
End bp | 478151 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643707523 |
Product | Chorismate binding-like protein |
Protein accession | YP_002565131 |
Protein GI | 222478894 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
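
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of this unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(476421, 478151)
print(length)                     # 1731
print(protein_length_aa(length))  # 576
```

Both values agree with the Gene Length and Protein Length fields of this record.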
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.971281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGCGAT GGGTACACAC CGACCGGGAC CGGTTCCGCG AGGTCGCCGC GGCGGCGCCC GCTGGCGCCC GAGTTCCGGT CGAGCGCCGG GTCGCCGTCG ACGACCCCTT CGCGGCCTAC CGGCGCGCCC GCGACGGCCC GGGCGGGTTC TTCTACGAGA CCACCGGCGG CCAGTCCGGC TGGGGGTACT TCGGCGTCGA CCCGGTCGAG CGGCTCACCG TCTCCGGGGA CGCGGTGGTC GCGTCGGAGG GTAGCCGGTC GAGCGGCGAC TACCGACGCC CGAGTCCCAC TCTCGCGGCG TTGGAGGGGG TTATCGACGG CGAGGCGCTC GCACGCGGCG ACTGCGACGT GCCGTACCCC TGCGGCGCGT TCGGCTGGCT CTCGTACGAC GCCGCCCGCG AGCTGGAGTC GTTTCCGGAG TCGGCGCCCG CGGGCCCCGG CGCGGTCGAC GATCGAGGCC TCCCGCGGCT CCAGATCGGC GTCTTCGACC GCGTGGCGGC GTGGGAGTGT CCGGTTTCCG CGGGTGGCGA TGACTCGCGC GACTCGGCCG CCTCCACCCT CCGCGTGACG GCCTGTCCGC GCGTTCCCGA GGGGCTCGAC GATCCTGACG CCGACCGCGA CGCGCTCGAC GCGCTGTTCG ACGAGGGGGC GTCGCGGGCG GACGACCTGA TTGACCGGAT CGAATCGGGC AACCCCGCTG TCGGGCCGGC ACCCGACCCC GACGCGTCGA CCGCGACCTT CGAGAGCGAC GTGGGTCGAG AAGGGTACGC CGAGGCGGTC AGTCGGGTGA AGGCGTCCAT CCGCGACGGC GACACCTTTC AAGCGAACGT CTCCCAGCGA CTGCGTGCCC CGGCCGCGGT CCATCCGGTC GAGGCGTACG ACGCGCTTCG GACGGTGAAC CCGGCGCCGT ACTCCGGGCT GATCGAGTTC TCGCGGGAGG GGGGTGCGGA GGGGGACAAC GACACCGACG ATAACGACGA TAACGGCGAC AACGACGACG GCGACGACGC CCCGTCCGGC GTCGACCTCG TGAGCGCGAG CCCGGAGCTA CTCTTAGAGC GCGTTCCGAG CGGCGGCGCG GCGGAGGCGG GCGCCGACCG CGACGAGGGT GAACACGGCG CTCGCCTCGT CACGGAGCCC ATCGCTGGCA CCCGCCCGCG GGGGGAGACG CCGGAGGCGG ACGCTGACAT GGAGGCGGAG CTGACCGGCG ACGAGAAGGA GCGCGCGGAA CACGCCATGC TCGTCGACTT GGAGCGCAAC GACCTCGGGA AGGTCTCCCG ATTCGGCACG GTCGACGTGG CGGAGTACCG CCGGGTGGAC CGGTACAGCG AGGTGATGCA CCTCGTGAGC CTGATCGAGG GGGAGGCGCG GTCGGACGTG GGGCTCGCCG ACGCGGTCGC GGCGTGTTTC CCCGGCGGGA CGATCACGGG CGCGCCCAAG CCGCGGACGA TGGAGATCAT CGACGAGTTG GAGGAGACGC GGCGGGGCCC CTACACCGGC TCCATGCTCG CGGCCGGTTT CGACGGGCGG GCGACGCTCA ACATCGTCAT CCGCACGCTG GTCCGGCGGG CGGCCGAGTA CCACCTGCGC GTCGGCGCGG GAATCGTCCA CGACTCCGAG CCCGACGCGG AGTACGAGGA GACGCTCGCG AAGGCCCGGG CGCTCGTGAC CGCGGTCGAC GAGGCGCTCG CGGCCGGCGG GATGGCGGTC GAGGAGGGAG TCGACCGATG A
Protein sequence | MQRWVHTDRD RFREVAAAAP AGARVPVERR VAVDDPFAAY RRARDGPGGF FYETTGGQSG WGYFGVDPVE RLTVSGDAVV ASEGSRSSGD YRRPSPTLAA LEGVIDGEAL ARGDCDVPYP CGAFGWLSYD AARELESFPE SAPAGPGAVD DRGLPRLQIG VFDRVAAWEC PVSAGGDDSR DSAASTLRVT ACPRVPEGLD DPDADRDALD ALFDEGASRA DDLIDRIESG NPAVGPAPDP DASTATFESD VGREGYAEAV SRVKASIRDG DTFQANVSQR LRAPAAVHPV EAYDALRTVN PAPYSGLIEF SREGGAEGDN DTDDNDDNGD NDDGDDAPSG VDLVSASPEL LLERVPSGGA AEAGADRDEG EHGARLVTEP IAGTRPRGET PEADADMEAE LTGDEKERAE HAMLVDLERN DLGKVSRFGT VDVAEYRRVD RYSEVMHLVS LIEGEARSDV GLADAVAACF PGGTITGAPK PRTMEIIDEL EETRRGPYTG SMLAAGFDGR ATLNIVIRTL VRRAAEYHLR VGAGIVHDSE PDAEYEETLA KARALVTAVD EALAAGGMAV EEGVDR
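
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence so it can be compared with the listed protein. It assumes the amino acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). All names are hypothetical helpers, not a library API.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGCAGCGAT GGGTACACAC CGACCGGGAC")
print(translate(prefix))  # MQRWVHTDRD
```

Passing the full Gene sequence field through `clean_sequence` and `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` can be compared against the reported GC content.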