Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Paes_1566 |
Symbol | |
ID | 6459385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | - |
Start bp | 1704669 |
End bp | 1706165 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642725554 |
Product | anthranilate synthase component I |
Protein accession | YP_002016231 |
Protein GI | 194334371 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0133412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00994784 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGATAT CAAGACAGAG CGAAACCTTT ACGCTTCATC CTCTTATCAG CGCTGTCCAT GCTGATACCG AAACCCCTGT TTCAGTCTAT CTCAAGCTTC GTGAACCCTA TTCCTGCCTT CTTGAGTCGG TTGAAGGAGA GGAGCAGCTT GCCCGTTTTT CTATTATTGC GATCGATCCT GTCGCTGTTC TTAAAGGGAC GGTCAATGGG GATGTTTCTA TAGACATCCG CGATAAGCGG TTTGAACGCC TGGCAGCGAT TCCCGGTGAG TCGTCCGGTT TGCGCGATGC TGTCGACCGT TGTCTCGCTC TTTTTAAAAG CAGCGAATTT ACGCCTGACG GTCCTGGCGC ATCACGGATG ATTACATCCG GAGCATTCGG TTATTTTGCC TACGACACCA TGCATCTGGT AGAACGGATT CCTTCTGCCC AACTGCCCGA TCCTGCCGAT CTTCCTGACG TATGCCTCCT GTTATGCGAT AAACTGGTTG TTTTCGACAA TGTCAAGCGC AAGGTGTTTA TCATCGTGAA TTATCTCGAT GAGGCTGATC GTCCGAGGGC TGAAAAAACC ATGGCAGATA TCAGGGCAAG GATGTTTAAT CCCTTATCAG CCCGGGAGCT GATGCTTGTA CCCGAAAAGC CTGAGCCGAT TGTCTCTAAT ACCGAGCGGG AAGCCTATCT GGAGAAGATC CGGGTTGCCA AGGAGTACAT TATGCAGGGG GATATTTTTC AGGTACAGGT TTCGCAGCGT CTCAAACGCC ATCTCAATTC ACGCCCGTTT GACGTTTACC GTATGCTGCG AACCATCAAT CCTTCTCCAT ATCTCTATTA TTTCGATATG GAGGAGTTCA TGATTGTGGG TTCATCTCCT GAACTTCTGG TCAAGGTCGC CGATGATCAC GACGGCAGAA GGATCGTCGA CACCCGTCCG ATCGCAGGAA CCCGCCCGAG AGGTGCGACA TGGGAAGAAG ACCAGCGTAT CGAGAAAGAG CTTTTACGTG ACGAAAAGGA GCTTGCCGAG CACTTGATGC TGATCGACTT GAGCCGTAAC GATATCGGGC GGATAGCTAA AATCGGGACG GTTGAAACCA ATGAGATGAT GATCATCGAG CGCTATTCGC ATGTGATGCA TATTGTCAGC AACGTTCGGG GTGAGCTGCG TGACGAATTG ACGCCGATGG ACGCATTCTG GGCATGCTTT CCTGCCGGAA CCCTGACAGG TGCCCCTAAA GTCAGGGCTA TGGAAATCAT CTATGAACTG GAGCAGGAGA AACGCGGGCT TTATGGTGGA GCGGTCGGAT TTATCGATTT CTGCGGGCAG CTTGAGACCG CTATTGCTAT TCGTACCATG GTGGTTCGTG ACGATACAGT CTATTTCCAG GCGGCCGGTG GTGTGGTTGC CGATTCTCTT GCCGAGAATG AATTCGACGA GACCATGAAC AAAATGAGAG CGGGGCTCAG AACGCTCGAA GCGCTCGAGC ATGTTTCAAA CGATTGA
|
Protein sequence | MSISRQSETF TLHPLISAVH ADTETPVSVY LKLREPYSCL LESVEGEEQL ARFSIIAIDP VAVLKGTVNG DVSIDIRDKR FERLAAIPGE SSGLRDAVDR CLALFKSSEF TPDGPGASRM ITSGAFGYFA YDTMHLVERI PSAQLPDPAD LPDVCLLLCD KLVVFDNVKR KVFIIVNYLD EADRPRAEKT MADIRARMFN PLSARELMLV PEKPEPIVSN TEREAYLEKI RVAKEYIMQG DIFQVQVSQR LKRHLNSRPF DVYRMLRTIN PSPYLYYFDM EEFMIVGSSP ELLVKVADDH DGRRIVDTRP IAGTRPRGAT WEEDQRIEKE LLRDEKELAE HLMLIDLSRN DIGRIAKIGT VETNEMMIIE RYSHVMHIVS NVRGELRDEL TPMDAFWACF PAGTLTGAPK VRAMEIIYEL EQEKRGLYGG AVGFIDFCGQ LETAIAIRTM VVRDDTVYFQ AAGGVVADSL AENEFDETMN KMRAGLRTLE ALEHVSND
|
| |