Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2989 |
Symbol | |
ID | 5734861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3774788 |
End bp | 3776542 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280133 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001545755 |
Protein GI | 159899508 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTGCGA TTTATGGGCG TTTTGATTTT GCTGATGCAA CTGGTCAGCC CCAAGCGTTG GAGTTTCGTC AACCCTTGGC AATCTACCAA GCGACGACCA GTGCCGAGGT GCTGCCGACG ATTCAAGCCG CTCAGGCGGC GGCGCAAGCT GGAGCGTATG TAATTGGCTA TGTGAGCTAC GAGGCAGCAG TCGCCTTTGA TTCGGCCTTG CAATCGTATC CACCAGCCGC ATTGCCGTTG GTCTGGTTTG CGGCCTTTGC TGTGCCTCAA GTGGTTGAAC CAACAACGCA GCACTACCAG CTCTCGCCGT GGCAACCGAC GATTAGCCTT GAACACTACC GCCAAGCGAT CGCGGCGATT CATGCAGCGA TTGCCAGTGG TGAAACCTAT CAAGTTAACT ATACCTTGCG ACTACGAGCC ACGTTTAACG GCGATCCGTT AGCCTTTTAT CATGATTTGC GGGCGGCTCA AGCTGCCAAC TATTGTGCCT ACCTCAATCT TGGCGAGTAT CAAATTCTCT CGGCTTCGCC CGAACTCTTT TTCGATTGGC GTGATCAGCG GTTAACCACC AAGCCAATGA AGGGCACGGC TCCGCGTGGG CGTTGGCCTG AAGAAGATCA ACGCTTGGCC CGTCAATTAT TGGCCTCGGA GAAAAACCGT GCCGAAAATT TGATGATTGT TGATCTGTTG CGCAACGATT TGGGGCGAGT CGCAGCGATT GGCAGCGTTG GCGTGCCACG TTTATTCGAG CTAGAGCGCT ATCGCACCGT TTGGCAACTG ACCTCAACTG TTGCCGCCAA AACCAAGCCG AACACCAGCC TGCTTGATAT TCTGCAAGCG CTCTTTCCCT GTGGCTCGAT CACTGGTGCT CCCAAAGTCA AAACCATGGA ACTGATTCGC CAGTTCGAGG CTGATCCACG GGCGGTTTAT TGTGGGGCGA TTGGCATACT GCGGCCTGAT GGCAGCGCAA CCTTTAATGT GGCCATTCGC ACCGTATGGA TCGACCAACA GCGCCAGCAG GCCGAATATG GCGTGGGCGG CGGTATTACT TGGGATTCGC AGGCTGACGA CGAATATGCT GAAGCCCAAC TCAAAGCTCA GTTATTGACC GAGCGCTGGC CCCAATTTGA TCTGATCGAA ACGCTGCGTT GGGATGGTCA GCGTTACTGG TTGCTTGAAC ACCATCTACG ACGTTTGCAC GATTCGGCGG CCTATTTTGG CTTTGCCTAC GACCAAAGCG CCGTGCTGAA TGCGCTCAAT CAGCATAGCT TTGGCCATTC AACTGCGTTG CGAGTGCGTT TGAACCTCAC CCATACAGGT GATATTGCAA TTAGTAGCAG CCCGCTAACG CCGACCGCCG ATGGCCAAAA GGTGAGCTTG GCGGCAACAG CGGTTAACTC CCAGAACCGC TTCCTCTACC ACAAAACGAC TAACCGCAGA TTGTACGACG AGTACACCCA ACAATCCCCC ACAGATTTTG ATGTGTTGCT ATGGAATGAG CATGGTCAAT TGACCGAATT TACCAGAGGC AACCTCGTGC TTGAACTTGA TGGCCAGCGT TGGACTCCCC CAGTCGAAGT TGGCTTATTG GCCGGAACTT ATCGTGCCGA ATTGTTGCAA CAACGCGCTA TCCAAGAGCG TACTTTAGTC CTAGCCGATC TTTGGGCGGC CAGCAAAATT TGGCTGATTA ATAGCGTTCG TGGCTGGGTA TTAGTTGAGT TAGCTACTAC AGAAGTTACC ATTTCTTGCC AATAA
|
Protein sequence | MSAIYGRFDF ADATGQPQAL EFRQPLAIYQ ATTSAEVLPT IQAAQAAAQA GAYVIGYVSY EAAVAFDSAL QSYPPAALPL VWFAAFAVPQ VVEPTTQHYQ LSPWQPTISL EHYRQAIAAI HAAIASGETY QVNYTLRLRA TFNGDPLAFY HDLRAAQAAN YCAYLNLGEY QILSASPELF FDWRDQRLTT KPMKGTAPRG RWPEEDQRLA RQLLASEKNR AENLMIVDLL RNDLGRVAAI GSVGVPRLFE LERYRTVWQL TSTVAAKTKP NTSLLDILQA LFPCGSITGA PKVKTMELIR QFEADPRAVY CGAIGILRPD GSATFNVAIR TVWIDQQRQQ AEYGVGGGIT WDSQADDEYA EAQLKAQLLT ERWPQFDLIE TLRWDGQRYW LLEHHLRRLH DSAAYFGFAY DQSAVLNALN QHSFGHSTAL RVRLNLTHTG DIAISSSPLT PTADGQKVSL AATAVNSQNR FLYHKTTNRR LYDEYTQQSP TDFDVLLWNE HGQLTEFTRG NLVLELDGQR WTPPVEVGLL AGTYRAELLQ QRAIQERTLV LADLWAASKI WLINSVRGWV LVELATTEVT ISCQ
|
| |