Gene Haur_2989 details


Gene Information

Locus tag: Haur_2989
Symbol:
ID: 5734861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3774788
End bp: 3776542
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 53%
IMG OID: 641280133
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_001545755
Protein GI: 159899508
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
        [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
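
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the coding sequence ends in one untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3774788
end_bp = 3776542
gene_length_bp = 1755
protein_length_aa = 584

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codon_count, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("whole number of codons:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.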


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGCGA TTTATGGGCG TTTTGATTTT GCTGATGCAA CTGGTCAGCC CCAAGCGTTG 
GAGTTTCGTC AACCCTTGGC AATCTACCAA GCGACGACCA GTGCCGAGGT GCTGCCGACG
ATTCAAGCCG CTCAGGCGGC GGCGCAAGCT GGAGCGTATG TAATTGGCTA TGTGAGCTAC
GAGGCAGCAG TCGCCTTTGA TTCGGCCTTG CAATCGTATC CACCAGCCGC ATTGCCGTTG
GTCTGGTTTG CGGCCTTTGC TGTGCCTCAA GTGGTTGAAC CAACAACGCA GCACTACCAG
CTCTCGCCGT GGCAACCGAC GATTAGCCTT GAACACTACC GCCAAGCGAT CGCGGCGATT
CATGCAGCGA TTGCCAGTGG TGAAACCTAT CAAGTTAACT ATACCTTGCG ACTACGAGCC
ACGTTTAACG GCGATCCGTT AGCCTTTTAT CATGATTTGC GGGCGGCTCA AGCTGCCAAC
TATTGTGCCT ACCTCAATCT TGGCGAGTAT CAAATTCTCT CGGCTTCGCC CGAACTCTTT
TTCGATTGGC GTGATCAGCG GTTAACCACC AAGCCAATGA AGGGCACGGC TCCGCGTGGG
CGTTGGCCTG AAGAAGATCA ACGCTTGGCC CGTCAATTAT TGGCCTCGGA GAAAAACCGT
GCCGAAAATT TGATGATTGT TGATCTGTTG CGCAACGATT TGGGGCGAGT CGCAGCGATT
GGCAGCGTTG GCGTGCCACG TTTATTCGAG CTAGAGCGCT ATCGCACCGT TTGGCAACTG
ACCTCAACTG TTGCCGCCAA AACCAAGCCG AACACCAGCC TGCTTGATAT TCTGCAAGCG
CTCTTTCCCT GTGGCTCGAT CACTGGTGCT CCCAAAGTCA AAACCATGGA ACTGATTCGC
CAGTTCGAGG CTGATCCACG GGCGGTTTAT TGTGGGGCGA TTGGCATACT GCGGCCTGAT
GGCAGCGCAA CCTTTAATGT GGCCATTCGC ACCGTATGGA TCGACCAACA GCGCCAGCAG
GCCGAATATG GCGTGGGCGG CGGTATTACT TGGGATTCGC AGGCTGACGA CGAATATGCT
GAAGCCCAAC TCAAAGCTCA GTTATTGACC GAGCGCTGGC CCCAATTTGA TCTGATCGAA
ACGCTGCGTT GGGATGGTCA GCGTTACTGG TTGCTTGAAC ACCATCTACG ACGTTTGCAC
GATTCGGCGG CCTATTTTGG CTTTGCCTAC GACCAAAGCG CCGTGCTGAA TGCGCTCAAT
CAGCATAGCT TTGGCCATTC AACTGCGTTG CGAGTGCGTT TGAACCTCAC CCATACAGGT
GATATTGCAA TTAGTAGCAG CCCGCTAACG CCGACCGCCG ATGGCCAAAA GGTGAGCTTG
GCGGCAACAG CGGTTAACTC CCAGAACCGC TTCCTCTACC ACAAAACGAC TAACCGCAGA
TTGTACGACG AGTACACCCA ACAATCCCCC ACAGATTTTG ATGTGTTGCT ATGGAATGAG
CATGGTCAAT TGACCGAATT TACCAGAGGC AACCTCGTGC TTGAACTTGA TGGCCAGCGT
TGGACTCCCC CAGTCGAAGT TGGCTTATTG GCCGGAACTT ATCGTGCCGA ATTGTTGCAA
CAACGCGCTA TCCAAGAGCG TACTTTAGTC CTAGCCGATC TTTGGGCGGC CAGCAAAATT
TGGCTGATTA ATAGCGTTCG TGGCTGGGTA TTAGTTGAGT TAGCTACTAC AGAAGTTACC
ATTTCTTGCC AATAA
 
Protein sequence
MSAIYGRFDF ADATGQPQAL EFRQPLAIYQ ATTSAEVLPT IQAAQAAAQA GAYVIGYVSY 
EAAVAFDSAL QSYPPAALPL VWFAAFAVPQ VVEPTTQHYQ LSPWQPTISL EHYRQAIAAI
HAAIASGETY QVNYTLRLRA TFNGDPLAFY HDLRAAQAAN YCAYLNLGEY QILSASPELF
FDWRDQRLTT KPMKGTAPRG RWPEEDQRLA RQLLASEKNR AENLMIVDLL RNDLGRVAAI
GSVGVPRLFE LERYRTVWQL TSTVAAKTKP NTSLLDILQA LFPCGSITGA PKVKTMELIR
QFEADPRAVY CGAIGILRPD GSATFNVAIR TVWIDQQRQQ AEYGVGGGIT WDSQADDEYA
EAQLKAQLLT ERWPQFDLIE TLRWDGQRYW LLEHHLRRLH DSAAYFGFAY DQSAVLNALN
QHSFGHSTAL RVRLNLTHTG DIAISSSPLT PTADGQKVSL AATAVNSQNR FLYHKTTNRR
LYDEYTQQSP TDFDVLLWNE HGQLTEFTRG NLVLELDGQR WTPPVEVGLL AGTYRAELLQ
QRAIQERTLV LADLWAASKI WLINSVRGWV LVELATTEVT ISCQ
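
The sequences above are displayed in 10-character blocks, so the spaces and line breaks need to be removed before any analysis. The sketch below is a minimal illustration of that cleanup, with a start/stop codon check and a GC-content helper. The function names `clean` and `gc_percent` are hypothetical, and the start-codon set is a commonly used subset of the initiators allowed by translation table 11, not the full list. Note that the gene begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiation codon in bacteria.

```python
def clean(block_text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base display blocks."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last display rows of the gene sequence, copied from the record.
first_row = clean("GTGAGTGCGA TTTATGGGCG TTTTGATTTT GCTGATGCAA CTGGTCAGCC CCAAGCGTTG")
last_row = clean("ATTTCTTGCC AATAA")

# Assumption: a subset of the table 11 start codons; the full table allows more.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

has_valid_start = first_row[:3] in START_CODONS
has_valid_stop = last_row[-3:] in STOP_CODONS

print("start codon:", first_row[:3], "accepted:", has_valid_start)
print("stop codon:", last_row[-3:], "accepted:", has_valid_stop)
```

To compare against the GC content listed in the record, the whole gene sequence (all rows, not just the two used here) would be passed through `clean` and then `gc_percent`.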