Gene Haur_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2962 
Symbol 
ID5734834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3736977 
End bp3738494 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content52% 
IMG OID641280106 
Productanthranilate synthase component I 
Protein accessionYP_001545728 
Protein GI159899481 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00182854 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCGC CAACATTTGA ACAAGTGCAA GCATGGGCTG CGGCTGGCTA CACCCAATGT 
GCAGTCTATC GTGAGTTAGT CGCCGATTTG GAAACGCCAG TGTCGGCGTA CTTGAAGGTT
GCTCAGGGTC ATTATAGTTT CTTGCTTGAA AGTGTTGAGG GTGGCGAGCA AATTGGTCGC
TATTCATTTA TTGGCTGTGA GCCTCATTTA ATTATTCGTG GCCTTGGCCA GCAAAGTATT
ATTGAAACTG CCAATGGCGA ACGTACCAGC TTTGATGATC TGACGACGCT TGATCAATTA
GAACGCTTGG TGGTTGGCAC GCAACGGGCT AACCCTGCGC CTCAACCTGA TTTGCCGCGC
TTTACGGGCG GGGCGGTTGG CTTTTTGGGC TATGAAACTG TGCGCACGTT TGAACGTTTG
CCCGCGCCAA CCCTGCGCCC ATTGCAAATT CCCGATGGCG TGTGGATGGT GGTCAAAACG
GTTTTGGCTT TCGACCATGT GCGCCATACG ATCAAAATTA TGAGCACACT GGTGTTTGAT
TCGGCGGTAG ATTTGGCAAC TCAATTTGCC GAAGCCAACC AAGCGATCGA AGCCATGACC
AAAAAATTGG TACGGCCATT AGCACCCGAA GTCTATAGCT CAAGCGCCGC CTCGCCCAGC
TTACCCGAAC TCAACGAGCA ATTGCAATCA AATCAATCGT TTAGCGAATT TAGCACCGCG
ATTGAAAAGG CTCGCGAATA TATTCGGGCT GGCGATATTT TTCAAGTGGT GTTATCGCAG
CGTTTTCAGC GTGAAACCGA TGCCGAGCCA TTTGCGGTCT ATCGAGCACT GCGCACGGTC
AATCCATCAC CATACATGTT CTTTTTGAAT GTGCCTGATG CGGCGATTAT TGGGGCATCG
CCCGAAATGT TGGTGCGGGT TGAAGATGGC ATTATTGAGA CCCACCCAAT TGCTGGTACG
CGCCGCCGTG GCCGCGATGC CGATGACGAA GCTCGCATGC AGGCTGAATT ATTAGCCGAC
GAAAAAGAAC GGGCTGAGCA TTTGATGCTG GTTGATCTTG GGCGCAACGA TGTGGGGCGG
GTTTCGCTGC CTGGCACCGT CCACGTGCCC AAATTTATGC AAATTGAAAA ATATTCGCAT
GTGATGCACT TAGTTTCGGT GGTCAAAGGC ACGCTGGATA CCTCGCGCTA CTCGCCATTG
CATGCCTTAC GCGCCTGCTT CCCTGCTGGC ACGCTGACTG GTGCGCCCAA AGTGCGGGCC
ATGGAAATTA TCGCTGAATT AGAGCCAAGC CAACGCGGGC CATATGGTGG TTGCGTTGGC
TATGTCTCGT TTGGCGGGTT GTCGCTTGAC ACAGCGATTA CTATTCGCAC AATGGTTATC
AAAGATGGCG TAGCCTATAT GCAAGCTGGC GCGGGGATTG TCGCCGATAG CGATGTTAAA
TTGGAAGATC TCGAAACCCG CAACAAAGCT GGTTCGCTGA TTCGCGCCTT GCACGTCGCC
GAGATGTTGG AGTTGTAA
 
Protein sequence
MASPTFEQVQ AWAAAGYTQC AVYRELVADL ETPVSAYLKV AQGHYSFLLE SVEGGEQIGR 
YSFIGCEPHL IIRGLGQQSI IETANGERTS FDDLTTLDQL ERLVVGTQRA NPAPQPDLPR
FTGGAVGFLG YETVRTFERL PAPTLRPLQI PDGVWMVVKT VLAFDHVRHT IKIMSTLVFD
SAVDLATQFA EANQAIEAMT KKLVRPLAPE VYSSSAASPS LPELNEQLQS NQSFSEFSTA
IEKAREYIRA GDIFQVVLSQ RFQRETDAEP FAVYRALRTV NPSPYMFFLN VPDAAIIGAS
PEMLVRVEDG IIETHPIAGT RRRGRDADDE ARMQAELLAD EKERAEHLML VDLGRNDVGR
VSLPGTVHVP KFMQIEKYSH VMHLVSVVKG TLDTSRYSPL HALRACFPAG TLTGAPKVRA
MEIIAELEPS QRGPYGGCVG YVSFGGLSLD TAITIRTMVI KDGVAYMQAG AGIVADSDVK
LEDLETRNKA GSLIRALHVA EMLEL