Gene Haur_4467 details


Gene Information

Locus tag: Haur_4467
Symbol:
ID: 5736318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5713299
End bp: 5714372
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 50%
IMG OID: 641281630
Product: aminodeoxychorismate lyase
Protein accession: YP_001547227
Protein GI: 159900980
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
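
The coordinates and lengths listed in this record agree with each other. The sketch below shows the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (both are assumptions; the record does not state its conventions). The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5713299, 5714372)
print(length, protein_length(length))  # 1074 357
```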


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000156063
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGACGTT TAATTCGAGC TTTGTTGATT GTTGCCACCA TCGGCGCACT AGTTGTCGCG 
TGTGTTGCAA CCCTGTTTCT GCGTGAGTTA ACCCAGCCTG CTGGCGAGAG CAATATTGCC
CAAGATTTTA CCATCGCTCC CAGCGAAAGT TTGGCGGTGA TCAGCAGCAA TCTGGAATCT
GAAGGTTTGG TGCGGCGGGC AATTGTTTTC CGCGTATTCG CCGATTTACG TAATGCCGAG
ACCGATTTGT ATCCTGGCAC CTACAAAATT AGCCCAAATA TGACGATCAA TCAGATTTTA
GAGATGTTTC GGGTTGCCCC AGAAGTTCAA ACTGCGGTGC GCTTTACCGT GCCTGAAGGC
TTGCGGATTG AAGAAATCGC GGCGGTGATT GAATCGACTG GCGTAGTTAG TGCCGATGAT
TTCTTGGCTG TGGCCCGCGA TGGCTCGCAA TTTAAGGCCG ATTATAGCTT TTTATCCAGC
TTGCCAGATA GCGCAACCTT GGAAGGCTAT CTCTTCCCTG ATACCTATGA AATCTTTTCT
GATGCAACCA GCGAAGAGAT TATTCGCAAA ATGCTCGATA CCTTTGCAAT TCGCTGGGCT
GATTCGCCGC TGAGCAGCGC CACGACCGGG CGTTCTGTCC ATGAAGTGGT GACTTTAGCC
TCGATTGTGC AGCGTGAAGC CAGCAATAAC GAAGAAATGC CACGGATTGC TGCCGCCTTC
TGGAATCGCC TGAAACCAGA ATTTGCTGGC AATCAGCTGG GAGCCGATCC GACAATTCAA
TATATTTTAG GCGAATCAGG CAATTGGTGG CCAAAGCTTG ATCAGCTAAC GGTTGAACAA
ATTAATAGTG CTGCTGGCCC TTATAACACA CGGGTCAACC CCGGCTTGCC ACCTGGGCCA
ATTAGTGCGC CTGGTTTGTT TGCCTTGCAA GCCGCTGCCT CGCCTGCCGC CGAAGATGTG
ACCTATTTTG TGACCAAGTG TGTGGCTGCT GGCGAACGCC CAACCCACAA CTTTACCAAC
GACTATAGCG AATTTTTGCA ATTTCAAGAA GAGTTTTTGG CGTGTCCCAA ATAG
 
Protein sequence
MRRLIRALLI VATIGALVVA CVATLFLREL TQPAGESNIA QDFTIAPSES LAVISSNLES 
EGLVRRAIVF RVFADLRNAE TDLYPGTYKI SPNMTINQIL EMFRVAPEVQ TAVRFTVPEG
LRIEEIAAVI ESTGVVSADD FLAVARDGSQ FKADYSFLSS LPDSATLEGY LFPDTYEIFS
DATSEEIIRK MLDTFAIRWA DSPLSSATTG RSVHEVVTLA SIVQREASNN EEMPRIAAAF
WNRLKPEFAG NQLGADPTIQ YILGESGNWW PKLDQLTVEQ INSAAGPYNT RVNPGLPPGP
ISAPGLFALQ AAASPAAEDV TYFVTKCVAA GERPTHNFTN DYSEFLQFQE EFLACPK
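
The protein sequence corresponds to a translation of the gene sequence under translation table 11, where the GTG start codon is read as methionine. The sketch below is a minimal illustration in plain Python, not the tool that produced this record. The start-codon set is the one commonly listed for NCBI table 11 and is included here as an assumption, and all function names are hypothetical. A GC-fraction helper is included for checking the listed GC content against the full sequence.

```python
# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may start
# translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11; an initiating codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases; upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGACGTT TAATTCGAGC TTTGTTGATT"))  # MRRLIRALLI
```

Passing the full gene sequence to translate_cds and gc_fraction in the same way allows a comparison with the protein sequence and GC content given in this record.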