Gene Haur_4081 details


Gene Information

Locus tag: Haur_4081
Symbol:
ID: 5735940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5213440
End bp: 5214510
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 641281233
Product: chorismate synthase
Protein accession: YP_001546841
Protein GI: 159900594
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
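
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5213440, 5214510)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both values match the Gene Length and Protein Length fields of this record.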


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGGAA ATAGTTTTGG TCGGTTGTTC CGAATTTCAA CATGGGGCGA ATCCCATGGA 
GTTGGCTTGG GCGTGGTGAT CGATGGTTGC CCCGCAGGTT TGGAGCTTGA TTTGGCTGCC
ATCCAAGCCC AATTGGATCG CCGTCGGGTT GGACAAAGCC GCATGACTTC GGCTCGGCGG
GAGCCTGATG AAGTTGAGAT TTTATCGGGC ATGTTTGAGG GTCGCACGAC TGGTACAGCC
TTGGCAATGC TGATTCGCAA TACCAACGCC CGATCCAGCG ATTACGATGC AATCAAACAT
TTATATCGAC CTGGCCATGC TGACCATAGT TACGATGCTA AATATGGCTT CCGTGATTAT
CGTGGTGGTG GTCGTTCGAG CGCACGCGAA ACCGCCGCGC GGGTTGCGGC TGGCGCAGTC
GCCCGCCAAA TCTTGGCCAC AATGGGCATT AGCTTGGTGG CCTATACGCT GAGTTTAGGC
CATCTCAAAG CCCAAATCAT CGACGAAAAC GAAATTGAAA ATAACATTAT GCGCTGCCCA
GACCCCGCTG TGGCCGAGCA GATGATCGCC TATGTCGATC AAGCCCGCCG CGATTTGGAT
TCGCTGGGTG GCGTGGTTGA GGTGCGGGCA CGTGGAGTTC CGGCTGGGCT AGGCGAGCCA
GTGTTTGATA AACTTGATGC TTTGATTGGT CATGCCATGT TTAGTATTCC CGCAGTCAAA
GCAGTCGAAA TTGGCTCAGG CATCGAGGCA GGCAATGCCC GTGGTTCGCA AAATAACGAT
CCATTTATCC AGCGAGCAGA TGGTAGCATT GGCACAAGCA GCAACCATGC TGGCGGGATT
TTGGGTGGCA TCAGCAGCAG CGAGGAGATT GTGGTGCGCC TGACGGCCAA ACCACCAGCT
TCAATCGCCC AAGAACAAAC CACGGTCGAT CAAGCGGGCG AACCTGCCAC AATTGTGGTC
AAAGGCCGCC ACGACCCAAC CGTCTTGCCG CGTTTAGTGC CAGTTGCCGA GGCGATGTTG
GCCTTGGTGC TGGTCGATTG TGTCTTGCAA CAACGTGCCG CCCGATTGTA G
 
Protein sequence
MPGNSFGRLF RISTWGESHG VGLGVVIDGC PAGLELDLAA IQAQLDRRRV GQSRMTSARR 
EPDEVEILSG MFEGRTTGTA LAMLIRNTNA RSSDYDAIKH LYRPGHADHS YDAKYGFRDY
RGGGRSSARE TAARVAAGAV ARQILATMGI SLVAYTLSLG HLKAQIIDEN EIENNIMRCP
DPAVAEQMIA YVDQARRDLD SLGGVVEVRA RGVPAGLGEP VFDKLDALIG HAMFSIPAVK
AVEIGSGIEA GNARGSQNND PFIQRADGSI GTSSNHAGGI LGGISSSEEI VVRLTAKPPA
SIAQEQTTVD QAGEPATIVV KGRHDPTVLP RLVPVAEAML ALVLVDCVLQ QRAARL
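
The protein sequence is the translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows. It assumes the sequence is given on the coding strand in frame 1, and it ignores the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
# Amino acid assignments for table 11, codons ordered with bases T, C, A, G
# (these assignments are the same as in the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCTGGAA ATAGTTTTGG TCGGTTGTTC"))  # MPGNSFGRLF
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the complete protein.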