Gene Haur_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0970 
Symbol 
ID5732856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1112376 
End bp1113671 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content55% 
IMG OID641278102 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001543746 
Protein GI159897499 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA CAGTTAGTCA TGCCAAGCGC CTGCGCGGGG CAATTAGCGT CCCAGGTGAT 
AAATCGATCT CGCATCGCTC GGTGTTATTT AATGCTTTAG CCGAGGGCAA CGCCGAAATT
ACGGGGTTCT TGCCAGGCGC TGATTGTCTT TCGTCAATCG CTTGTTTGCG CCAAATGGGC
GTTGAAATTG AACACAGCGA CGATAAGGTA CGGGTGTTCG GGCGGGGTTT GCGTGGCCTG
CGTGAGCCAA GCGACGTTTT AGATTGTGGT AATTCGGGTA CAACGCTCCG TTTGTTGGCA
GGTTTATTGG CTGGTCAGCC ATTTTTGAGC GTGCTAACTG GCGATGCGTC GTTGCGTTCA
CGCCCTCAGA AACGCATTGT TGAACCATTA CGCCAACTAG GAGCCAAGCT CGATGGCCGC
GATAACGGCA ACCGTGCACC CTTGGTGATT CGTGGCACAA CCATTCATGG TGGCAACTAT
GAATTGCCGA TCGCCAGTGC TCAAGTTAAA TCGGCCTTGC TCTTGGCTGG TTTAACTGGC
GATGCGCCAA TGCGTTTATC GGGCAAAATC GTTAGCCGCG ACCATACCGA GCGCATGTTG
ATCGCCATGG GAATTGATCT CACCGTTAAA GATGATGAGA TTGTGCTCTA TCCACCGAGC
CATCCGGTTT TCCCCTATCC GCTTTCGTTG CATGTTCCAG GCGATCCTTC GTCGGCAACC
TTTTGGTGGG TAGCCGCAGC GATTCACCCC GATGCCGAAA TTACCACCTT GGGCGTTGGA
TTAAACCCCA GTCGCACTGG AGCGCTCGAT GTGCTCAAGG CCATGGGCGC TGATATTACG
ATCAGCAATG AGCGCAATGA AGGTGCAGAG CCTGTTGGCG ATGTAACCGT GCGTGGCGGT
GGCTTACGAG GCACACGCAT CGATGGCGAT TTAATTCCGC GTTTGATCGA TGAAATTCCG
GTGCTGGCGG TGGCGGCAGC CTGTGCAGTT GGCGAAACCG TGGTTGCCGA TGCCGAAGAA
CTGCGGGCCA AAGAAACCGA TCGGGTAGCC ACAGTGGTTA GCGAACTAAC AGCCATGGGT
GCGACCCTCG AAGCCACACC CGATGGCATG ATCATCGCTG GTGGTGGCGA ACTCCAAGGC
GCTCACGTTC AATCGCATGG TGATCATCGC ATCGCGATGG CCTTGGCGGT GGCTGGCTTA
GTGGCCGAAG GCGAAACGAT TATCGACGAA GCTGAAGCCG TGACCGTCTC GTACCCAACA
TTCTGGCAGC ATTACGCGCA GATCAAAGAA GCCTGA
 
Protein sequence
MKQTVSHAKR LRGAISVPGD KSISHRSVLF NALAEGNAEI TGFLPGADCL SSIACLRQMG 
VEIEHSDDKV RVFGRGLRGL REPSDVLDCG NSGTTLRLLA GLLAGQPFLS VLTGDASLRS
RPQKRIVEPL RQLGAKLDGR DNGNRAPLVI RGTTIHGGNY ELPIASAQVK SALLLAGLTG
DAPMRLSGKI VSRDHTERML IAMGIDLTVK DDEIVLYPPS HPVFPYPLSL HVPGDPSSAT
FWWVAAAIHP DAEITTLGVG LNPSRTGALD VLKAMGADIT ISNERNEGAE PVGDVTVRGG
GLRGTRIDGD LIPRLIDEIP VLAVAAACAV GETVVADAEE LRAKETDRVA TVVSELTAMG
ATLEATPDGM IIAGGGELQG AHVQSHGDHR IAMALAVAGL VAEGETIIDE AEAVTVSYPT
FWQHYAQIKE A