Gene Sare_4680 details

Gene Information

Locus tag: Sare_4680
Symbol:
ID: 5704307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5302284
End bp: 5303489
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 67%
IMG OID: 641274078
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_001539424
Protein GI: 159040171
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
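
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5302284
END_BP = 5303489
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5303489 - 5302284 + 1 gives 1206 bp, and 1206 / 3 - 1 gives 401 aa, which agrees with the listed lengths.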


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.211865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.133112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAAG CGATCGACCG ACCCCAGTCG AGCGACGAGG TCGACGCCGA CCTGCTGGTC 
GGCGCCGTAG ACCACGACAT CAGCCGGGAT CCGTTCCCGG TCAGGGGCCT CGACCACGTG
CACTTCCTGG TGGGCAACGC CAAACAGGCC GCGCACTACT ACTCCACCGC GTTCGGCATG
ACGTGCGTGG CCTACCGGGG GCCGGAACAG GGCCACCGAG ACCACGCCCA GTACGTCCTG
ACCAGTGGCT CGGCCCGGTT CGTTCTCACC GGCACGGTCC GCCCCGACGC GGCGGGTGCC
GAGCAGGTCG CCCGGCACAG CGACGGCGTC TCCGACATCG CACTGGAGGT CCCGGACGTC
GACGCGGCGT ACGCGCACGC CATCGCCCAG GGCGCGAGCG GTCTGGCGGA GCCGTACGAC
GTCAGCGACG AACACGGCAC CGTCCGGCTG GCAGCCATCG CGACGTACGG CGACACCCGC
CACACCCTGG TCGACCGCTC CCGTTACCGC GGTCCGTTCC TGCCCGGCTA CGTCGCCCGG
CAGCCGATCG TCGATCGTCA GCCGATGGTC AACGCAGGTC TCCAGCCCAA GCGCTTCTTC
CAGGCGATCG ACCACATCGT CGGCAACGTC GAGCTGGGCC GCATGGACGA GTGGGTCGAG
TTCTACCGGC GTGTGATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA CGACATCGCC
ACCGACTATT CGGCGCTGAT GAGCAAGGTG GTCGCCAACG GCACCCGGAA GGTGAAGTTC
CCGCTCAACG AGCCGGCGGT CGCCCGGAAG AAGTCGCAGA TCGACGAGTA CCTGGAGTTC
TATCAGGGCC CGGGAGCCCA GCACATCGCG GTGGCCACCA ACGACATTCT GGCCAGCGTG
GACGCGATGC GCGCGGCCGG GGTCGAGTTC CTGGACACCC CGGACTCGTA CTACGACGAC
CCGGAACTAC GTGCCCGGAT CGGTGAGGTG CGGGTGCCGA TCGAGGAGCT GAAGGCCCGC
GGGATCCTGG TTGACCGGGA CGAGGACGGC TACCTGCTCC AGATCTTCAC CAAGCCGGTG
CAGGACCGCC CAACCGTCTT CTTCGAGCTG ATCGAGCGAC ACGGCTCACT CGGCTTCGGC
AAGGGCAACT TCAAGGCACT CTTCGAGGCC ATCGAACGGG AACAGGAGAA GCGCGGCAAC
CTGTGA
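
The gene sequence is printed in blocks of ten bases, so it needs to be joined before it can be measured. The following is a minimal sketch of how a GC fraction such as the one reported in this record could be computed; how the database itself derives its figure is not stated here, and `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as an example input.
first_line = "ATGACCCAAG CGATCGACCG ACCCCAGTCG AGCGACGAGG TCGACGCCGA CCTGCTGGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives the whole-gene length and GC fraction.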
 
Protein sequence
MTQAIDRPQS SDEVDADLLV GAVDHDISRD PFPVRGLDHV HFLVGNAKQA AHYYSTAFGM 
TCVAYRGPEQ GHRDHAQYVL TSGSARFVLT GTVRPDAAGA EQVARHSDGV SDIALEVPDV
DAAYAHAIAQ GASGLAEPYD VSDEHGTVRL AAIATYGDTR HTLVDRSRYR GPFLPGYVAR
QPIVDRQPMV NAGLQPKRFF QAIDHIVGNV ELGRMDEWVE FYRRVMGFTN MAEFVGDDIA
TDYSALMSKV VANGTRKVKF PLNEPAVARK KSQIDEYLEF YQGPGAQHIA VATNDILASV
DAMRAAGVEF LDTPDSYYDD PELRARIGEV RVPIEELKAR GILVDRDEDG YLLQIFTKPV
QDRPTVFFEL IERHGSLGFG KGNFKALFEA IEREQEKRGN L
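
The protein sequence can be compared against a translation of the gene sequence. The sketch below uses the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG. `translate` is a hypothetical helper, and codons containing ambiguous bases would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACCCAAG CGATCGACCG ACCCCAGTCG"))  # MTQAIDRPQS
```

The ten residues printed match the first block of the protein sequence in this record.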