Gene Strop_4250 details


Gene Information

Locus tag: Strop_4250
Symbol:
ID: 5060735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4818443
End bp: 4819648
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 68%
IMG OID: 640476512
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_001161056
Protein GI: 145596759
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
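
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4818443
END_BP = 4819648

def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    """Length in aa, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under those assumptions the computed values agree with the listed Gene Length (1206 bp) and Protein Length (401 aa).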


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.365692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0823259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGG CGATCGACCG ACCCCAGACG AGTGACGAGG TCGACGCCGA CCTGCTGGTC 
GGCGCCGTTG ACCACGACAT CAGCCACGAC CCGTTCCCGG TCAAGGGTCT CGACCACGTG
CAGTTCCTGG TCGGCAACGC CAAGCAGGCC GCGCACTACT ACTCCACCGC CTTCGGCATG
ACCTGCGTGG CCTACCGGGG GCCGGAGCAA GGCTACCGGG ATCACGCTCA GTACGTGCTG
ACCAGTGGTT CGGCCCGCTT CGTCCTCACC GGCGCGGTCC GCCCGGACGC GGCCGGTGCC
GAGCAGGTCG CCCGGCACAG CGACGGGGTC TGCGACATCG CGCTGGAGGT CCCCGACGTT
GACGCGGCGC ACGCGCACGC CATCGCCCAG GGCGCGATCA GCCTTGCTGA GCCGTACGAG
GTCAGCGACG AACACGGCAC GGTCCGGCTC GCCGCCATCG CCACGTATGG TGACACCCGC
CACACCCTGG TGGACCGCTC CCGCTACCAC GGCCCGTTCC TACCCGGCTA CGTCGCCCGC
CGACCGATCG TCGACCGCCA GCCAATGATC GACGCTGGCG TCCAGCCGAA GCGCTTCTTC
CAGGCGATCG ACCACGTCGT CGGCAACGTC GAGCTGGGTC GCATGGACGA GTGGGTCGAG
TTCTACCAGC GGGTGATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA CGACATCGCC
ACCGACTACT CGGCGCTGAT GAGCAAGGTC GTCGCCAACG GCACCCGGAA GGTGAAGTTT
CCGCTCAACG AGCCGGCGGT CGCCCGGAAG AAGTCGCAGA TCGACGAATA CCTGGACTTC
TACCAGGGCC CCGGGGCCCA GCACATCGCG GTGGCCACCA ACGACATCCT GGCCAGCGTG
GACGCGATGC GCGCGGCAGG CGTGGACTTC CTGGACACCC CCGACTCGTA CTACGACGAC
CCGGAGCTGC GGGCCCGGAT CGGCGAGGTC CGGGTTCCGA TCGAGGAGCT GAAGGCCCGC
CGGATCCTGG TCGACCGGGA CGAGGACGGC TACCTGCTCC AGATCTTCAC CAACCCGGTG
CAGGACCGCC CGACCGTCTT CTTCGAGCTG ATCGAGCGAC ACGGCTCGCT CGGCTTCGGC
AAAGGCAACT TCAAGGCGCT CTTCGAGGCC ATCGAGCGGG AGCAGGACAA GCGCGGCAAC
CTGTGA
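
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the grouped text could be parsed and its GC fraction recomputed for comparison with the listed GC content. The helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def parse_grouped_sequence(text):
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(text.split()).upper()

def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)

# First line of the gene sequence, copied as printed above.
first_line = "ATGACCCAGG CGATCGACCG ACCCCAGACG AGTGACGAGG TCGACGCCGA CCTGCTGGTC"
seq = parse_grouped_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence block to `parse_grouped_sequence` works the same way, because `str.split()` also splits on line breaks.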
 
Protein sequence
MTQAIDRPQT SDEVDADLLV GAVDHDISHD PFPVKGLDHV QFLVGNAKQA AHYYSTAFGM 
TCVAYRGPEQ GYRDHAQYVL TSGSARFVLT GAVRPDAAGA EQVARHSDGV CDIALEVPDV
DAAHAHAIAQ GAISLAEPYE VSDEHGTVRL AAIATYGDTR HTLVDRSRYH GPFLPGYVAR
RPIVDRQPMI DAGVQPKRFF QAIDHVVGNV ELGRMDEWVE FYQRVMGFTN MAEFVGDDIA
TDYSALMSKV VANGTRKVKF PLNEPAVARK KSQIDEYLDF YQGPGAQHIA VATNDILASV
DAMRAAGVDF LDTPDSYYDD PELRARIGEV RVPIEELKAR RILVDRDEDG YLLQIFTNPV
QDRPTVFFEL IERHGSLGFG KGNFKALFEA IEREQDKRGN L
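
The protein sequence can be related back to the gene sequence by codon translation. The sketch below assumes the forward reading frame starting at the first base, and uses the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons). The `translate` helper is illustrative only and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three groups of the gene sequence, copied as printed above.
print(translate("ATGACCCAGG CGATCGACCG ACCCCAGACG"))
```

The translated fragment can be compared with the first ten residues of the protein sequence listed above.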