Gene Sare_2673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2673 
Symbol 
ID5706984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3048261 
End bp3049370 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content70% 
IMG OID641272131 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001537501 
Protein GI159038248 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000192911 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATATCC GTGGAATAGA CCACATCGAA CTCTACGTGG GTGACGCCCG GCAGGCCGCC 
TTCTACTTCG GCAACGCGGT GGGAATGCGG CTGTGCGGCC AGGGTGGCCC GGAGACCGGA
CTGACCGGGC AGCGTTCGTT GCTGCTGCGG CACGCTGGTG TCCGGTTGTT GCTGACCTCG
GGGCTGACCG CCGACCATTC GGCGGCGGCG TACGTGCGAC GGCACGGCGA CGGTATCGCC
GTGGTCGCGA TGGAGGTCGA CGACGCCGCC GGGGCGTACG CCGAACTGTT GGCCAGGGGT
GCGACCGGCG GGACACCCCC GACCACCGTC ACCAGCGCCG ACGCCGAGGT CGTCGTTGCC
GAGGTGGACG GTTTCGCCGA TGTGCGGCAC CGGCTGGTCG AGCGTCGCCG GGGCGGACCC
GACTTCCTGC CGGGCCTGGC GGAGCTGCCG CCGGTGGACG ACACCGCCGA GAACCTGCTC
GCCGAGATCG ACCACCTGGC GGTGTGTGTA CCGCCCGGGC AACTCGCCGA AACGGTCCGT
GGCTACCGGG AGGTGTTCGG ATTCGCCGAG ATCTTCCACG AGTACGTGGA GGTCGACGGT
CAGGCGATGA ACTCCACTGT GGTGCAGAGC CGGTCCGGGC GGGTGACGTT GGTGCTGCTC
GAACCAGACA CCACGCGGCG GGCCGGGCAG ATCGACGCGT TCCTCACCCA GCACGCCGGT
GCGGGGGTGC AGCACCTCGG GCTGCGCACC GACGACATCG TCGAAGCGGT CACCGCGCTG
CGCCAGCGCG GGGTGGGATT CGCGCGTACC CCGGCGGCCT ACTACGACGA TCTGGAGACC
CGGGTCGGCC GGGTCGACGG CTCACTGGAC CGGCTGCGGG AACTCGGCGT GCTGGTTGAC
CGGGACCACG ACGGTCAGTT GCTGCAGATC TTCACAGAGT CGATGCACGT GCGCCGCACC
CTCTTCCTCG AGTTGATCGA GCGGCGCGGG GCGCGGACCT TCGGCAGCGG CAACATCAAG
GCGCTCTACG AGGCCAAAGA ACGGGAACTG GCCGTGGCGG GGGCGCTCCC CGCCGTCAGT
GCGGCCACCG GCCAGGAGGT GACGGCATGA
 
Protein sequence
MDIRGIDHIE LYVGDARQAA FYFGNAVGMR LCGQGGPETG LTGQRSLLLR HAGVRLLLTS 
GLTADHSAAA YVRRHGDGIA VVAMEVDDAA GAYAELLARG ATGGTPPTTV TSADAEVVVA
EVDGFADVRH RLVERRRGGP DFLPGLAELP PVDDTAENLL AEIDHLAVCV PPGQLAETVR
GYREVFGFAE IFHEYVEVDG QAMNSTVVQS RSGRVTLVLL EPDTTRRAGQ IDAFLTQHAG
AGVQHLGLRT DDIVEAVTAL RQRGVGFART PAAYYDDLET RVGRVDGSLD RLRELGVLVD
RDHDGQLLQI FTESMHVRRT LFLELIERRG ARTFGSGNIK ALYEAKEREL AVAGALPAVS
AATGQEVTA