Gene Sros_6970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6970 
Symbol 
ID8670280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7682936 
End bp7684024 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content68% 
IMG OID 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003342414 
Protein GI271968218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.91426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGT CAGGAACGCA GATCTTCGAA GACATGTTGC TCGATCACGT TGAGTTCTAC 
GTCAACGAGC TCGCGGCGAA AACGGACTGG TTCGTCGACA GCTTCGGCTT CTCGGTGTAC
GCGACCACGG ACGCGTCGGA GAAGGAGCCC GAGGTCCGCT CGGTGGGGCT CGGCGGCAAC
CGGATCCGGC TCGTGCTGAC CGAGCCGCTG GTGGGCGACC ATCCGGGTGC CGCCTACGTG
GAGAAGCACG GCGACGGAGT GGCCGACATC GCGCTGCGGG TCACGGACGC CGCCGCGGCG
TTCGACGAGG CCGTGCGGCG CGGCGCCCGC CCGGTGTCCC CGCCGGCCGG GTACGACGGC
GTCGTGACAG CCACGATCAT GGGCTTCGGC GACGTGGCGC ACACCTTCGT GCAGCGCGCG
GGTGACACGG ACGAGCGCGC GCTTCCCGGG CTGCGGCCGG TGTACGGGTC GGCGTCCGGC
ACGGGCGGCA ACCTGGACGA GGTGGACCAC TTCGCGGTCT GCGTGGAGTC CGGCCAGATC
GACGCGACAG TCGACTTCTA CCGGCACATT CTCGACTTCG AGCTGATCTT CACCGAGCAC
ATCGTCGTCG GCTCCCAGGC GATGACCATC AAGGTGGTGC AGAGCAGGTC CGGCGCGGTG
ACGCTGACCC TGATCGAGCC GGACGTGTCA CAGGTCGCCG GCCACATCGA CGAGTTCCTC
AAGCACCACG GCGGTGCCGG CGTGCAGCAC ATGGCGTTCA CGGCCGGCGA CATCGTGGAG
GCGGTGGGCA CCATCGGTGC CCGGGGCGTG GAGTTCCTGA GCACCCCGGA CGCCTACTAC
AGCCTGCTCC CGGAGCGGAT GGAGCTGGGA CGGTACTCCG TCGACGAGCT GCGGAGGCTC
AACATCCTGG TCGACGAGGA CCACGACGGC CAGCTCTACC AGATCTTCGC CCGATCCGTG
CACCCGCGTA ACACGTTCTT CCTGGAGCTC ATCGAGCGGC TGGGGGCGCG TTCCTTCGGC
AGCGGCAACA TCTCGGCGCT CTACCAGGCG GTGGAGCTCC AGCAGAGCAG GGAAGAGGCC
GCCGCCTGA
 
Protein sequence
MASSGTQIFE DMLLDHVEFY VNELAAKTDW FVDSFGFSVY ATTDASEKEP EVRSVGLGGN 
RIRLVLTEPL VGDHPGAAYV EKHGDGVADI ALRVTDAAAA FDEAVRRGAR PVSPPAGYDG
VVTATIMGFG DVAHTFVQRA GDTDERALPG LRPVYGSASG TGGNLDEVDH FAVCVESGQI
DATVDFYRHI LDFELIFTEH IVVGSQAMTI KVVQSRSGAV TLTLIEPDVS QVAGHIDEFL
KHHGGAGVQH MAFTAGDIVE AVGTIGARGV EFLSTPDAYY SLLPERMELG RYSVDELRRL
NILVDEDHDG QLYQIFARSV HPRNTFFLEL IERLGARSFG SGNISALYQA VELQQSREEA
AA