Gene Information |
Locus tag | Sros_8544 |
Symbol | |
ID | 8671878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 9428221 |
End bp | 9429366 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_003343929 |
Protein GI | 271969733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCACAACC AGTCCATCGA AACAGAAGGT GTCCGCATGA GTGACATCTT TCCCGTGAAC GGCATGGACG CCGTGGTCTT CGCCGTGGGC AACGCCAAGC AGGCCGCCCA CTACTACTCC AGCGCCTTCG GCATGCGGCT GGTGGCCTAC CGCGGCCCGG AGAACGGCAG CCGTGACGAG GTCGCCTATG TCCTCACCTC CGGCAGCGCC CGGTTCGAGT TCCGCGGCGC GGTGCGGCCG GGCACCGGCG TCGCCCGCCA CGTGGCCGAG CACGGCGACG GCGTGATCGA CCTGGCCATC GAGGTCCCGG ACGTGGAGGC GGCCTACACC GGCGCGCTGG CCCGGGGCGC CAAGGGCCTC GCCGAGCCGT ACACGATGGA GGACGAGCAC GGCAAGGTCG TGCTCGCGGC CATCGCCACC TACGGCGAGA CCCGGCACAC GCTGGTCGAC CGGTCCAACT ACAGCGGCCC CTACCTGCCC GGCTACGCCG CGGCCGAGCC CCTCGTGCCG GCCCCCGACG TGCGGAACGG CCGCCTGTTC CAGGCCGTCG ACCACTGCGT CGGCAACGTG GAACTCGGCA AGATGGACGA GTGGGTGGAG TTCTACGGGC GGGTCATGGG CTTCACCAAC ATGGCGGAGT TCATCGGCGA CGACATCGCC ACCGAATACT CGGCGCTGAT GTCCAAGGTC GTCGCGGACG GCACCCGCAA GGTGAAGTTC CCGCTCAACG AGCCGGCGAT CTCCAAGAAG AAGTCGCAGA TCGACGAGTA CCTGGAGTTC TACGGCGGAC CCGGCGTGCA GCACATCGCG CTGGCCACCA ACGACATCCT CAGCACCGTC GACCACATGC GCGCGGCCGG GGTGCGGTTC CTCGACACGC CCGACTCCTA CTACGACGAC CCGGAGCTGC GCGCCCGCAT CGGCGAGGTC CGCGCGCCGA TCGAGGAGCT GAAGAAGCGC AAGATCCTGG TCGACCGCGA CGAGGACGGC TACCTGCTGC AGATCTTCAC CAAGCCGGTG CAGGACCGCC CGACGGTGTT CTTCGAGCTC ATCGAGCGGC ACGGCTCGCT GGGCTTCGGC AAGGGCAACT TCAAGGCGCT CTTCGAGGCG ATCGAGCGCG AGCAGGACCT CAGGGGCAAC CTGTAA
|
Protein sequence | MHNQSIETEG VRMSDIFPVN GMDAVVFAVG NAKQAAHYYS SAFGMRLVAY RGPENGSRDE VAYVLTSGSA RFEFRGAVRP GTGVARHVAE HGDGVIDLAI EVPDVEAAYT GALARGAKGL AEPYTMEDEH GKVVLAAIAT YGETRHTLVD RSNYSGPYLP GYAAAEPLVP APDVRNGRLF QAVDHCVGNV ELGKMDEWVE FYGRVMGFTN MAEFIGDDIA TEYSALMSKV VADGTRKVKF PLNEPAISKK KSQIDEYLEF YGGPGVQHIA LATNDILSTV DHMRAAGVRF LDTPDSYYDD PELRARIGEV RAPIEELKKR KILVDRDEDG YLLQIFTKPV QDRPTVFFEL IERHGSLGFG KGNFKALFEA IEREQDLRGN L
|
| |
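The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the listed sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_content` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table of this record.
START_BP = 9428221
END_BP = 9429366
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


def gc_content(seq: str) -> float:
    """Fraction of G and C bases; spaces grouping the bases into blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence string from the Sequence table to `gc_content` gives a fraction that can be compared with the GC content field. Note that the gene begins with TTG, which translation table 11 permits as an alternative initiation codon read as methionine, consistent with the protein sequence starting with M.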