Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4052 |
Symbol | |
ID | 8667346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4511354 |
End bp | 4512613 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003339703 |
Protein GI | 271965507 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0134377 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGAT TCCTGCGGGC GGTCGCGTCG GCGACCGCCG CGACGGCGCT CGCCGCGTCG GGATGCGGCA GCAGCGCCGG CTCCCCGGCT CCGCCGGCCG CTCCCACTTC CTCGCCCGCG AGCCCCACTT CCTCCGCCGC GAGCCCTGCG GCCTCTCCCA CGACCACGCC GGCCTCTCCG CCGCCGGCCT CCCCGTCCCG GTCGGCACCC AGCGGCAGGC CCCCCGTCAC CCCCGAGGAA CGGGCGTCCG CCCTGATCCG CCCCTCCGTC ATGTACGTGG AGGTGGAGTG GAGGGGATAC CTCCGGAACC TGTCGTCCGG GCAGTTCTGG TCGGCAGAGC CCATCAGCTT CCTCGCCGCC TGCACCGGCT TCACCGTGAA CCCGGACGGG CACGTCGTCA CCGCCGGGCA CTGCGTCGAC CCCGGCATCG AGGGCGCTTC ACGGGCGTTC TTCGACGAGG TGATACGCAA GGAGCAGGAG AAGGGCAACA AGAAGACCAA GGAACAGCTG GACAAGGAGT TCACCGGCAA CTGGGTCGTC GAGGGCGAGG TCGCGGGCAC GCCGCCGGAC ATGATCGTCC GCGTCTTCCA GAGCGTCGCC CCCATGGTGA GCCCGACCGA CGTGTTCGCC TCTCCAGGTC CCATGGGCGC CCTGCCGAAG TTCGCCCGCG TGGTCTCGTT CGAGCCCACG AGCAAGGGCG ACGTCGCGCT GCTGAAGGTC GACGGTGCGG ACCTGCCGAG CGTCGAGCTC GCCCCGGAGA GCACGGTCGA GATAGGGACG CCCGTTCTCG CGATCGGTTA TCCCGGGTTC GAGCGCCCGA TGGAGGACCG CGTGCTGGAG CCGACCAACA AGGACGGAAA GGTGAGCGCG AAAATCACGA GTAAGGGGGT GCCTCTCTAC GAGATAAGCG CAGCCATATC GCAGGGCATG AGCGGGGGAC CCGCGGTCGA AACAGCCTCG GCGAAGGCCA TGGGGCTGCT CACCTTCAAA ATCGACAACG AGCAGCAATT CAACTTCGTG GCCCCCTCCT CCCTCATCCG GGAGATCCTC GCCCGCAACG GCGTGAAAAA CGAGCAGGGC GCCCTGGACC GGCTGTACCG CGCCGCCCTG GAGGATTACT GGGCCGGCGA CTATCCCGCG GCGATGAGGG GATTCGACGC GGTCCTCGTC CGGATGCCCT CGCATCGGCA GGCATGGGAG TACAGGGCCA AGGCGGCCGA GCGGATGGCG GCCCGGCCCG GCCGCCCCGC TCCCTCATGA
|
Protein sequence | MGGFLRAVAS ATAATALAAS GCGSSAGSPA PPAAPTSSPA SPTSSAASPA ASPTTTPASP PPASPSRSAP SGRPPVTPEE RASALIRPSV MYVEVEWRGY LRNLSSGQFW SAEPISFLAA CTGFTVNPDG HVVTAGHCVD PGIEGASRAF FDEVIRKEQE KGNKKTKEQL DKEFTGNWVV EGEVAGTPPD MIVRVFQSVA PMVSPTDVFA SPGPMGALPK FARVVSFEPT SKGDVALLKV DGADLPSVEL APESTVEIGT PVLAIGYPGF ERPMEDRVLE PTNKDGKVSA KITSKGVPLY EISAAISQGM SGGPAVETAS AKAMGLLTFK IDNEQQFNFV APSSLIREIL ARNGVKNEQG ALDRLYRAAL EDYWAGDYPA AMRGFDAVLV RMPSHRQAWE YRAKAAERMA ARPGRPAPS
|
| |