Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4680 |
Symbol | |
ID | 8667974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5203475 |
End bp | 5204515 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | integrase family protein |
Protein accession | YP_003340274 |
Protein GI | 271966078 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.163759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACA ACGGTGAGGT AGCTGTCCGT CGGCAGTCAG GGCTACCGGA GGCCAGGGGC CTGGATCGGG GGTTGACCGA GGAGGCCGCC CTACTGGTGG AGCGCGGACT GGCCACCAAC ACCCGCCTGG CCTACGCCCG CGACTGGGCG ACGTACGGCG CGTGGTGCGA CGAGAGCGGG CACGCTCTGC TGCCCGCCAC CGCCGAGACC CTGGCAAACT ACGTCGCCCA TCTGGCCTCG CACGCCTACG CGCCGGCCTC CATCGACCGG GCGCTGGCCT GCATCCTGGC CGCCCACGAT CACGCCCAGC TCGGCAAGCC CGCCACCAAG CAGGCGCGGC TGGCCCTGCG CGCCTACCGG CGCGAGCGCG CCCACCAGGG ACAGCGCACC CGCAAGTCGC CACCCATCAC GATCGACCGC CTCCGCGCCA TGATCTCTGC ACTCCCCTCC ACCAGCACAA CGGGCCTACG TGATCGCGCG GTGCTGGTGC TCGGTTTTGC CCTGATGGGA CGCCGTTCCG AACTCGTCGC CTGCGACATC GGCGATCTCA CCTTCACCGT CGACGGCCTC GAGGTCTACA TCCCCACCAG CAAGACCGAC CAGGACGCCC ACGGAGAAAC CGTCGCACTC CCCCACGGAT CCCATCCCGA GACCTGCCCG GTGCGTGTTC TCAAAGCATG GCTGGCCGTA CTCGCCGAAC GCGGCGTCAC CTCCGGCGCC CTGCTGCGCC CGGTCGATCG CCACGGCGGC GTCGGCGGCG CCGCCAAGAG TGCGGGCCGC GGCAACCGTC AGCGCCTCAG TGGCCAGACC ATCAACCTCA TCGTCAAGAA CGCCGCCGCG CTGGCCGGCC TGGACAGGCC GGAGACCTAC ACCGCGCACG GGCTGCGAGC CGGCGGCGCC ACCTCCGCCG CCAAAGCCGG CGCGCCCATG TCGGCCATCA CCACGCACGG CCGCTGGGCC GACGGCTCCC CGGTGGTGGC CGGCTACATC CGCCAGGCCG ACAAGTGGAA CGACAACCCG ATGCACGGAG TCGGCCTGTA G
|
Protein sequence | MINNGEVAVR RQSGLPEARG LDRGLTEEAA LLVERGLATN TRLAYARDWA TYGAWCDESG HALLPATAET LANYVAHLAS HAYAPASIDR ALACILAAHD HAQLGKPATK QARLALRAYR RERAHQGQRT RKSPPITIDR LRAMISALPS TSTTGLRDRA VLVLGFALMG RRSELVACDI GDLTFTVDGL EVYIPTSKTD QDAHGETVAL PHGSHPETCP VRVLKAWLAV LAERGVTSGA LLRPVDRHGG VGGAAKSAGR GNRQRLSGQT INLIVKNAAA LAGLDRPETY TAHGLRAGGA TSAAKAGAPM SAITTHGRWA DGSPVVAGYI RQADKWNDNP MHGVGL
|
| |