Gene Information |
Locus tag | Sros_4455 |
Symbol | |
ID | 8667749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4968359 |
End bp | 4969378 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | integrase family protein |
Protein accession | YP_003340068 |
Protein GI | 271965872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000349863 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00111504 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCGGCG GTGATCGGCC GGTGCCGGTG GGTCTGTCTG AGGTGGTCGC GGAGTTCCTG TCTGCCTTGC GGACCAAGAA GCTCTCTTCG CACACCGTTG CCGGTTACAG GCGTGATCTG GAGTTGGTGG CGGGGCTGGC CGCGGCTGCG GCCGGGGTGG GGGTGGCGGA TCTGGAGGTC CGGTCGCTGA GTGGCCGGCT GGTCCGTACG GCGTTCGCGG AGTTCTCCGG GCCTCGTTCG CCGGCCAGTG TCAACCGGGC CTGGAGTGCC TGGAATCAGT TCTTGTCCTT CTGTGTGGCC GAGGGGTTGC TGGAGGGCAA TCCGATGGCG GCGGTGCCCC GGCCCAAGCA GCCGGCCAAG GCTCCCAAGC CTCTTTTGGG TGATGGGAGT GCTTCGGCGA CGTTGCTGGA GCGTATCGCC GCGGGGGCGC GCAGGGCCCG TGATCCGTGG GTGGAGCGTG ATCTGGTGGT TTTGGCGCTG GCGCTGGTGA CCGGGATGCG TTCGGCGGAG TTGCTGGGTC TGACGTTGGG GTCGATCGGT GGTTCGCCTG GTGATCGGCG TATCCAGGTG GTGGGTAAGG GGGGCAGGAG CCGCTCGCTT CCGATCGAGG CTCCGGTGGA GCGTCTTGTG GAGCGTTATC TGCACAGCCG TATGGTCCGT TTCGGGTTGA GCGCGCTCCC CCGTTCGGCG GCGTTGCTGG TCGACACCGG TAATGAGCCG TTGCGGCGGG GTGGTCTGCA GTATCTGGTC CGGCAGTGTT ATCGCCATGC CGGTGTGCAT GATCGGGTGC AGAGGGGCAC GCTGGTGCAT GCGTTGCGTC ATGAGTTCGC GACGCGGCTG GCCGAGCGGG GGGCTTCGGC GCATGAGTTG ATGGAGTTGC TGGGGCACTC CTCCATCGTC ACGGGGCAGG CTTACATCGC TTCGACCGCT CGTGAGGTGC GCCGGGCCGC GCAGGGTAAT CCCGCCTATG GCGTGCTGGA GCGGTTGGGG CCGGAGGGGG GCGGGGGTGC GGCGGGTTGA
|
Protein sequence | MSGGDRPVPV GLSEVVAEFL SALRTKKLSS HTVAGYRRDL ELVAGLAAAA AGVGVADLEV RSLSGRLVRT AFAEFSGPRS PASVNRAWSA WNQFLSFCVA EGLLEGNPMA AVPRPKQPAK APKPLLGDGS ASATLLERIA AGARRARDPW VERDLVVLAL ALVTGMRSAE LLGLTLGSIG GSPGDRRIQV VGKGGRSRSL PIEAPVERLV ERYLHSRMVR FGLSALPRSA ALLVDTGNEP LRRGGLQYLV RQCYRHAGVH DRVQRGTLVH ALRHEFATRL AERGASAHEL MELLGHSSIV TGQAYIASTA REVRRAAQGN PAYGVLERLG PEGGGGAAG
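The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the gene sequence above ends in TGA). The function names are illustrative and not part of any database API. The `gc_fraction` helper shows how a GC content figure could be computed from a sequence pasted with spaces, as in the record; it is not asserted here against the reported 68%.

```python
# Values copied from the record above.
START_BP = 4968359
END_BP = 4969378
PROTEIN_LENGTH_AA = 339


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons minus one for the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
protein = expected_protein_length(length)
print(length, protein)  # 1020 339
```

Both values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields. The first codon is GTG, an alternative start codon under translation table 11 that is read as methionine at the initiation position, which is consistent with the protein sequence beginning with M.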