Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4694 |
Symbol | |
ID | 8667988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5218651 |
End bp | 5219862 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_003340285 |
Protein GI | 271966089 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.275408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG ACTACGCGGT GTTTGTCGGC TTAGACGTGG GCAAGGGCGA ACAGCACGCC TGCGCCCTGG ACCCTGGCGG CAAGAAACTC CACGACAAGC CCCTGCCCAA CGACGAACAG CGCCTGCGCG CGCTGTTCGG CAAGCTCAAC ACCCACGGGC CAGTGCTCGT CGTGGTCGAC CAGCCCGCCT CCATCGGCGC ACTGCCCGTC GCCGTGGCTC GCGCCGAAGG CTGCCAGGTC GCCTACCTGC CCGGCCTGAC CATGCGCCGC CTGGCCGACC TGCACCCCGG CAACGCCAAG ACCGACGCCC GCGACGCCTT CATCATCGCC GACGCCGCTC GCACCTTGCC GCACACACTT CGACGGGTCG ACGCCGGCGA CGAAGCCCTC GCCGAGCTGG AGGTTCTCGT CGGATTCGAC GATGACCTGG CCGCCGAGGC GACCCGGGTG ACCAACCGGA TCCGCGGACT GCTCGTCACC ATCCACCCGG CGCTGGAGCG GGCGCTCGGG CCGCGCCTGC ACCATCCCGC CGCCCTGGAG CTCCTGGCAC GCTTCGGCGG CCCGAACGGC CTGCGAGACG CCGGACGCGA GCAGCTCCTG ACAGTCGCCC GGCCACTGGC TCCGCGCATG GCGGGCCGTA TGGTCGACGA CGTCTGGGCC GCGCTCGAAG CCCAGACAGT CCTCGTCCCC GGAACCAGCG CGGCCGAGAC CGTCCTGCCC CGCCTGTCAC AATCGCTACG AAGCGTGCTC GACCAGCGCA AACAGGTCGC GGCCGAGGTA GAGGCGATGC TTGATGCCCA CCCTCTCGCC AAGGTCCTGA TCACCATGCC CGGGCTCGGG ATCAGGACCA CCGCACGGCT CCTGCTGGAG ATCGGCGACA TCTCCGCCTT CGCCACCCCC GGGCACCTCG CCGCCTACGC CGGGCTCGCC CCGGTGACCC GCCGCTCCGG TTCGTCGATC AAGGGTGAAC ACCCGCCCAA GGGCGGCAAC AAGGCACTGA AACGGGCCAT GTTCCTCGCC GCGTTCGCAT CTCTGTCCGA CCCCGAAAGC AGGGAGTACT ACGACAAGAA GCGCGCCGAG GGCAAGAAGC ACAACGCCGC CCTGATCTGC CTCGCCCGTC GCCGTTCAGA CGTCATCTAC GCCATGCTCC GCGACCGCAA GCCCTACCAA CCCCGCCGGA AGAACCGCAC CCGAAAGCCC TCGGCCGCTT GA
|
Protein sequence | MSVDYAVFVG LDVGKGEQHA CALDPGGKKL HDKPLPNDEQ RLRALFGKLN THGPVLVVVD QPASIGALPV AVARAEGCQV AYLPGLTMRR LADLHPGNAK TDARDAFIIA DAARTLPHTL RRVDAGDEAL AELEVLVGFD DDLAAEATRV TNRIRGLLVT IHPALERALG PRLHHPAALE LLARFGGPNG LRDAGREQLL TVARPLAPRM AGRMVDDVWA ALEAQTVLVP GTSAAETVLP RLSQSLRSVL DQRKQVAAEV EAMLDAHPLA KVLITMPGLG IRTTARLLLE IGDISAFATP GHLAAYAGLA PVTRRSGSSI KGEHPPKGGN KALKRAMFLA AFASLSDPES REYYDKKRAE GKKHNAALIC LARRRSDVIY AMLRDRKPYQ PRRKNRTRKP SAA
|
| |