Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4723 |
Symbol | |
ID | 8668017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5247235 |
End bp | 5250462 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transposase, Tn3 family |
Protein accession | YP_003340310 |
Protein GI | 271966114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGCT GGCACACCGG ATGCGGCTGC TGCTGGGGTT CACGGAAAGC CGCCGGTTGC TCGTCGCCGC TTGCTACGGT CCGGCTCGTA CCGGTGGAGT TTCTGACTGA TGAGCAGGCC GAGGCGTACG GGACGTTCAC CGCGGTGCCC ACGCGTCCGG AGCTGGAGCG GTTCTTCTTC CTCGACGATG ACGACTGCGA TCTGATCGCG CGGCGTCGTA CGGATGCGCA CCGTCTGGGT ATGGCGGTGC AGATCTGCAC GGTTCGATAC ATCGGCCGGT TCCTGGGCGA GGATCCGCTC GCGGTGCCGT GGGAGGTCGT GGAGTACCTC GCCGGGCAGC TCGGCATCGA GGACTCCTCG TGCGTGAAGC GGTACCCCGA GCGCAGGTCG ACGGTGTACG AGCACGCGGA TGAGATCCAG GAGCGGTTCA AGTACCGGGA CTTTACCGAC CGGAAGTGGG GCCGGGAGTT CCGGAGCTTC CTGTACGGGC GGGCGTGGAC GCACGCCGAG GGGCCGGTGG CGCTGTTCAA CCACGCGGTG ACGTGGCTGC GCAAGAACAG GGTGCTGCTG CCCGGGGTCT CGGTGCTGGC CCGGCAGGTG TCGGAGGCCC GTACGGCGGC CGAGCGGCGC CTGTACGACG CGGTGACCCG TGCCGCGCAC CGGGCCGATC CCGCGCTCGC GCCAGCGCTG GCCGGGCTGC TCGACGTGCC GGAGGGCAAG CGGGTCTCGG AGCTGGAGCG GCTGCGTACG CCACCGACGA AGACGACGGG GACCGCGATG GTGCGGGCCA TGGAGCGAGT GGAGGAGATC TCCGCGTTCG CGCTGGGCCG GGTGAACCTC TCCAGGGTGC CGGTGAACCG GCTCTCGGCG CTGGCCCGCT ACGGGCAGTT GAGCAAGGCC CAGACGATCG AGCGGGCGCC GGAACCGCGG CGCACCGCGC TGCTGACGGC GGTGGTGCGC CAGCTCGAGG CCCAGGCGGT CGACGACGCG CTCGACCTGT TCACGGTGCT GATGGCAAAC CGGCTGATCA GCCCTGCCCG CAGGGCGTCG GACCGCGACC GGCTGGCGAT GCTACCGCAG TTGGAGAAGG CCGCACGGAT CCTGGCGAAG GCGTCGAAGA TCCTCACCAA GGAGCTGGAC CTGGTCGCCG AGCATGACGC GGACCTGGAC GTGGCCGCGC TGTGGGCGGC CGTGGAGGAA GCGGTGCCGC GTACGGCGGT CTCCTCGGCG GTCGCGACCG TGGAAGCCCT GGTGCCGGAG GACGACGGGT CGGCCGAGGC GGCGATGCGC GAGAAGCTGG CTCTGCGCTA CAACACCGTG CGCCCCTTCC TGTCCCTGCT GGGCGAGTCG GACGCGCTCG GTGCGGCCCC TGCCGGGCGG CGCCTGCTCA AGGCGGTGCG ACGGCTTCCC GCGCTCTCGC GCCGAAGGGT CAAGGACCGG CTGCTGCTGC CCCGTGAGGT CGACGCCGAG CTGGTGCCCG CGATGTGGAA GCGGGCAGTG TTCTCCAACG CCAAGTTGCC CCAGGGCTCC GTGGACCGGG ATGCGTACGT GGTGTGTGTG CTGGAGCAGC TGCACCGTGC GCTGAACCGG CGCGACGTGT TCGCCTCGCC GTCGAACCGG TGGGCCGACC CGCGCGCGAG ACTGCTGGAC GGGGCCCGCT GGGAGGCGAT GCGCCCGGAC GTGCTCGCGG GCCTGTCATT GACCGAGGAC GCGGGCGAGC ATCTGGCGCA GCTCACACGG GCGTTGGATG CGGCCTGGCG GCAGATGGCC GACCGCATGA AAGAGGCCGG CGACGACGCG AAGGTGGAGA TTGTCGTTCC CGAGGGTGGG GGCCGCGCCA CGCTGTCGGT GGACAAGCTC GGCGCGGTGG GCGAGCCGGA GTTGCTGACC TGGCTGAAGA ACACCACGGA GGCGATGCTC CCCAGGATCG ACCTACCCGA CCTGCTCTTC GAGGTCCACT CCTGGACCGG GTTCCTCGAC GCCTTCGGCC ACGTCTCCGA CCGGCGCACC CGTATGGAGG GCCTGCTGGT CTCCCTGGTC GCGCTGCTGG TGGCGCAGAG CTGCAACATC GGCCTGACCC CGGTCATCGA CCCGAACAAC AAGGCCCTGA CCCGCTCGCG CCTGTCTCAC GTCGACCAGA ACTACGTACG GGCCGACACC ATCGCCGCGG CGAACGCCGC GCTCATCACC GCTCAGTCCT CCATCGAGCT GGCCCAGATG TGGGGCGGCG GGCTGCTCGC CTCCGTCGAC GGCCTGCGCT TCGTCGTCCC CGTCAAGAGC ATCAACACTG GCCCCTCACC CAAGTATTAC GGCTACAAAC GGGGCGTGAC CTGGCTCAAC GCAGTAAACG ACCAGGTCGC CGGGATCGGC GCGATGGTCG TACGCGGCAC CCCGCGCGAC AGCCTCTACA CCCTGGACAC TCTGCTGAAC CTCGACGGCG GGGTGAAGCC GGAGATGGTC GCCACCGACA ACGCCTCCTA CTCCGACATG GCGTTCGGCC TCTACAAGAT GCTCGGCTTC CGCTTCGCCC CGCGCTTCCG TGACCTGAAC GACCAGCGGT TCTGGCGCGC GGACCTGCCC GACGGCGACG AACCGTCGGG ATACGGGCCG CTGGACGAAG TGGCCTGCAA CAAGGTCAAC CTCAAGCGGA TCGTCACGCA GTGGCCGGAC ATGCTCCGCG TCGCCGGGTC GCTGATCACC AACCAGGTGC GCGCGTACGA CCTGCTGCGG ATGTTCGGCT GCGAGGGCCA CCCGACCCCG CTGGGGGCCG CGTTCGCCGA GTACGGCCGG ATCGACAAGA CCATGCACCT GCTCGCCGTG GTCGACCCGG TCGACGACAC CTACCGGCGG CTGATGAATC GGCAGCTCAC CGTGCAGGAG TCCCGCCACC GCCTGGCCCG TGCAATCTGC CACGGCGGCC GGGGCCAGAT CCGCCAGGCG TACCGCGAGG GCCAGGAGGA CCAGCTCGCC GCGCTCGGCC TGGTCCTCAA CGCCGTCGTC CTGTGGAACA CCCGCTACCT GGATGCCGCC GTCGCCCAAC TCCGCGCCGA GGGCCACGAC ATCAAGGACG AGGACGTCGC CCGCCTCTCC CCGCTCAAGG ACCGGCACAT CAACTTCCTG GGCCGCTACC TGTTCAACAT CAAGGCCAGC AGCCTTGGCC GGGAACTGCG CCCGCTGCGC GACCCGGATG CGCCCGAGCT GGACGAGGAC GACGGGGCCC TGGACTGA
|
Protein sequence | MVGWHTGCGC CWGSRKAAGC SSPLATVRLV PVEFLTDEQA EAYGTFTAVP TRPELERFFF LDDDDCDLIA RRRTDAHRLG MAVQICTVRY IGRFLGEDPL AVPWEVVEYL AGQLGIEDSS CVKRYPERRS TVYEHADEIQ ERFKYRDFTD RKWGREFRSF LYGRAWTHAE GPVALFNHAV TWLRKNRVLL PGVSVLARQV SEARTAAERR LYDAVTRAAH RADPALAPAL AGLLDVPEGK RVSELERLRT PPTKTTGTAM VRAMERVEEI SAFALGRVNL SRVPVNRLSA LARYGQLSKA QTIERAPEPR RTALLTAVVR QLEAQAVDDA LDLFTVLMAN RLISPARRAS DRDRLAMLPQ LEKAARILAK ASKILTKELD LVAEHDADLD VAALWAAVEE AVPRTAVSSA VATVEALVPE DDGSAEAAMR EKLALRYNTV RPFLSLLGES DALGAAPAGR RLLKAVRRLP ALSRRRVKDR LLLPREVDAE LVPAMWKRAV FSNAKLPQGS VDRDAYVVCV LEQLHRALNR RDVFASPSNR WADPRARLLD GARWEAMRPD VLAGLSLTED AGEHLAQLTR ALDAAWRQMA DRMKEAGDDA KVEIVVPEGG GRATLSVDKL GAVGEPELLT WLKNTTEAML PRIDLPDLLF EVHSWTGFLD AFGHVSDRRT RMEGLLVSLV ALLVAQSCNI GLTPVIDPNN KALTRSRLSH VDQNYVRADT IAAANAALIT AQSSIELAQM WGGGLLASVD GLRFVVPVKS INTGPSPKYY GYKRGVTWLN AVNDQVAGIG AMVVRGTPRD SLYTLDTLLN LDGGVKPEMV ATDNASYSDM AFGLYKMLGF RFAPRFRDLN DQRFWRADLP DGDEPSGYGP LDEVACNKVN LKRIVTQWPD MLRVAGSLIT NQVRAYDLLR MFGCEGHPTP LGAAFAEYGR IDKTMHLLAV VDPVDDTYRR LMNRQLTVQE SRHRLARAIC HGGRGQIRQA YREGQEDQLA ALGLVLNAVV LWNTRYLDAA VAQLRAEGHD IKDEDVARLS PLKDRHINFL GRYLFNIKAS SLGRELRPLR DPDAPELDED DGALD
|
| |