Gene Sros_4723 details

Gene Information

Locus tag: Sros_4723
Symbol:
ID: 8668017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5247235
End bp: 5250462
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: transposase, Tn3 family
Protein accession: YP_003340310
Protein GI: 271966114
COG category:
COG ID:
TIGRFAM ID:
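
The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS length includes the terminating stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(5247235, 5250462)
print(length)                     # 3228
print(protein_length_aa(length))  # 1075
```

Both values match the Gene Length and Protein Length fields reported for Sros_4723.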


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCGGCT GGCACACCGG ATGCGGCTGC TGCTGGGGTT CACGGAAAGC CGCCGGTTGC 
TCGTCGCCGC TTGCTACGGT CCGGCTCGTA CCGGTGGAGT TTCTGACTGA TGAGCAGGCC
GAGGCGTACG GGACGTTCAC CGCGGTGCCC ACGCGTCCGG AGCTGGAGCG GTTCTTCTTC
CTCGACGATG ACGACTGCGA TCTGATCGCG CGGCGTCGTA CGGATGCGCA CCGTCTGGGT
ATGGCGGTGC AGATCTGCAC GGTTCGATAC ATCGGCCGGT TCCTGGGCGA GGATCCGCTC
GCGGTGCCGT GGGAGGTCGT GGAGTACCTC GCCGGGCAGC TCGGCATCGA GGACTCCTCG
TGCGTGAAGC GGTACCCCGA GCGCAGGTCG ACGGTGTACG AGCACGCGGA TGAGATCCAG
GAGCGGTTCA AGTACCGGGA CTTTACCGAC CGGAAGTGGG GCCGGGAGTT CCGGAGCTTC
CTGTACGGGC GGGCGTGGAC GCACGCCGAG GGGCCGGTGG CGCTGTTCAA CCACGCGGTG
ACGTGGCTGC GCAAGAACAG GGTGCTGCTG CCCGGGGTCT CGGTGCTGGC CCGGCAGGTG
TCGGAGGCCC GTACGGCGGC CGAGCGGCGC CTGTACGACG CGGTGACCCG TGCCGCGCAC
CGGGCCGATC CCGCGCTCGC GCCAGCGCTG GCCGGGCTGC TCGACGTGCC GGAGGGCAAG
CGGGTCTCGG AGCTGGAGCG GCTGCGTACG CCACCGACGA AGACGACGGG GACCGCGATG
GTGCGGGCCA TGGAGCGAGT GGAGGAGATC TCCGCGTTCG CGCTGGGCCG GGTGAACCTC
TCCAGGGTGC CGGTGAACCG GCTCTCGGCG CTGGCCCGCT ACGGGCAGTT GAGCAAGGCC
CAGACGATCG AGCGGGCGCC GGAACCGCGG CGCACCGCGC TGCTGACGGC GGTGGTGCGC
CAGCTCGAGG CCCAGGCGGT CGACGACGCG CTCGACCTGT TCACGGTGCT GATGGCAAAC
CGGCTGATCA GCCCTGCCCG CAGGGCGTCG GACCGCGACC GGCTGGCGAT GCTACCGCAG
TTGGAGAAGG CCGCACGGAT CCTGGCGAAG GCGTCGAAGA TCCTCACCAA GGAGCTGGAC
CTGGTCGCCG AGCATGACGC GGACCTGGAC GTGGCCGCGC TGTGGGCGGC CGTGGAGGAA
GCGGTGCCGC GTACGGCGGT CTCCTCGGCG GTCGCGACCG TGGAAGCCCT GGTGCCGGAG
GACGACGGGT CGGCCGAGGC GGCGATGCGC GAGAAGCTGG CTCTGCGCTA CAACACCGTG
CGCCCCTTCC TGTCCCTGCT GGGCGAGTCG GACGCGCTCG GTGCGGCCCC TGCCGGGCGG
CGCCTGCTCA AGGCGGTGCG ACGGCTTCCC GCGCTCTCGC GCCGAAGGGT CAAGGACCGG
CTGCTGCTGC CCCGTGAGGT CGACGCCGAG CTGGTGCCCG CGATGTGGAA GCGGGCAGTG
TTCTCCAACG CCAAGTTGCC CCAGGGCTCC GTGGACCGGG ATGCGTACGT GGTGTGTGTG
CTGGAGCAGC TGCACCGTGC GCTGAACCGG CGCGACGTGT TCGCCTCGCC GTCGAACCGG
TGGGCCGACC CGCGCGCGAG ACTGCTGGAC GGGGCCCGCT GGGAGGCGAT GCGCCCGGAC
GTGCTCGCGG GCCTGTCATT GACCGAGGAC GCGGGCGAGC ATCTGGCGCA GCTCACACGG
GCGTTGGATG CGGCCTGGCG GCAGATGGCC GACCGCATGA AAGAGGCCGG CGACGACGCG
AAGGTGGAGA TTGTCGTTCC CGAGGGTGGG GGCCGCGCCA CGCTGTCGGT GGACAAGCTC
GGCGCGGTGG GCGAGCCGGA GTTGCTGACC TGGCTGAAGA ACACCACGGA GGCGATGCTC
CCCAGGATCG ACCTACCCGA CCTGCTCTTC GAGGTCCACT CCTGGACCGG GTTCCTCGAC
GCCTTCGGCC ACGTCTCCGA CCGGCGCACC CGTATGGAGG GCCTGCTGGT CTCCCTGGTC
GCGCTGCTGG TGGCGCAGAG CTGCAACATC GGCCTGACCC CGGTCATCGA CCCGAACAAC
AAGGCCCTGA CCCGCTCGCG CCTGTCTCAC GTCGACCAGA ACTACGTACG GGCCGACACC
ATCGCCGCGG CGAACGCCGC GCTCATCACC GCTCAGTCCT CCATCGAGCT GGCCCAGATG
TGGGGCGGCG GGCTGCTCGC CTCCGTCGAC GGCCTGCGCT TCGTCGTCCC CGTCAAGAGC
ATCAACACTG GCCCCTCACC CAAGTATTAC GGCTACAAAC GGGGCGTGAC CTGGCTCAAC
GCAGTAAACG ACCAGGTCGC CGGGATCGGC GCGATGGTCG TACGCGGCAC CCCGCGCGAC
AGCCTCTACA CCCTGGACAC TCTGCTGAAC CTCGACGGCG GGGTGAAGCC GGAGATGGTC
GCCACCGACA ACGCCTCCTA CTCCGACATG GCGTTCGGCC TCTACAAGAT GCTCGGCTTC
CGCTTCGCCC CGCGCTTCCG TGACCTGAAC GACCAGCGGT TCTGGCGCGC GGACCTGCCC
GACGGCGACG AACCGTCGGG ATACGGGCCG CTGGACGAAG TGGCCTGCAA CAAGGTCAAC
CTCAAGCGGA TCGTCACGCA GTGGCCGGAC ATGCTCCGCG TCGCCGGGTC GCTGATCACC
AACCAGGTGC GCGCGTACGA CCTGCTGCGG ATGTTCGGCT GCGAGGGCCA CCCGACCCCG
CTGGGGGCCG CGTTCGCCGA GTACGGCCGG ATCGACAAGA CCATGCACCT GCTCGCCGTG
GTCGACCCGG TCGACGACAC CTACCGGCGG CTGATGAATC GGCAGCTCAC CGTGCAGGAG
TCCCGCCACC GCCTGGCCCG TGCAATCTGC CACGGCGGCC GGGGCCAGAT CCGCCAGGCG
TACCGCGAGG GCCAGGAGGA CCAGCTCGCC GCGCTCGGCC TGGTCCTCAA CGCCGTCGTC
CTGTGGAACA CCCGCTACCT GGATGCCGCC GTCGCCCAAC TCCGCGCCGA GGGCCACGAC
ATCAAGGACG AGGACGTCGC CCGCCTCTCC CCGCTCAAGG ACCGGCACAT CAACTTCCTG
GGCCGCTACC TGTTCAACAT CAAGGCCAGC AGCCTTGGCC GGGAACTGCG CCCGCTGCGC
GACCCGGATG CGCCCGAGCT GGACGAGGAC GACGGGGCCC TGGACTGA
 
Protein sequence
MVGWHTGCGC CWGSRKAAGC SSPLATVRLV PVEFLTDEQA EAYGTFTAVP TRPELERFFF 
LDDDDCDLIA RRRTDAHRLG MAVQICTVRY IGRFLGEDPL AVPWEVVEYL AGQLGIEDSS
CVKRYPERRS TVYEHADEIQ ERFKYRDFTD RKWGREFRSF LYGRAWTHAE GPVALFNHAV
TWLRKNRVLL PGVSVLARQV SEARTAAERR LYDAVTRAAH RADPALAPAL AGLLDVPEGK
RVSELERLRT PPTKTTGTAM VRAMERVEEI SAFALGRVNL SRVPVNRLSA LARYGQLSKA
QTIERAPEPR RTALLTAVVR QLEAQAVDDA LDLFTVLMAN RLISPARRAS DRDRLAMLPQ
LEKAARILAK ASKILTKELD LVAEHDADLD VAALWAAVEE AVPRTAVSSA VATVEALVPE
DDGSAEAAMR EKLALRYNTV RPFLSLLGES DALGAAPAGR RLLKAVRRLP ALSRRRVKDR
LLLPREVDAE LVPAMWKRAV FSNAKLPQGS VDRDAYVVCV LEQLHRALNR RDVFASPSNR
WADPRARLLD GARWEAMRPD VLAGLSLTED AGEHLAQLTR ALDAAWRQMA DRMKEAGDDA
KVEIVVPEGG GRATLSVDKL GAVGEPELLT WLKNTTEAML PRIDLPDLLF EVHSWTGFLD
AFGHVSDRRT RMEGLLVSLV ALLVAQSCNI GLTPVIDPNN KALTRSRLSH VDQNYVRADT
IAAANAALIT AQSSIELAQM WGGGLLASVD GLRFVVPVKS INTGPSPKYY GYKRGVTWLN
AVNDQVAGIG AMVVRGTPRD SLYTLDTLLN LDGGVKPEMV ATDNASYSDM AFGLYKMLGF
RFAPRFRDLN DQRFWRADLP DGDEPSGYGP LDEVACNKVN LKRIVTQWPD MLRVAGSLIT
NQVRAYDLLR MFGCEGHPTP LGAAFAEYGR IDKTMHLLAV VDPVDDTYRR LMNRQLTVQE
SRHRLARAIC HGGRGQIRQA YREGQEDQLA ALGLVLNAVV LWNTRYLDAA VAQLRAEGHD
IKDEDVARLS PLKDRHINFL GRYLFNIKAS SLGRELRPLR DPDAPELDED DGALD
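
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that, compute GC content, and translate a coding sequence. The helper names are hypothetical. The codon table is built from the amino-acid string of the standard genetic code, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; trailing partial codon is ignored.
print(translate("ATGGTCGGCT GGCACACCGG"))  # MVGWHT
```

The printed residues agree with the start of the protein sequence (MVGWHT...). Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would extend the same check to the whole record.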