Gene Sros_3430 details

Gene Information

Locus tag: Sros_3430
Symbol:
ID: 8666718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3771887
End bp: 3773974
Gene Length: 2088 bp
Protein Length: 695 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339110
Protein GI: 271964914
COG category:
COG ID:
TIGRFAM ID:
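
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3771887, 3773974)
print(length)                           # 2088, matches "Gene Length"
print(expected_protein_length(length))  # 695, matches "Protein Length"
```

Under these assumptions the record is internally consistent: 2088 bp is 696 codons, and one of those is the stop codon.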


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.112777
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.338432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAACC ACTACTGGCT CCGAGGCGCC CGGCGCCGCG ACCGCGACCG GTCCAGGGAC 
GGGCTCGACC TGCCGCCGAC GCTGGCGGTG ATCGACGCGC ACCGGCGGCT GCGGGGTCCC
TACACCGCCG CCGGCGCGCT GATCCGCTCG ATCGCCGCGG AGGCCCTGGC CCTCCGTCCC
GAGCTGGGCC CCGCCCACAA CATCGAGCTG CTGACCAGCA CGCCGGAGCT CGCCGGCGTC
GTGCCCTCGG CCTGGGCGAC GCTGGAGTGG AGCGTCGGGG AGAAGGAGCG CACGCGTTTC
TACTCCCGGC TGCACACCCT GAACGTCTCC AACGGCCTGG CCGAATTCCT CCGCGACTAT
CTCGCCGCGA TCGGCGGCGG GCCGCGCACC CTGGTCCTGG AGAACATCCA CCAGGCCGAC
CCGACCGACC AGGAGTTCGC CGCGGTGCTG CTGCGCCGCA GCGACCTCGG GCAGCTGACC
GTCGTGGTCG GCACCGGCCC CGACCCGGTC GCCGACCCGC AGGGCGAGAT CGCCGTCTCG
CTGGCGAGCG TCCTGGCGGC CCACGCCGAG CCCGTCGACT GCCCCGCCTC CCCCGCCTCT
CCCGCCTCCT CGGACGGGGA CGCGTGGGAC TACGTGGCCG GCGACGGCAC CACCGACGAC
CCGGAACTCC TCGCCGCCTA CGAACGGCTC CCGCGAGAGG AGCGCGCCCG CCTGCACGAC
GAGCGCGCGG CGCTGCTGAC CGGCCGGGGA GAGTTCTCCC TGCTGCTCGG CGCCGTCCCC
TACCACGCCG AACACGGCAG CGACCCCCGC GGGGCGGGGC TGGGCGCCAT CCGGAAGGCG
CTCCAGCACT GCAAGGACAT CGGTCTCTAC CAGGCGGCCG TCGAGCTGGG ACTGCGCGCC
CGCGCGATCG TGGACCGGCC GGCGCAGGAG GAGCTCTGGT GGTACTTCAC CAACTCCACC
AGCACCTGCA TGGCCTCGCT CGGCCGCGCC GACGAGGCGA ACGCGATCTA CGACGAGGCC
CGCGGGGTCA CCCAGGACCC GGTCGTGCAC ATGGACCTCG CCTACGGCAC CGCGATGCTC
TACGCCCGCC ACTACCCCGA GGAGCGCCGG GACTACCAGC AGGCCAGGGC CTGGATGAAC
CTGTCCGTCG CGATCGCGTC GCTGCTGGGC GACCGGAAGG AGCGCGCCTT CCACTCGGTC
TTCGCCAACA ACGGGCTGGC CCTGGTCGAG GTACGGCAGA AGAGCTCCGA GGTGGCCCTG
CGGCTGCTGG AGGACGGGAT GGCCCGGCTG GACCGGGAGC TCGGGCCGGA CGAGCACGCC
CTGCACCGCG CCGTGCTGCG CTACAACCGC GCCCAGGTGT TCGGCATGAC GGGCCGCCTG
GAGGAGGCGC TGGCCGACTA CGCCGTGGTG GTCGAGCTCG ACCCGGAGTT CCCCGAGCAC
CACTTCAACA TCGGCAACAT CCTGCGCCGC CTGGGCCGCA ACGAGGAGGC CGTCGCCGCC
TACGAGCGCG CGCTGCGGCT GTCCCCGCCG TTCCCCGAGG CCTACTACAA CCTCGCCGAC
GCCCGGCTCG AACTCGGCGA CGTGCCGGGG GCGGTGGCCG ACTTCGTCTA CACCATCGAG
CTGGACCCCG GCCACGTGGA CGCCCACGTG AACCTGGCCG GGCTGCTGCA CGAGCTGGAC
GACGCCGAGG CCGCCTGGCG TGTCACGACC GCCGGGCTCG CCCTCGCCCC GGACAACGCC
CACCTGCTCT GCCTGAAGGG GAAGCTGCTC GCCGAGCGGG GCGACGCCGA CGCCGCCCGC
GACGCCCTGT CTGCGGCGCT CCGGCGCGAC GACGCCCTGG CCGAGGCGTG GGCCGCCCGG
GGCGAGCTGG CCTTCGAGGC CGGCGACCTG GCCGGCGCCG CCGGCGACCT GGACCGGGCG
GTCGAGCTGG CCGGCACGCC CGCGATCCGG TTCAACCGGG CCGTGGTCTA CCAGGAGGCG
GCGCGATACG CCGAGGCGGC GGCCGACTAC GGGGCCGTGC TGGCGGCGAT CGACGACGCG
GAGGCCCGGG AGCGGCTGGA CGCCTGCCTG AAGGCCGTCG CGACCTGA
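
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of that step, with a GC-fraction helper and a rough completeness check. The helper names are hypothetical, and the stop codon set is the one assumed for translation table 11 (TAA, TAG, TGA). Only the first printed line is used as sample input here; pasting the full sequence would allow a comparison with the reported 2088 bp length and 74% GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def join_blocks(text: str) -> str:
    """Remove all whitespace so 10-base blocks and line breaks form one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First printed line of the gene sequence, used as a small sample.
first_line = "ATGGGCAACC ACTACTGGCT CCGAGGCGCC CGGCGCCGCG ACCGCGACCG GTCCAGGGAC"
fragment = join_blocks(first_line)
print(len(fragment), round(gc_fraction(fragment), 2))
```

The completeness check does not test for an ATG start codon, since translation table 11 also permits alternative start codons.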
 
Protein sequence
MGNHYWLRGA RRRDRDRSRD GLDLPPTLAV IDAHRRLRGP YTAAGALIRS IAAEALALRP 
ELGPAHNIEL LTSTPELAGV VPSAWATLEW SVGEKERTRF YSRLHTLNVS NGLAEFLRDY
LAAIGGGPRT LVLENIHQAD PTDQEFAAVL LRRSDLGQLT VVVGTGPDPV ADPQGEIAVS
LASVLAAHAE PVDCPASPAS PASSDGDAWD YVAGDGTTDD PELLAAYERL PREERARLHD
ERAALLTGRG EFSLLLGAVP YHAEHGSDPR GAGLGAIRKA LQHCKDIGLY QAAVELGLRA
RAIVDRPAQE ELWWYFTNST STCMASLGRA DEANAIYDEA RGVTQDPVVH MDLAYGTAML
YARHYPEERR DYQQARAWMN LSVAIASLLG DRKERAFHSV FANNGLALVE VRQKSSEVAL
RLLEDGMARL DRELGPDEHA LHRAVLRYNR AQVFGMTGRL EEALADYAVV VELDPEFPEH
HFNIGNILRR LGRNEEAVAA YERALRLSPP FPEAYYNLAD ARLELGDVPG AVADFVYTIE
LDPGHVDAHV NLAGLLHELD DAEAAWRVTT AGLALAPDNA HLLCLKGKLL AERGDADAAR
DALSAALRRD DALAEAWAAR GELAFEAGDL AGAAGDLDRA VELAGTPAIR FNRAVVYQEA
ARYAEAAADY GAVLAAIDDA EARERLDACL KAVAT