Gene Sros_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3800 
Symbol 
ID8667090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4237400 
End bp4239436 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339463 
Protein GI271965267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.658534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCCCCA ACCGACGGAC GACGGTCGCG CTGATCGTCG CCGCTCTCTC CCTCGCCGCC 
TCGGGAGGGC TCGCCACCGT CTCGGCACCG CCCGCCGCGG CGGCCATGGC GGACACCACG
GTGCCCGCGA CCGACGACGT CTTCATCAGC CAGGCGGACC CCGCGAAGTC CTACGGGACC
GCGACCTGGC TGTCCGCCTG CGCCGCGTCG TGCAACGACA AGACCAACGG TGAGCGCCGC
GTGCTCACCG GCTTCACCGT CTCGGGTGTG CCGAAGGGCG CCCAGAACGT GAAGATGACC
CTTGAGGTCA CGCCCGGCAG GACCGTCGAG ACCGTGATCT CGGTGCACAA GGTGACCAGC
GCCTGGTCCG AGAGCGCCAC CACCTGGAAC AGCAGGCCCA CGCTCGGCGA GGCCTTCGCC
ACCCGTGACG GCTTCACCGC GAACAAGGCC GCCGGGATCG ACGTTTCCTC GGCCTTCACC
GGCAACGGGC GCTACACCTT CGCCCTCACC GCCGGCGAGG GACCCGTCGC CGTCCTGTAC
TCCTCGCGGG AGACGGGCAA CAGGGGACCA CGCCTGAAGA TCAGCTACTC CCCGGCCGGC
GCCACCCCGA CTCCCACGCC CAGTGCCACT CCCACCGCTA CGGCCACGCC CACGCCGCCG
CCGTCAACCG GGAACTGCCT GCCGTTCGAC AAGCCCTCGA CGGCGGCGCT GCGCTCCTTC
GACAAGAAGG TCTTCGCCTT CTACTTCCCG CCCTTCCCCG TGTCGATCGA CAACAAGGAC
CCCTCCAAGG ACCAGCACGC CTCCTGGCTG GACCCGATGG GCTCCAACGG GATGTACGCC
GGCCAGGGCG GGCACTCGCG TGACCGGCCG CTGCCGCGCC CGGTCCGCCC GGAGAAGAAC
TGGCGCCAGC TCGACTTCGA GGTCGAGGTA CGGCAGGCGA TCGCGATGGG CCTCGACGGG
TTCATCTACG AGCACCACAC CTCGGCCAGC GACCAGCGGT TCAACCAGTT CCCCGCGATG
CTGGCGGCGG CCAAGGCGGT CGACCCCAAC TTCAAGATCA TGCTCAGCCC CGACTTCCCC
ACGGCCAAGG ACTCCCCGCA CGACAAGGTC ATCGCGGACA TCCTCATGGC CAAGGGGCAC
CCGTCGCTCT ACAAGCTCGA CGACGGCACC ATCCCGCTCG CGCCGTTCTA TCCGGAGCGG
CACCCTGCGG CCTGGTGGGA CCAGCTGCGC GACAAGCTGG CCGCCCAGGG CATGAAGACC
TCGCTCTTCC CGATCTTCCT CAGCTGGAAC GGCACCGGGA AGACCGAGTG GAACGACCAC
GTCGTCGGCT ACTCCTCGTG GGGCAGCAAG TGGGTGAGCA CGACCGAATC CCTGCGCAAG
GGCGGCATCG AGGCCCACAA GCGGGGACGC CTCTACATGG CGCCGGCCTC GCTCGAGGAC
GTGCGCCCCA GGGACAAGCG CCTGTGGGAG CCGGCCAACA GCCAGCTGCT GCAGCAGTCG
TTCCTGAAGG CCGCGCAGGG TGACGCGGAC ATCATCGCGC TCATCACCTG GAACGACTAC
GCCGAGTCGT GGGTGGCGCC GTCGGTCAAG CGAGGGTACG CGCCCTCCGA CCTGATCGCC
TACTACACGA CGCTGTTCAA GACCGGGAAG GCGCCGGCGG TGGCTCGCGA CGCGCTCTAC
TACTTCCACC GCAGCCACCG CACCGACGCG CCGTTCGACG CCACCAAGCA GACCATCGGC
CCGATCAAGG TCCTCAACGG CGACCCCGCC ACGAACACCG TGGAACTCGT CGCCTTCCTC
AAGACGCCGG GCAAGCTGGT GATCAAGCAG GGTTCCCAGG TCAACACGTT GGACGCGCAG
GCGGGGCTGG TCTCGTTCAA GGCGGAGATG GTGCCGGGCA CGACGCCCGT CTTCGAGCTC
CAGCGCGGCG GGAAGACCGT CCAGACCGTG GAGAGCAAGA CCCCGATCCG CAAGAGCGTC
GTCTACCAGG ACCTCATCAA CCACGCCGGA GGTGGCCTGA GCTGCAACCG TCCGTGA
 
Protein sequence
MPPNRRTTVA LIVAALSLAA SGGLATVSAP PAAAAMADTT VPATDDVFIS QADPAKSYGT 
ATWLSACAAS CNDKTNGERR VLTGFTVSGV PKGAQNVKMT LEVTPGRTVE TVISVHKVTS
AWSESATTWN SRPTLGEAFA TRDGFTANKA AGIDVSSAFT GNGRYTFALT AGEGPVAVLY
SSRETGNRGP RLKISYSPAG ATPTPTPSAT PTATATPTPP PSTGNCLPFD KPSTAALRSF
DKKVFAFYFP PFPVSIDNKD PSKDQHASWL DPMGSNGMYA GQGGHSRDRP LPRPVRPEKN
WRQLDFEVEV RQAIAMGLDG FIYEHHTSAS DQRFNQFPAM LAAAKAVDPN FKIMLSPDFP
TAKDSPHDKV IADILMAKGH PSLYKLDDGT IPLAPFYPER HPAAWWDQLR DKLAAQGMKT
SLFPIFLSWN GTGKTEWNDH VVGYSSWGSK WVSTTESLRK GGIEAHKRGR LYMAPASLED
VRPRDKRLWE PANSQLLQQS FLKAAQGDAD IIALITWNDY AESWVAPSVK RGYAPSDLIA
YYTTLFKTGK APAVARDALY YFHRSHRTDA PFDATKQTIG PIKVLNGDPA TNTVELVAFL
KTPGKLVIKQ GSQVNTLDAQ AGLVSFKAEM VPGTTPVFEL QRGGKTVQTV ESKTPIRKSV
VYQDLINHAG GGLSCNRP