Gene Sros_4669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4669 
Symbol 
ID8667963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5192010 
End bp5193557 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340266 
Protein GI271966070 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.679977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.120633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCTC AGCAGGTCAC CGCCGAGCCG CCGTCGAACG ACAGGGGCGA GCGTGACTCC 
ACGGCGGGCG ACGTCCCCGG AGTTCTCCGG TCGCCGGCCT CAGGCGCCGA TCGCGCGCCC
CTCATCTTCG ATGTGATCAT TGCCGGGTGC GGGCCGACCG GTGCGACGCT GGCCGCCGAA
CTGCGGCTGC ACGATGTGCG GGTACTCGTT CTGGAGAAGG AAACCGAGCC CGCGTCGTTC
GTCCGCATAG TCGGTCTGCA TATTCGCAGT CTCGAGCTGA TGGCCATGCG CGGACTGCTG
GATCGCATTC TCCAGCATGG AAGACAGCGT CCGGCCGGCG GCTTCTTCGC CGCCATCCCC
AAACCCGCGC CCAAGGGCCT GGATTCCGCA TACGCCTATC TGCTGGGCAT CCCGCAGCCG
GTCATCGTTC ACCTGCTCGA AGAACATGCG ATCGAACTGG GTGCGCAGGT CCGGCGCGGT
TGCGCGGTCG CCGGTTTCGA GCAGGACGAC GAGGGGGTGA CCGTCGAGCT GGCCGACGGG
GAACAGCTGC GTTCGCGCTA CCTCGTCGGC TGCGACGGCG GGCGCAGTAC GGTGCGCAAA
CTGCTCGGCG TCGGCTTCCC CGGCGAGCCC TCGCGGACCG AGACGCTGAT GGGCGAGATG
GAAGTGGGTG TGCCGCAGGA GGAGATCGCC GCCAAGGTGA CCGAAATCAG CGAGACCCAT
CAGCCATTCT GGCTCAGGCC CTTCGGCGAA GGGGTCTACA GCGTCGTCGT CCCCGCCGCG
GGAGTCAGCG ACCGCGCGGA ACCGCCCACC CTCGAGGATT TCAAACAACA GTTGCGCACC
ATCGCCGGAA CCGATTTCGG CGTGCACTCC CCGCGCTGGT TGTCCCGCTT CGGGGATGCC
ACCCGGCTGG CCGAACGTTA TCGGGTCGGG CGGGTGCTGC TGGCCGGCGA TGCGGCGCAC
GTCCATCCAC CCATCGGCGG ACAGGGCCTC AACCTGGGCG TTCAGGACGC GTTCAACCTC
GGCTGGAAAC TGGCCGCACA GATCCGCGGC TGGGCGCCGG AAACACTGCT GGACACCTAC
CGGGCCGAAC GGCATCCGGT CGCCGAGGAC GTGCTGGACA ACACCCGCGC CCAGACGGAA
CTGCTGTCCA CCGAGCCGGG TCCGCAGGCC GTGCGCAGGC TGCTCACCGA ACTGATGGAC
TTCGACGAGG TGAACCGCCA TCTGATCGAG AAGATCACCG CGATCGGCAT CCGCTACGAC
TTCGGCGCAG GCCCCGACCT GCTCGGCCGC CGCCTGCGCG ACATCGACGT GAAACAGGGC
CACCTCTATG GTCTGCTGCA TCGCGGCCGC GGCCTGCTGC TGGACCGCAC CGAACGCCTG
ACCGTCGACG GCTGGTCAGA CCGGGTCGAT TACCTCGCGG ATCCCACGGC GGCACTGGAT
GTTCCGTGCG TCCTGCTCCG TCCCGACGGC CACGTCGCCT GGATCGGCGA CGATCAGCAG
GATCTGGACG ACCACCTCTC CCGCTGGTTC GGCAAGCCCG CCGACTGA
 
Protein sequence
MHSQQVTAEP PSNDRGERDS TAGDVPGVLR SPASGADRAP LIFDVIIAGC GPTGATLAAE 
LRLHDVRVLV LEKETEPASF VRIVGLHIRS LELMAMRGLL DRILQHGRQR PAGGFFAAIP
KPAPKGLDSA YAYLLGIPQP VIVHLLEEHA IELGAQVRRG CAVAGFEQDD EGVTVELADG
EQLRSRYLVG CDGGRSTVRK LLGVGFPGEP SRTETLMGEM EVGVPQEEIA AKVTEISETH
QPFWLRPFGE GVYSVVVPAA GVSDRAEPPT LEDFKQQLRT IAGTDFGVHS PRWLSRFGDA
TRLAERYRVG RVLLAGDAAH VHPPIGGQGL NLGVQDAFNL GWKLAAQIRG WAPETLLDTY
RAERHPVAED VLDNTRAQTE LLSTEPGPQA VRRLLTELMD FDEVNRHLIE KITAIGIRYD
FGAGPDLLGR RLRDIDVKQG HLYGLLHRGR GLLLDRTERL TVDGWSDRVD YLADPTAALD
VPCVLLRPDG HVAWIGDDQQ DLDDHLSRWF GKPAD