Gene Sros_0035 details


Gene Information

Locus tag: Sros_0035
Symbol:
ID: 8663298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 39788
End bp: 40996
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003335838
Protein GI: 271961642
COG category:
COG ID:
TIGRFAM ID:
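
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 39788
END_BP = 40996
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```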


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAC CCCTCTCCGC CACGGCACCC ACACATGTGA CCACTCGATC GCTGGCACCG 
GATCTGGCGC GCGGCTTCAT GCTGCTGCTG ATCGCCCTGG CGCACGCGCC GGCGTTCGTC
GGCGACTGGG ACGCCGGGCC CGCCGCGCTC AACACCGCCG CGAAGTTCGT CAAGTCCCTG
TTCGCCGACA ACCAGGCCCG CAGCATGTTC GTGCTGCTGT TCGGCTACGG TCTCGGCCAG
CTGGCCCATC GCCAGCACGC CCGGGGCGAC GACTGGACCT CGGTCCGGAA ACTGCTGCGG
CGCAGGGCCT TCTGGCTGAT CGTCATCGGC TTCGCGAACA CGGTACTGCT CGTGCCGATC
GACATCATCG CGGTGTACGG ACTGACGCTG CTGGTGCTCG CACCGCTCGT GCGAGCGCGG
GATTCGGTGC TGTGGTGGAC GAGCATCCTG ACGCTCATCC CCGCGACCCT CCTGCTGGCC
TGGCAGAGCG TGGCCGCCCA GGCGGGCCCC GTCACCATGG CGGAGTTCAT GGAGCCCACC
TTCGGCGCCC ACCTCGTCGC GAGCATTCCC TCCTGGCCGG TGGAGACCGC CATCTCCACG
ATCATCGTGG TGCCGGGCAT GCTGGTGGGA ATCTGGGCCG CCAGGCGCCG GATCCTCGAC
GAGCCCGAGC GCCATGCGTC GTTGCTGCGC CGCATCACTG TGATCTTCAT CGGGGTGTCC
GTCATCGGCA GGCTTCCCGC CGCTCTGCTG GCGGCCGGCG CGTGGACGAC CACCTCGGCC
CCGATCGGCT GGACGATTGC CATCGCGCAC GACCTGACCG GATACGCGGG CGGCATCGGC
ATGGCCGCCG CCGCCGGACT CGTCGCGATC AGGGTACGGC GTGGCCGTCT GATCACGGCC
CTGGCGGCGC TGGGGCAGCG CTCACTGACC TTCTACCTGC TCCAATCCGT GGTGTGGGTG
GCGCTGTTCT ACCCGTTCAC CCTGGGCTTG CGGGACGACA TGAGTTTCGC CGCCACTTTC
GGAATCGCCA TCGGACTCTG GGTGGCCTCT GTCCTGCTGG CCGAGTGGAT GCGCCGCGCG
GGCTACCGCG GCCCGGCGGA AGTGCTGTTG CGGCGACTGT CATACCGCCG CCCCGCTCCG
GCCTCCGTTT CCGATGAACC CCACGGCAGC CCCGCCCGGC AAGGCGAGAA CGCCGGACAC
CGGCTGTGA
 
Protein sequence
MAKPLSATAP THVTTRSLAP DLARGFMLLL IALAHAPAFV GDWDAGPAAL NTAAKFVKSL 
FADNQARSMF VLLFGYGLGQ LAHRQHARGD DWTSVRKLLR RRAFWLIVIG FANTVLLVPI
DIIAVYGLTL LVLAPLVRAR DSVLWWTSIL TLIPATLLLA WQSVAAQAGP VTMAEFMEPT
FGAHLVASIP SWPVETAIST IIVVPGMLVG IWAARRRILD EPERHASLLR RITVIFIGVS
VIGRLPAALL AAGAWTTTSA PIGWTIAIAH DLTGYAGGIG MAAAAGLVAI RVRRGRLITA
LAALGQRSLT FYLLQSVVWV ALFYPFTLGL RDDMSFAATF GIAIGLWVAS VLLAEWMRRA
GYRGPAEVLL RRLSYRRPAP ASVSDEPHGS PARQGENAGH RL
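
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. This is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (table 11 differs in which start codons it permits). Stop codons are shown as `*`.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence listed in this record.
first_row = "ATGGCCAAAC CCCTCTCCGC CACGGCACCC ACACATGTGA CCACTCGATC GCTGGCACCG"
last_row = "CGGCTGTGA"

print(translate(first_row))  # MAKPLSATAPTHVTTRSLAP
print(translate(last_row))   # RL*
```

The first row reproduces the first 20 residues of the protein sequence, and the last row gives the final two residues followed by the TGA stop codon.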