Gene Sros_8838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8838 
Symbol 
ID8672176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9757351 
End bp9759069 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344214 
Protein GI271970018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.139707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTACC ACGCTGTGGC ATGCGACTAC GACGGGACGC TGGCCGCAGA CGGTCACGTC 
GACGACGGCA CCGTGGCCGC CCTCGAACGC CTCGTGCGCT CCGGGCGGCG GCTGCTGCTG
GTCACCGGCC GGCAGATCGA CGAGCTCAGA CGGGACTTCG GGCGGCTGGA CCTGTTCGAC
CGGATCGTCG CCGAGAACGG GGCCGTGCTG TACCGCCCCA GGGAGCCCGC GGAGCAGGCG
ACCTCGCCCC TGGCCGAGGG GCCGCCCGCC GCGCTCGTCG AGCGCCTGCG CGACCTGGGC
GTGGAACCGC TCGGTGTCGG CTCGGTGATC GTGGCCACCT GGGAGCCCAA CGGCGAGACC
GTTCTCCACG CGATCCGCGA CCTCGGCCTG GAGATGCAGG TGATCTTCAA CAAGGGCGCG
ATCATGGTCC TGCCCTCGGG GATGAACAAG GCCAGCGGCC TGGCCGCCGC CCTGGCGGAA
CTCGGGATAT CGGAGCACAG CACGGTGGGC GTGGGCGACG CCGAGAACGA CCACGCGTTC
CTGGCGGCCT GCGAGTGCGC GGTGGCGGTG GCCAACGCGC TCCCCGCCGT CAAGGAACGC
TGCGACCTGG TGACCGGGCG GGACCACGGC GCCGGGGTCA CCGAGCTGGT CGACCGCCTT
CTCGCGGACG ACCTGGCCGG CGTGGACGTC GTGCGGCACC GCCTCCCGCT CGGCACCGGT
GCGGCCGGCC AGGTGTCCGT CCCGCCGTAC GGCCTGCGGC TGCTGGTCGC CGGGCCCTCG
CACAGCGGCA AGTCCACCGT CACCGCCGCG CTGCTGGAGC GTGTCGCCGG GGCCGGCTAC
CAGTTCTGCC TGATCGATCC GGAGGGGGAC TACGCCGACG GGGTCGAGGG CGCGGTCGTG
CTGGGCGACG CCCGGCGCGC GCCCACCGGC GAGGAGGTGC TCCGGCTGCT GGAGGACGTC
CGGCAGAGCG TCGTGGTCAA CCTGCTGGGC CTGTCCATCG ACGACCGGCC GGGCTTCTTC
GAGGCGTTGC TGCCCCGCCT GTCGGCGCTG TGCGCCCGCC AGGGGCACCC GCACTGGCTG
GTGGTCGACG AGGCCCACCA CATGATGCCC GAGGGCTTCG GCCTGCAGCC GGCCGGGCTG
CTGGGCGAGA TGGGCGGGCT GCTGCTGGTC ACCGTGCACC CCGGCGCGGT CAGCGAGCCG
GTCGTGCGGG CGCTCAACGC GGTCGTCGCG GTGGGGGAGC GCCCGGGGGA CATCCTCGGC
ACGTTCGCCG CCGCCACCGG CCAGGACATG TCCCACCGGG ACTTCCCCGA CCTGCCGACC
GGGGAGCTGC TGTTCTGGGA GCTCGGCGGC GAGCCGGTCC GGGTGGAGCT GATCCCGCCC
GAGGAGGAGC GCCGCCGGCA CCGCCGCAAG TACGCGACCG GCGAGCTCGG GGAGGACAAG
AGCTTCTACT TCCGCGGCCC CCGGGAGGCG CTGAACCTGC GGGCCGACAA CCTCACGGCG
TTCTGCCGCC TCGCCGAAGG CGTCGACGAC GACACGTGGA CCTATCACCT GGGCCGGGGC
GACTACTCGC GATGGCTGGC GGAACAGGTC AAGGACGAGG AGCTGGCGGC CGAGGTGGCC
GGGGTCGAAC GGGCTCCCGG AGAGTCTGCC GCCGAGACCA GAAGGCGCGT GTGCGAGCTC
ATCGAGGCCC GGTACACCGC CCCCGCCGAA CCCACCTGA
 
Protein sequence
MRYHAVACDY DGTLAADGHV DDGTVAALER LVRSGRRLLL VTGRQIDELR RDFGRLDLFD 
RIVAENGAVL YRPREPAEQA TSPLAEGPPA ALVERLRDLG VEPLGVGSVI VATWEPNGET
VLHAIRDLGL EMQVIFNKGA IMVLPSGMNK ASGLAAALAE LGISEHSTVG VGDAENDHAF
LAACECAVAV ANALPAVKER CDLVTGRDHG AGVTELVDRL LADDLAGVDV VRHRLPLGTG
AAGQVSVPPY GLRLLVAGPS HSGKSTVTAA LLERVAGAGY QFCLIDPEGD YADGVEGAVV
LGDARRAPTG EEVLRLLEDV RQSVVVNLLG LSIDDRPGFF EALLPRLSAL CARQGHPHWL
VVDEAHHMMP EGFGLQPAGL LGEMGGLLLV TVHPGAVSEP VVRALNAVVA VGERPGDILG
TFAAATGQDM SHRDFPDLPT GELLFWELGG EPVRVELIPP EEERRRHRRK YATGELGEDK
SFYFRGPREA LNLRADNLTA FCRLAEGVDD DTWTYHLGRG DYSRWLAEQV KDEELAAEVA
GVERAPGESA AETRRRVCEL IEARYTAPAE PT