Gene Sros_8481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8481 
Symbol 
ID8671815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9361791 
End bp9362951 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343868 
Protein GI271969672 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCC CCGACATCGT CCGCAATGTC ACCAGAAATG TCCGCGAGGC GCTTTCCAGC 
CGCGAGGGGT TCAAAGAGAA GGCGAAGGAG CTGCCGCTCT ACGTGCTGCA GTCCGCGCTG
AGCGGGGTCG GCCAGGCCCT GCTGATCGGC GACCGCATGC GGACCACGAT CAAGCGCATC
GCCGGGCAGG ACGACGACAC CGAGGAGACC CGCCCGAAGA CCGCCGACGA GCCCGGCAAG
GGCGAGAAGG CGGAGGAGAA GGTCGCCGAG AAGCCGGCCC GGCGCGCGCC GGTCATCTTC
GCCCCGCGCC CGGAGAGCTC CTCCCCCAAG GAGGACAGGC CCGAGGCCAA CGGCGCCAAG
CCCCGCCCCG AGCCGGTCAT CTTCGCCCCC GCCAAGCCGA AGGCCGCCAC CGGACCCGCG
TCCGAGCCCA AGCCCGCCGA GGCCGAGACG ACCGAGACCA AGGCCGCTGA GACCAAGCAG
GAGGCCGAGC CGGTCACGGC CGAGACCAAG CCTGCCGGGA GCGAGCCCGC CAAGGCCACC
GAGCCGAAGC CCGCCGTAGC CGAGACGACC GAAGCCAAGG CCACCGAGAC CAAGGCCGCT
GAGATCAAGG CCGCCGAGGC CAAGACGACC GAGGCCACCG AAGCCAAGGC CACCGAGACC
AAGCCCGCCG AGGCCAAGAC GACCGAAGCC GAGGCCGCTG AGGCCGCCAA GCCGGAGACC
GAGGCCGAGG CCGCCAAGCC CGAGCCCGCC GCGACCACGC CGGCCGGGAG CGAGCCCGCC
AAGGCCGCGA GGACCACCGA GGCCACCGAG GCCACCGAGG CCACGGTCTC CGAGACCGAG
ATCCCCGCCC CTGCGGCCGT GCAGGTCGAG GTCACCGAGG TCAAGGTGGC CAAGCCCGAG
ACGGCTGACG CCGCGCTGGT CGAGGTCACG GAGACCAAGG TCGCCAAGCC GGCGGCCGCG
CCGGTCGGGG CGGTCACCGT ACCCGCCGAG CCGATGCCCG GTTACGGCCA GCTGACGGTC
GCGTCCCTGC GCGCCCGGAT GCGCGGCAAG ACGGCCGGGC AGATCCGGGA GTTCCTCGCC
TACGAGCGGG CCACCACCGC CCGCGCGGAG GTCGTTCGGA TGTACGAGAA CCGGCTGGCC
AAGCTGGAGG CGGCGGAATA G
 
Protein sequence
MAVPDIVRNV TRNVREALSS REGFKEKAKE LPLYVLQSAL SGVGQALLIG DRMRTTIKRI 
AGQDDDTEET RPKTADEPGK GEKAEEKVAE KPARRAPVIF APRPESSSPK EDRPEANGAK
PRPEPVIFAP AKPKAATGPA SEPKPAEAET TETKAAETKQ EAEPVTAETK PAGSEPAKAT
EPKPAVAETT EAKATETKAA EIKAAEAKTT EATEAKATET KPAEAKTTEA EAAEAAKPET
EAEAAKPEPA ATTPAGSEPA KAARTTEATE ATEATVSETE IPAPAAVQVE VTEVKVAKPE
TADAALVEVT ETKVAKPAAA PVGAVTVPAE PMPGYGQLTV ASLRARMRGK TAGQIREFLA
YERATTARAE VVRMYENRLA KLEAAE