Gene Sros_8848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8848 
Symbol 
ID8672186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9768859 
End bp9770067 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344224 
Protein GI271970028 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACG CCGCGGGATG TCTCCCGATC GTCCTCCTCC CCCTCGCCAT CACGGGCTGC 
TCGTCGGTCG TGGCCGGAGG CGACCATCCG GCGGCGGCGC CGAGAGTGAA GATCTCCCCG
TCCCCGGACT CCGTGCGGGC CGGCACCGGC CGGGGCCTGG TCGTCCGGGC CGCCACGGGC
ACGCTGACCG GGGTGACCGC CTACGGGGGC GGCGCCCCGG TGCCGGGCCG GTTCGACGGC
ACCCGTTCCA CCTGGCGCTC GGACTGGACC CTGACCCCCG ACCGGGAGTA CATCGTCAAG
GTCACCGCGG CGGGCGGGGA CGGCGCCACG ACGACCACCT ACGGCAGGTT CCGCACGCTC
GCACCCTCCC GGACCTTCCA GGTCGCCTCC GTCGTCCCCG CGCCGGGCGA GACCGTGGGC
GTCGGCATGC CGATCATCGT GGACTTCACC GTCCCGGTGG AGGACAGGGC GGCCGTCGAG
AAGGCCCTGG AGGTCCGCTC CACCAAACCT GTCGAGGGCG CCTGGCACTG GGTGAGCGAC
ACCGAGGTGG TCTACCGCCC CCGCCGTGAC TGGCCGGCCC GGCAGCGGGT CTCCTTCACC
GCGCACCTGT CCGGGGTCCG CGCGTCCAGG GACACCTACG GCACGGCCGA CCACACGGTG
CCCTTCGCCA TCGGCCGGGG GCAGGTCAGC TTCATCGACA CCCGGACCCA CCAGATGCGG
GTCATGCGGG ACGGCAGGAC GGTCCAGCGG ATGGCCATCA GCGCCGGCAT GGCCACCACC
GAGGAATACA CCACGACGAG CGGCATCCAC CTGACCATGG ACAAGGCCGA CCCGGTCCGC
ATGGTCTCCC CCGGCCGCAA GAAGGGCGAC CCCGGCTTCT ACGACGTCAT GATCGACCAC
GCGGTCCGGA TCTCCAACAG CGGCGAATAC GTCCATGCCA AGGACAACGT GTGGGCGCAG
GGCAGGCAGA ACGTCAGCCA CGGCTGCGTC AACGCCCGGC CCGACCAGGC CGCCTGGTTC
TTCGACAGCT CCCTGCGCGG CGACCCGGTC GTCATCCAGG GCACCGACCG CGAGCTCCGC
TGGGACAACG GCTGGGGTTA CTGGCAGCGT TCCTGGGAGG AGTGGCTCGG CGGCAGCGCC
CTGCGCGCCG CCGAGCCGCC GCAGCTCCTG ATGACCCCTG ACACTCCGCC AGATAACGAC
ATACGGTAG
 
Protein sequence
MRHAAGCLPI VLLPLAITGC SSVVAGGDHP AAAPRVKISP SPDSVRAGTG RGLVVRAATG 
TLTGVTAYGG GAPVPGRFDG TRSTWRSDWT LTPDREYIVK VTAAGGDGAT TTTYGRFRTL
APSRTFQVAS VVPAPGETVG VGMPIIVDFT VPVEDRAAVE KALEVRSTKP VEGAWHWVSD
TEVVYRPRRD WPARQRVSFT AHLSGVRASR DTYGTADHTV PFAIGRGQVS FIDTRTHQMR
VMRDGRTVQR MAISAGMATT EEYTTTSGIH LTMDKADPVR MVSPGRKKGD PGFYDVMIDH
AVRISNSGEY VHAKDNVWAQ GRQNVSHGCV NARPDQAAWF FDSSLRGDPV VIQGTDRELR
WDNGWGYWQR SWEEWLGGSA LRAAEPPQLL MTPDTPPDND IR