Gene Sros_8471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8471 
Symbol 
ID8671805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9347449 
End bp9349887 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343858 
Protein GI271969662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.503181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTT CCCGCAAGGG GCTGAAGAGT TCCCGCAAGG GACGACTGGG CAAGCGGATC 
GCCATCGCGC TGGTGGCGCT GACCGCGTTC CTCGCACTCC CGCTGCTGTC GCCGGCGGGC
GCACCCACCG CCGTCGCCGC CGCCCCCTGC GACCTCTCGC CGGACCTGGC TCCCGAGGTC
GTCGGTTCAG GCGTGGACGG CCTGCTCAAG CCGCCGTCCC CGGCCGCGCC CACGCCGGGC
GCGCCGGCGC CCGCCCCGGC TCCTGCCACG CAGGGCAACT ACGCCACCTA CGGCATGAGC
GGCCAGTTCT GGCACACCCA CCAGCTCGGC TGCGACGACG TCGCGGCCGT GATGGGCAAC
ACGATGGCCA ACACGATCTT CTCGTGGGCC AAGGCGCTGG ACCGGCTGAC CATCACCACC
TATCAGGCGG CGGCCACCGA GGGCCCGCTG GAGTCGATCA AGGATGTCGT CGACGACATC
GTGATCAGCC TCTCCAACGC GATGTACTGG CCGTTCCTGC GGCCGATCGT GATCCTCGGG
GCCATCTGGC TGGCCTGGTA CGGCCTGATC CGCAAGCGGG CGACCACGAC GGCCGAGGGC
GTCATCTGGA TGGTCCTCGC GGTCACCGTC GCGGTCTGGT TCTTCAGCCG CCCCGGCGAC
TTCACCGGCA TGGGCAAGCT GGTCACCGAC AAGACCAGCG AGGTCGTCAA CTCCGCCTTC
TCCGGGCTGC CCGGCGCGGG CGGAACCTCC TGCATTCCAG CCAAGGGTGA CACCAATCCC
GAGGTCAAGT CCGGTGGTTA CGGTCAGACC GGCGTCCCGG GGGTCGAACA GAACGCCGAG
GCGCTCTGGT CCACCCTGGT CTGCAAGCCG TGGCTGATGG GCGAGTTCGG CACCGCCGAC
CCCGCGGCGC CGGCGGTGAC GGCATACGCC TCCAAGCTGC TGAGCGTCCA GGCACTCGAC
GAGGCCGAGC AACGGGCCAC GCAGACCCCG AACACCAGCG CCCACCAGGC ACAATATGAG
GCCGAGATCG CCGACAAGCT GGAGAACACG CCGATCTTCT TCCTCTTCCA GGGAAAGGAT
TGGACAAACC GGCTGGGCAT CGCGATCGGC GCGCTGATGG CCGCGATGGT CGCGGGACTG
CTGATCTTCC TGGTGGCCGT CTCCCTGCTG GTGCTCAAGG TCGGCTTCCT GCTGTTGCTG
ATCCTCGCGC CGGTCTTCCT GCTGATCGGC GTGCATCCGG GGTCCGGCCG GATCATCGCG
ATGCGCTGGG TGGAGATGCT GGTCGGCACC CTGCTCAGAC AGGCGGTGCT GACACTGGTC
CTGGGCGTCC TGGTGTACGG CTACGCCCTG ATCATCTCCA CGGCGATGCC GTGGGGCATG
CAGGTCATGT TCATGGCTCT GCTGACGATC GCGGTCTTCT TCTACCGGCG TCCGTTCCAG
CACCTGTTCG CCTCCATGGA CGGGCACACG ATCGCCACCC GCGTGCTGGG CGACGCGGTC
ACCGCGCCGA CCCTGCAGCG CGCGGCGGGC GTGCTGCCAC CGGTCGCGGC GGCCCGGATG
GGCCGCTGGG GCATGCGCAA GGCGGAACCC GTGATGCAGG CGGCCGCGCT CGCCGGCGGC
GCCTCCGTCG CGACGGCGGC CGCCGCGGTC GCCCAGGGCA AGGTCCGCGG GGAGGAGGGC
GCCCCCGGCG GGAACACCCC CTCGGGCGCC CGGGTCCCGG CCGGCGCGCA GCCCACGCCA
CTCGACACCG ACGCCCAGGC GGGCGGCCGG CGCAAGGGCA CCGGCCGGCC GGTAACCGCG
GCGCGCGCCG GCGCGGCGCC GCCGCTCAAC CTCTCCGGAG GCGGTACGGC GGGCGGCACC
GTCCCGACCC GTTCGGGCGG TGCCGGCCGC GGCGGCGCGG GCGGCTGGTT CAGCGGCCGC
TCCGGCGGCT GGGCCCCGCA GGGAGGCTCG TCTCCGGCGG GCGGATCCTC TCCGGCGCCC
TCGTCCGGCG GCAGGTCCTC CGCCCCCTCC TCCGGTGGGG GGCCGGCGTC CCCGTCGGGG
GGCGGCCGGT CGGGCGGGAG CGTCTTCGGC TCCGGCGGCG GTTCCCGGCG CTGGGACTCC
GGCTCCCGAG GCTCGGGCTC CGGCTCGCGG AGCGGTTCCG GGTCCGGGTC GGGCTCGCGT
GGCGGGGGCG GCGGTTCCCG CGGCTCTTCG GGATCCGGCG GTTCCCGGAG CTCCTCCGGC
TCCGGCGGCG GCATCTTCGG GGGCTCCCGT GGCTCTTCGG GGTCGGGCGG CAGCCTGTTC
GGCGGATCCT CCGGCGGATC GAGGAACAAC GGCGGCGGGC GCCCGGACAG GTCGTCCGAG
GCACCCCCGC TCTGGCTGCC CAGCCGGTCC GAGCGGGGCA GGCAGGCGGA TGAGGCCGCG
CCCTTCTGGC TCCGCGCGTC CAACCCCGAC AAGGACTGA
 
Protein sequence
MKVSRKGLKS SRKGRLGKRI AIALVALTAF LALPLLSPAG APTAVAAAPC DLSPDLAPEV 
VGSGVDGLLK PPSPAAPTPG APAPAPAPAT QGNYATYGMS GQFWHTHQLG CDDVAAVMGN
TMANTIFSWA KALDRLTITT YQAAATEGPL ESIKDVVDDI VISLSNAMYW PFLRPIVILG
AIWLAWYGLI RKRATTTAEG VIWMVLAVTV AVWFFSRPGD FTGMGKLVTD KTSEVVNSAF
SGLPGAGGTS CIPAKGDTNP EVKSGGYGQT GVPGVEQNAE ALWSTLVCKP WLMGEFGTAD
PAAPAVTAYA SKLLSVQALD EAEQRATQTP NTSAHQAQYE AEIADKLENT PIFFLFQGKD
WTNRLGIAIG ALMAAMVAGL LIFLVAVSLL VLKVGFLLLL ILAPVFLLIG VHPGSGRIIA
MRWVEMLVGT LLRQAVLTLV LGVLVYGYAL IISTAMPWGM QVMFMALLTI AVFFYRRPFQ
HLFASMDGHT IATRVLGDAV TAPTLQRAAG VLPPVAAARM GRWGMRKAEP VMQAAALAGG
ASVATAAAAV AQGKVRGEEG APGGNTPSGA RVPAGAQPTP LDTDAQAGGR RKGTGRPVTA
ARAGAAPPLN LSGGGTAGGT VPTRSGGAGR GGAGGWFSGR SGGWAPQGGS SPAGGSSPAP
SSGGRSSAPS SGGGPASPSG GGRSGGSVFG SGGGSRRWDS GSRGSGSGSR SGSGSGSGSR
GGGGGSRGSS GSGGSRSSSG SGGGIFGGSR GSSGSGGSLF GGSSGGSRNN GGGRPDRSSE
APPLWLPSRS ERGRQADEAA PFWLRASNPD KD