Gene Sros_1240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1240 
Symbol 
ID8664515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1271189 
End bp1273618 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content71% 
IMG OID 
ProductGlucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_003336981 
Protein GI271962785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00167459 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGATCTC CTATCCGAGC GCGTTCGCTG GCCGGCGCGC TCGTGGCCGG GCTGCTGCTC 
GGCCTGCTGG CCGCGCTCCC CGCCGCGGCC GCGCCGACCA GGTACGAAGC GGAAACCGCC
ACCATCTCCC AGGGCGCCGT GGAATCCAAC CACGCCGGAT TCTCCGGCAC CGGGTTCGTC
AACACCGACA ACGTCGCCGG CTCCTACGTC GAGTTCACCG TCAACGCCGC CGGGGCCGGA
TCCGCCGCCA TCGCCATCCG GTACGCCAAC GGCACCACCG TCAACCGCCC CGCCGACGTC
GCCGTCAACG GCACCGTCGT CTCGGCCGGC CGCGCGTTCG ACGGCACCGG CGCCTGGGCC
ACCTGGGCGA CCTCCACCCT GACCGCGCCG GTGAGGGCCG GGGCCAACAC CGTCCGGATC
ACGTCCACCA CCGCGGGCGG CACGCCCAAC CTGGACTTCC TGGACGTCGA GGTGACGGTG
GCCGGCAACG ACTACCAGGC GGAGAACGCC ACGATCTCCC AGGGGGTCGT GGACACCAAG
CACGCCGGGT TCACCGGCAC CGGATTCGTC GACTACGCCA ACGTCGCCGG CTCCTACGTC
GAGTTCACCG TCAACGCGGC GACCGCCGGA AACTACGGCC TGAGGTTCCG CTTCGCCAAC
GGCACCGCGA CCGACAGGCC GATGGACGTC TCCGTCAACG GCGTGACGGT CTCGGCCGGG
CTGTCGTTCC CCGGCACGGG CGCGTGGACC ACCTGGGTGG AGAGATCCGT CACCGCGGGC
CTCGTCGCCG GTGCCAACAA GGTCCGGGCC ACCGCCACCA CCGCGGGCGG CGGCCCCAAC
CTCGACCGGC TGAGCGCGGC GGCGCCGGCC GACGCCGAGC CGCCGACCGC CCCCGCGAAC
CTGCGCGTCG TCGGCGAGGT CAGGCCGACC TCGGCCGACC TGGCCTGGGA CGCGTCCACC
GACAACGTGG GCGTGTCGCA GTACAAGATC TACAACGGCG GCAACGTGCT CATGACCGTG
GGCGGCAACG TCACCGCCAC CACGCTGCCG GGCCTGACGC CCAACACCAG GTACGTGCTG
AGCGTGCTCG CCTACGACGC GGCGGGCAAC GCCTCCCAGG GCGGCAACAA CGTCGACGTC
ACCACGCCGC CGAGCGACGA CGTCCAGCCG CCGAGCACGC CGGCCGGCCT GCGCGCGACC
AACGTCGCGG CCGGCACGGT CACGCTGGCC TGGAACGCCT CGACCGACAA CATCGGGGTC
ACCGGCTACC ACGTCTACCG CGACGGCACG CGGTTCGCCA CCGTGCCCGA CCTGACGGCG
ACCGCGGACG GGCTGGCGCC CAACACGACC TACGCGTTCA CCGTCGAGGC GTTCGACGCC
AACGGCAACG TCTCGCCGCG CAGCGCGCCA CTGGCGGTGA AGACGAGCGG TACGGCTGGC
GGAGGCGACC CCGGCTACGA CAGGGACGTC GTCAAACTCG ACCTGCCGTG GGGCGTCGCC
TTCCTGCCCG ACGGCAGCGC GCTGGTCGCC GAGCGGGACC GGTTCGAGAT CGTCCGCGTC
ACCCGGAGCG GGCAGAAGAC CGTGGCCGGA AAGATCACCG AAGCGGTGAC GACCAGCGGC
GAGGGCGGGC TCCTGGGCCT GGCGATCTCG CCGGACTTCG CCACCGACCA CTACGTCTAC
GCCTTCCACA CGGCCGCGTC CGACAACCGC GTCGTGCGGT TCACGTACGA GAACGGGCAG
ATCGGTGCCC GCGAGCCGCT CGTCACCGGC ATCGCCAAGA ACAAGTTCCA CAACGGTGGC
CGGATCAAGT TCGGCCCGGA CGGCTTCCTC TACATCACCA CCGGCGACGC CCAGGACGGC
AACCGGGCGC AGAACCTCAA CTCGCTCAAC GGCAAGATCC TGCGCGTGAC GCCGACCGGC
GCGGGCGCGC CCGGCAACCC GTTCCCGAGC GCGCCGCGGG TGTACTCCCT CGGCCACCGC
AACCCGCAGG GCCTGGCCTG GGACTCCCAG GGACGGCTGT GGCAGTCGGA GTTCGGCGAC
GCCACACTGG ACGAGCTCAA CCTGATCCAG CCCGGCAAGA ACTACGGCTG GCCCAACTGC
GAGGGCAGGT GCAGCAACTC GGCATACGTC AACCCGGTCC AGCAGTGGGA CGTCGCCGCC
GCGTCGCCGA GCGGCCTGGA GATCGTCAAC GACTGGATCT ACATGGCGGC CGTCCGGGGC
CAGCGGCTCT GGGTCATGAA GATCACCGGT AGCACCACCG ACACGCCCCG GGCGTTCTTC
AACGGCCGCT GGGGCCGCCT GCGCACCGTC GTCAAGACCC CCGACGGCGG GCTCTGGCTC
ACCTCGACCA ACAACGACAA GAACGGCGGC ACGCCGTCCG TCCTCGACAA CACCGTGGTG
CGGCTGAAGT TCGCCGGCGC GGCCGGCTGA
 
Protein sequence
MRSPIRARSL AGALVAGLLL GLLAALPAAA APTRYEAETA TISQGAVESN HAGFSGTGFV 
NTDNVAGSYV EFTVNAAGAG SAAIAIRYAN GTTVNRPADV AVNGTVVSAG RAFDGTGAWA
TWATSTLTAP VRAGANTVRI TSTTAGGTPN LDFLDVEVTV AGNDYQAENA TISQGVVDTK
HAGFTGTGFV DYANVAGSYV EFTVNAATAG NYGLRFRFAN GTATDRPMDV SVNGVTVSAG
LSFPGTGAWT TWVERSVTAG LVAGANKVRA TATTAGGGPN LDRLSAAAPA DAEPPTAPAN
LRVVGEVRPT SADLAWDAST DNVGVSQYKI YNGGNVLMTV GGNVTATTLP GLTPNTRYVL
SVLAYDAAGN ASQGGNNVDV TTPPSDDVQP PSTPAGLRAT NVAAGTVTLA WNASTDNIGV
TGYHVYRDGT RFATVPDLTA TADGLAPNTT YAFTVEAFDA NGNVSPRSAP LAVKTSGTAG
GGDPGYDRDV VKLDLPWGVA FLPDGSALVA ERDRFEIVRV TRSGQKTVAG KITEAVTTSG
EGGLLGLAIS PDFATDHYVY AFHTAASDNR VVRFTYENGQ IGAREPLVTG IAKNKFHNGG
RIKFGPDGFL YITTGDAQDG NRAQNLNSLN GKILRVTPTG AGAPGNPFPS APRVYSLGHR
NPQGLAWDSQ GRLWQSEFGD ATLDELNLIQ PGKNYGWPNC EGRCSNSAYV NPVQQWDVAA
ASPSGLEIVN DWIYMAAVRG QRLWVMKITG STTDTPRAFF NGRWGRLRTV VKTPDGGLWL
TSTNNDKNGG TPSVLDNTVV RLKFAGAAG