Gene Sros_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0397 
Symbol 
ID8663665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp392555 
End bp394435 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content71% 
IMG OID 
Productcarbohydrate ABC transporter 
Protein accessionYP_003336171 
Protein GI271961975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGAGG CCGAAGGACC CGAGGGTCAA GAGGGAAGGA GCTCCGGGGC GGGCTCCCTG 
TTCCGCCGCG GACAGGTCGC GGGCGAGCTG GCCTGGCGGG CGCAGCCGGC CGCCGTGGCA
GGTTCGCTGG TGGTGGTGCT GCTCATGGGG GTGGCGCCGG TGGCAGCCGC CTGGCTGACC
AAGCTGGTCC TCGACGGGCT GGCCGGCGGC GGGGGCCGGA GCGGCCTGCT CCCGCTCGCA
GCCGCCCTCG GGGTGGCGGG CCTGGCTCTG ATCGTCCTGC CGCACGTTTC CGAGTATCTC
AACGGGCAGC TGCACCGGGG GCTTCAGAGG CTGATCTACG ACCGGCTGTT CCGGGCCGTC
AACGCCGACC CCGGGCTCAG CCGCTTCGAG AACCCCTCCT TCCACGACGA ACTGCGGCTG
GCCCACGAGG CCGGCCAGAA CGCTCCGCAG CAGATCGTCG GCTCGTCCCT GGCGATCTGC
CAGGGCATCC TGACGCTGCT GGGGTTCATC GTCACGCTGA CCGTGCTCAG TCCGGTCATG
GTCGTCATCG TCATCGCCGC CGCGTTGCCG TCAGCCCGGG TGCAGCTCGC GCTGAGCCGC
CGCAGGGCCG GCACGATGTG GGGCATCAGC TCCAACACGC GCCGCCAGAT CTTCTTCAGT
GCCCTTCTCA CCAGTCCCGA CGCGGCCAAG GAGGTGCGGC TGTTCGGGCT GGGCGACTTC
CTGCGCAGCC GGATGCTCAC CGAACTCAAG ACCGTCAACG AGGCCGAACG GGACCTGGAC
CGGCGGACAC TGCACAGCCA GGGACTGCTG TCCCTGCTGG GCGCCGCCGT CGCGGCCGGC
GGCCTGTTCT GGGCCGTGCT GACCGCCATC GAGGGAGGTC TGAGCGTCGG GGACGTCTCG
GTGTTCGTCG CCGCCGTCGC CGGGGTGCAG ACGGCCCTGA CCCTGATCGT CGGCCAGTGG
GCGAGCGCGC ACAACGCCCT GCTGATGTTC GGGTACTACC TCGGCGTCGT CCAAGCCGAT
CCCGGGCTGC CGGTGGCCGC GGAACCCACC GCGCTGCCCC CGCTGCGCGA GGGCATCGAG
CTGCGCGACG TGTGGTTCCG CTACGACGAC GGGCACCCCT GGGTGCTGCG AGGGGTTGAC
CTCTTCATCC CGCACGGCCA GGCCGTCGCC CTGGTCGGCC TGAACGGGGC GGGCAAGAGC
ACCCTGGTCA AACTGCTGTG CCGGCTGTAT GACCCGGTCC GCGGCTCCAT CCACTGGGAC
GGCGTGGACC TGCGCGACGT GCACCCGGAC GACCTGCGCG CCCGGGTCGG CGCGGTGTTC
CAGGACTACA TGTCATACGA CCTCACCGCC ACCGAGAACA TCATGCTGGG CGACCTGACC
GCGGGCGCCG ACCCGGAGCT CGTCCGGACC GCCGCCCGGC GCGCGGGGAT CCACGACACG
CTCGACGATC TGCCCAGGGG CTACGACACG CTGCTCAGCC GGATCTTCTT CGGCGACAAC
GACGACCGGC AGGCGGGAGT GACGCTGTCG GGCGGCCAGT GGCAGCGGCT GGCGCTGGCC
CGCGCCCTGA TGCGCCACCA CCGGGATCTG CTCATCCTCG ACGAGCCGAG CTCCGGTCTG
GACGCGGAGG CCGAGCACGC CGTCCACCTG GGACTGCGCC GGCACCGGGC AGGCCGTACC
AGCGTCCTGA TCTCACACCG GCTCGGGGCC CTGCGGGACG CGGACCTCAT CATCGTGCTG
AGCGAGGGGA AGATCACCGA ACGCGGTTCC CACAGCGAGC TCATGGCGCT GGGAGGAGAA
TACGCGCGGC TGTTCAGCCT GCAGGCCAGC GGCTACGAGC TCGCACCGGA CCCCTCCGCC
GCGCCTTCCG GAAAGCCCTG A
 
Protein sequence
MREAEGPEGQ EGRSSGAGSL FRRGQVAGEL AWRAQPAAVA GSLVVVLLMG VAPVAAAWLT 
KLVLDGLAGG GGRSGLLPLA AALGVAGLAL IVLPHVSEYL NGQLHRGLQR LIYDRLFRAV
NADPGLSRFE NPSFHDELRL AHEAGQNAPQ QIVGSSLAIC QGILTLLGFI VTLTVLSPVM
VVIVIAAALP SARVQLALSR RRAGTMWGIS SNTRRQIFFS ALLTSPDAAK EVRLFGLGDF
LRSRMLTELK TVNEAERDLD RRTLHSQGLL SLLGAAVAAG GLFWAVLTAI EGGLSVGDVS
VFVAAVAGVQ TALTLIVGQW ASAHNALLMF GYYLGVVQAD PGLPVAAEPT ALPPLREGIE
LRDVWFRYDD GHPWVLRGVD LFIPHGQAVA LVGLNGAGKS TLVKLLCRLY DPVRGSIHWD
GVDLRDVHPD DLRARVGAVF QDYMSYDLTA TENIMLGDLT AGADPELVRT AARRAGIHDT
LDDLPRGYDT LLSRIFFGDN DDRQAGVTLS GGQWQRLALA RALMRHHRDL LILDEPSSGL
DAEAEHAVHL GLRRHRAGRT SVLISHRLGA LRDADLIIVL SEGKITERGS HSELMALGGE
YARLFSLQAS GYELAPDPSA APSGKP