Gene Sros_8742 details


Gene Information

Locus tag: Sros_8742
Symbol:
ID: 8672080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9641863
End bp: 9643872
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: carbohydrate ABC transporter
Protein accession: YP_003344121
Protein GI: 271969925
COG category:
COG ID:
TIGRFAM ID:
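
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (which is consistent with the listed 2010 bp) and that the CDS ends in a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length_bp(9641863, 9643872)
print(length, protein_length_aa(length))  # prints: 2010 669
```

Both results agree with the Gene Length and Protein Length fields of the record.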


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA AGCCCGTGAC GACCACACGA CCGCCGGCGG GCGGCGGAGG CTTCGGCCGG 
GGGCCCTTCG GGGGGGCCGG GATGCCCGCG GAGAAGTCGA TGAACTTCGG GCCGTCCCTG
CGGCGGCTGA TGCGGCGGCT GAGCCCCGAA CGCCCGAGGA TCCTGGCCGT GATCGGCCTC
GCCGTGACCA GCGTGGTTTT CGCCGTCGTG GGACCGAAGA TCCTCGGCCA CGCGACGGAC
CTGATCTTCA GCGGTGTGAT CGGCAAGCAG CTTCCCGCCG GGACGACCAC GGAGCAGGCG
GTCCAGGCCG CCCGCGCGTC GGGCGACGAC AACTTCGCCG CCCTGCTCGC ACGGATGGAC
GTCGTGCCCG GTCACGGGAT CGACTTCGGC GCACTGGGCA CGGTCCTGGT GTGGGCCCTG
GCGCTCTACG TCGCGGCTTC GGTCTTCAGC TGGTTGCAGG GTTACCTGCT CAACGACGTG
GTGCAGCGCA GCGTGTTCCG GCTGCGGGCG GACGTCGAGG ACAAGCTGAA CCGGCTCCCC
CTGAAGTTCT TCGACGGCCA GCCGCGCGGT GAGCTGCTCA GCCGGGTCAC CAACGACATC
GACAACGTGT CCCAGACCCT GCAGCAGACG ATGAGCCAGC TGCTGACCTC GCTGCTGACC
GTGATCGGCG TGCTGGCGAT GATGTTCGTG ATATCGCCGC TGCTCGCGCT GATCGCCCTG
GTGACCATCC CGCTGTCGAT AATCGTCACC GGGCAGATCG CGAAACGCTC GCAGAAGTTC
TTCGTCGCGC AGTGGGCCAA CACCGGCACC CTGAACGCCC ACATCGAGGA GGCCTTCACC
GGCCACGAGC TGGTGAAGGT CTTCGGGCGG CAGCCGGAGG TCGAGCAGGT GTTCCGGGAC
AGGAACGAGG AGCTGTTCAA GGCCAGCTTC GGCGCCCAGT TCGTCTCCGG GATCATCATG
CCGATGATGA TGTTCATCGG GAACCTCAAC TACGTCGCCA TCGCCGTCGT CGGGGGCCTG
CGGGTCGCCA CCGGGTCGAT GAGCCTGGGT GACGTCCAGG CGTTCATCCA GTACTCACGG
CAGTTCACCC AGCCCCTGAC CCAGGTCGCC TCGATGGCCA ACCTGCTGCA GTCGGGGGTG
GCGTCGGCCG AGCGGGTCTT CGAGCTGCTG GACGCCGAGG ACCAGTCCCC GGACCCCGCC
GACCCGGTGC CGCCCACCGC CCGGCGGGGC CGCGTCGAGT TCGAGCAGGT GTCCTTCCGC
TACGTCCCGG AGCAGCCCCT CATCGAGGAC CTGTCGCTGG TGGCCGAGCC CGGGCACACG
GTCGCGATCG TCGGCCCGAC CGGTGCGGGA AAGACGACCC TCGTCAACCT GATCATGCGC
TTCTACGAGC TGGACGCCGG CCGGATCACC CTCGACGGGG TCGACATCAC CGCGATGCGC
CGCGAGGACC TGCGCTCGCA CATCGGCATG GTGCTCCAGG ACACCTGGCT GTTCGGCGGC
ACCATCCGCG AGAACATCGC CTACGGCAAC CCCGGGGCGA CCGAGGAGCA GCTCCAGGCC
GCCGCGCGGG CCACGTTCGT CGACCGGTTC GTCCGCACCC TGCCCGACGG CTACGACACC
GTGATCGACG ACGAGGGCAA CAACGTCAGC GCCGGAGAGA AACAGCTCCT GACCATCGCC
CGCGCCTTCC TCGCCGACCC GTCGCTGCTC ATCCTCGACG AGGCCACCAG CTCGGTCGAC
ACCCGGACCG AGGTCCTGGT CCAGCACGCG ATGGCCGCCC TCCGCTCCGA CCGCACCAGC
TTCGTCATCG CCCACCGCCT GTCCACCATC CGCGACGCCG ACCTCATCCT GGTGATGGAC
GGGGGCCGCA TCGTCGAGCA GGGCACCCAC GACGAGCTGC TGGCCGCTCG GGGCGCCTAC
CACCGGCTGT ACTCCGCCCA GTTCAGCGGC GCCCTGGTCC CGGACGGCGA CGAGGTGAGC
GGGGAGGAGG TGACGGGGGT GCGAGGGTAG
 
Protein sequence
MSDKPVTTTR PPAGGGGFGR GPFGGAGMPA EKSMNFGPSL RRLMRRLSPE RPRILAVIGL 
AVTSVVFAVV GPKILGHATD LIFSGVIGKQ LPAGTTTEQA VQAARASGDD NFAALLARMD
VVPGHGIDFG ALGTVLVWAL ALYVAASVFS WLQGYLLNDV VQRSVFRLRA DVEDKLNRLP
LKFFDGQPRG ELLSRVTNDI DNVSQTLQQT MSQLLTSLLT VIGVLAMMFV ISPLLALIAL
VTIPLSIIVT GQIAKRSQKF FVAQWANTGT LNAHIEEAFT GHELVKVFGR QPEVEQVFRD
RNEELFKASF GAQFVSGIIM PMMMFIGNLN YVAIAVVGGL RVATGSMSLG DVQAFIQYSR
QFTQPLTQVA SMANLLQSGV ASAERVFELL DAEDQSPDPA DPVPPTARRG RVEFEQVSFR
YVPEQPLIED LSLVAEPGHT VAIVGPTGAG KTTLVNLIMR FYELDAGRIT LDGVDITAMR
REDLRSHIGM VLQDTWLFGG TIRENIAYGN PGATEEQLQA AARATFVDRF VRTLPDGYDT
VIDDEGNNVS AGEKQLLTIA RAFLADPSLL ILDEATSSVD TRTEVLVQHA MAALRSDRTS
FVIAHRLSTI RDADLILVMD GGRIVEQGTH DELLAARGAY HRLYSAQFSG ALVPDGDEVS
GEEVTGVRG
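
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the sequences are pasted as plain strings in the 10-base blocks shown above. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons), and since this gene begins with ATG no special start handling is needed. The names `translate` and `gc_percent` are hypothetical helpers, not part of any database API.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGACA AGCCCGTGAC GACCACACGA"))  # prints: MSDKPVTTTR
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence above once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.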