Gene Sros_6890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6890 
Symbol 
ID8670200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7590220 
End bp7592523 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content70% 
IMG OID 
Productcellobiohydrolase A (1 4-beta-cellobiosidase A)- like protein 
Protein accessionYP_003342336 
Protein GI271968140 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTC ATCCGCGCCG TCAAGCCTGG CGCAATCGGG CGGTGACCAC GGCGGCCCTG 
CTCGTGGCCG CGGCCGGTCT CGCCACCGGG CAGGCCGCCA CGGCACACGC CGCAGTCTCG
TGCGACGTCG CCTACAGCAC CAACGAATGG CAGGGCGGTT TCACCGCCAG CGTCACGGTC
AAGAACCTCG GTGACGCGCT CGCCGGCTGG ACCCTCGGCT TCGCCTTCCC CGGCACCCAG
CAGGTCCAGC AGGGCTGGTC GGCGACCTGG AGCCAGACCG GCAACCAGGT CACCGCGAAG
AACCTCGACT GGAACGGCAA CCTCGCCACC GGCGCGTCGA CCAGCATCGG CTTCAACGGG
TCGTGGAGCG GGAGCAACCC CAAGCCCACC GCCTTCACGG TCAACGGCGT CACCTGTGGC
GGCCAGCCGC CCGTAAACCA GGCTCCGACG GTCAGCCTGA CCAGCCCGGC GAGCGGCGCC
TCCATCCCCG CGGGCTCCGC CGTACCGCTC GCGGCGACGG CCGGCGACGA CGGCGCGGTC
AGCAAGGTCG AGTTCTACGT CGACGGCGCG CTCGTCAACA CCGACACCTC CGCCCCGTAC
GGCTACTCGG CGACGGGCGT GGCGGCGGGC AGCCACACCG CCAGGGCGAA GGCCTACGAC
AATGGGACCC CGGTCCTGTC CGCCGAGACC GCCGAGGTCC CCTTCACCGT CGGCTCCGAC
GGGGGCACCG CGGCGGTCGT CGCCTCCGCG ACCTCGGTCA GCGTCCCCGA AGGCGGCTCC
AAGACGGTCG GCTTCAGGCT CAGCAAGGCG CCCAGCGGCA ACGTCACGGT CAACCTGACC
AAGACCGGTG ACGCCGACCT CACCATCGCG CCCTCCACGC TCACCTTCAC CCCCGCCAAC
TGGAACACGG CGCAGAACGT GACCGTCTCG GCCGCCCAGG ACGCCGACCA GACCGACGGC
ACCGCCACCA TCGCGGCCGC CGCCACCGGG CACACCGGTG TCTCCGTCAC CGCGACCGAG
TCCGACGACG ACGTCGTCAC GCAGCCCGGC GAGCACGTCG AGAACCCGTA CGCCGGGGCC
ACCGGCTACG TGAACCCCGA CTGGGCGGCG AAGGCCGCGG CGGAGCCGGG CGGCGACGCG
GTGGCCGACA TCTCGACCGG CGTGTGGCTC GACCGCATCG CCGCCATCGA GGGCACCGCC
TCCGCCAGGG GCCTCCGCGC CCACCTGGAG GAGGCCCTCA GGCAGGACGC GGCCAACGGC
AGCAAGCCCC TGACGATCCA GTTCGTCATC TACAACCTGC CCAACCGCGA CTGCTCGGCG
CTGGCCTCCA ACGGTGAGCT GCTCATCGCC CAGAACGGGC TTAACCGCTA CAAGACCGAG
TACATCGACC CGATCGCGGC GATCATGGCC GAGCCGAGGT ACGCCACGCT CCGGATCTCG
ACGGTCATCG AGATCGACTC GCTGCCCAAC CTGATCACCA ACCTCAACGT GCCCAAGTGC
CAGGAGGCCA AGTCGAGCGG CGCCTACGTC GACGGCGTCC GCTACGCCCT GAACAAGCTG
CACGCGATCA AGAACGTCTA CACCTACATC GACGCCGCGC ACCACGGCTG GCTGGGCTGG
GACACCAACT TCGGCCCCTC GGCCGACCTG TTCGCGAGCA CCGTCGCCGG TACCACGGCT
GGGTTCGACA GCGTCGACGG CTTCATCACC AACACCGCCA ACTACTCGGC TCTGAAGGAG
CCGCACTTCA CCATCAACAC CACCGTCAAC GGCACCACGG TGCGCCAGTC GCGGTGGCTC
GACTGGAACT TCTACGTCGA CGAGCTGTCC TACGCCCAGG CCTTCCGGAC CCTGCTCATC
CAGAAGGGCT TCAAGCCGGG CCTCGGCATG CTGATCGACA CCTCCCGCAA CGGGTGGGGC
GGTACGGCCC GCCCGGCCGC TCCCAGCACC TCCACCGACG TGAACACCTT CGTCAACCAG
TCGCGGGTGG ACCGCCGCAT CCACGCCGGC AACTGGTGCA ACCAGAGCGG CGCCGGTCTC
GGCGAGCGCC CGCAGGCCAG CCCCGCCGCC GGCCTCGACG CCTACGTCTG GATCAAGCCC
CCGGGCGAGT CCGACGGCGC CAGCAAGCTC ATCCCCAACG ACGAGGGCAA GGGCTTCGAC
CGGATGTGCG ACCCGACCTA CACCGGCAAC GAGCGCAACG GCAACAACAT GACCGGCTCC
CTGGCCGACG CCCCGCTCTC CGGCCAGTGG TTCTCGGCGC AGTTCCGTGA GCTGCTCAAG
AACGCCTACC CCGCCCTGCC GTAA
 
Protein sequence
MRVHPRRQAW RNRAVTTAAL LVAAAGLATG QAATAHAAVS CDVAYSTNEW QGGFTASVTV 
KNLGDALAGW TLGFAFPGTQ QVQQGWSATW SQTGNQVTAK NLDWNGNLAT GASTSIGFNG
SWSGSNPKPT AFTVNGVTCG GQPPVNQAPT VSLTSPASGA SIPAGSAVPL AATAGDDGAV
SKVEFYVDGA LVNTDTSAPY GYSATGVAAG SHTARAKAYD NGTPVLSAET AEVPFTVGSD
GGTAAVVASA TSVSVPEGGS KTVGFRLSKA PSGNVTVNLT KTGDADLTIA PSTLTFTPAN
WNTAQNVTVS AAQDADQTDG TATIAAAATG HTGVSVTATE SDDDVVTQPG EHVENPYAGA
TGYVNPDWAA KAAAEPGGDA VADISTGVWL DRIAAIEGTA SARGLRAHLE EALRQDAANG
SKPLTIQFVI YNLPNRDCSA LASNGELLIA QNGLNRYKTE YIDPIAAIMA EPRYATLRIS
TVIEIDSLPN LITNLNVPKC QEAKSSGAYV DGVRYALNKL HAIKNVYTYI DAAHHGWLGW
DTNFGPSADL FASTVAGTTA GFDSVDGFIT NTANYSALKE PHFTINTTVN GTTVRQSRWL
DWNFYVDELS YAQAFRTLLI QKGFKPGLGM LIDTSRNGWG GTARPAAPST STDVNTFVNQ
SRVDRRIHAG NWCNQSGAGL GERPQASPAA GLDAYVWIKP PGESDGASKL IPNDEGKGFD
RMCDPTYTGN ERNGNNMTGS LADAPLSGQW FSAQFRELLK NAYPALP