Gene Information |
Locus tag | Sros_6407 |
Symbol | |
ID | 8669716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 7014810 |
End bp | 7016156 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Cellobiohydrolase A (1,4-beta-cellobiosidase A)-like protein |
Protein accession | YP_003341864 |
Protein GI | 271967668 |
COG category | |
COG ID | |
TIGRFAM ID | |
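The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 7014810
END_BP = 7016156


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1347 448
```

Both results agree with the Gene Length (1347 bp) and Protein Length (448 aa) rows, which supports the inclusive-coordinate assumption.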
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0259803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAGAA GGTCGATTCT GGCCACGCTC TGCGCGGCCC TGGTGGTCGT GACCGGTGCC GCGGTGTCCG CGGCCTCCGG AGCGTCCGCG GCCGATTCCC CGTTCTACGT CGACCCCGAG ACCAGTGCGG CCAAGTGGGT CGCGGCCAAT CCGGGGGACT CCCGCACGCC GGTCATCCGC GACCGGATCG CCGCCGTCCC GCAGGCGCGC TGGTTCACCA CGACCAACAC CTCCGCGGTG CGCGGGCAGG TGTCGGCGTT CGTCGGCGCC GCCGCGAGCG CCGGCAAGAC CCCCATCCTG GTCGTCTACA ACATCCCCAA CCGGGACTGC AGCGGCGCGA GCACGGGTGG CGCGCCCACC CACGCGGCCT ACCGGCAGTG GATCGACGAG CTCGCGGCCG GTCTTCAGGG ACGTCCCGCG ACGATCGTCC TGGAGCCCGA CGTGCTCCCG ATCATGACCA ACTGCATGAG CTCCTCCCAG CAGCAGGAGA CCAACGCCTC CATGGCGTAC GCGGGCAAGA GGCTGAAGGC CGGTTCGGCG TCGGCGAAGG TCTACTTCGA CATCGGGCAC TCCGGATGGC TGTCCGCGTC CGAGGCCGGG GCCCGGCTGA GGGCCGCGGA CGTCGCCAAC AGCGCCGACG GCATCTCCCT CAACGTCTCC AACTACCGCT GGAGCTCCAC CGAGGTGGCG TACGCCAAGA GCGTCATCTC CGCCAGCGGC GTGTCCCGGC TGCGCGCGGT GATCGACACC AGCCGCAACG GCAACGGCCC GCAGGGCGGC GAGTGGTGTG ATCCGGGCGG CCGGGCGATC GGGACGTTGA GCACGACCGG CACCGGAGAC TCGATGATCG ACGCGTTCCT CTGGATCAAG CTGCCCGGCG AGGCCGACGG CTGTATCGCC GGCGCCGGGC AGTTCGTGCC GCAGCGGGCC TACGACCTGG CCATCGCGGC CCCGCCGCCC ACCCCCACCC CCACCCCCAC CGTGACCCCC ACCCCGACCC CGACTCCCAC CGTCACCCCC ACCCCGACCC CGACCGGCGG GAAGGCCTGC ACGGCCGCGT ACAAGCTGGT CGGCTCCTGG CAGGGCGGCT TCCAGGCGGA GGTGACGGTG AAGAGCACCG GCGGCGCGGC CATCGCGGGC TGGACGGTGA GCTGGTCCTT CCCGAACGGC CAGAGCGTCA CCCAGCTCTG GAACGGACGG CACACCCAGA GCGGCGCCGA GGTCTCGGTA CGCAACGCCG ACCACAACGG CGCCCTCTCC CCGGGTGCCT CGGCGTCCTT CGGCTTCACC GGCAACTGGT CCGGGACCAA CGGTGTGCCG GCCTCGGCCG GCTGCGCCGC CGCCTGA
|
Protein sequence | MPRRSILATL CAALVVVTGA AVSAASGASA ADSPFYVDPE TSAAKWVAAN PGDSRTPVIR DRIAAVPQAR WFTTTNTSAV RGQVSAFVGA AASAGKTPIL VVYNIPNRDC SGASTGGAPT HAAYRQWIDE LAAGLQGRPA TIVLEPDVLP IMTNCMSSSQ QQETNASMAY AGKRLKAGSA SAKVYFDIGH SGWLSASEAG ARLRAADVAN SADGISLNVS NYRWSSTEVA YAKSVISASG VSRLRAVIDT SRNGNGPQGG EWCDPGGRAI GTLSTTGTGD SMIDAFLWIK LPGEADGCIA GAGQFVPQRA YDLAIAAPPP TPTPTPTVTP TPTPTPTVTP TPTPTGGKAC TAAYKLVGSW QGGFQAEVTV KSTGGAAIAG WTVSWSFPNG QSVTQLWNGR HTQSGAEVSV RNADHNGALS PGASASFGFT GNWSGTNGVP ASAGCAAA
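The sequences above are printed in space-separated groups of ten characters. A minimal sketch for working with them follows: it strips the grouping, computes a GC fraction, and translates codons. It assumes the gene sequence is given in the coding (sense) orientation, which is consistent with it starting with ATG even though the gene lies on the minus strand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignment string is used here. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# These assignments are shared by NCBI translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First two printed groups of the gene sequence, plus one extra base
# so that the last codon (CTG) is complete.
print(translate("ATGCCAAGAA GGTCGATTCT G"))  # MPRRSIL
# Last nine bases of the gene sequence.
print(translate("GCCGCCTGA"))  # AA*
```

The translated prefix and suffix match the start (MPRRSIL...) and end (...AA) of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content row.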
|
| |