Gene Sros_6407 details


Gene Information

Locus tag: Sros_6407
Symbol:
ID: 8669716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7014810
End bp: 7016156
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Cellobiohydrolase A (1 4-beta-cellobiosidase A)- like protein
Protein accession: YP_003341864
Protein GI: 271967668
COG category:
COG ID:
TIGRFAM ID:
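
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 7014810
end_bp = 7016156
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
expected_cds_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)            # 1347
print(expected_cds_length_bp)    # 1347
```

Both values agree with the listed gene length of 1347 bp.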


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0259803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAGAA GGTCGATTCT GGCCACGCTC TGCGCGGCCC TGGTGGTCGT GACCGGTGCC 
GCGGTGTCCG CGGCCTCCGG AGCGTCCGCG GCCGATTCCC CGTTCTACGT CGACCCCGAG
ACCAGTGCGG CCAAGTGGGT CGCGGCCAAT CCGGGGGACT CCCGCACGCC GGTCATCCGC
GACCGGATCG CCGCCGTCCC GCAGGCGCGC TGGTTCACCA CGACCAACAC CTCCGCGGTG
CGCGGGCAGG TGTCGGCGTT CGTCGGCGCC GCCGCGAGCG CCGGCAAGAC CCCCATCCTG
GTCGTCTACA ACATCCCCAA CCGGGACTGC AGCGGCGCGA GCACGGGTGG CGCGCCCACC
CACGCGGCCT ACCGGCAGTG GATCGACGAG CTCGCGGCCG GTCTTCAGGG ACGTCCCGCG
ACGATCGTCC TGGAGCCCGA CGTGCTCCCG ATCATGACCA ACTGCATGAG CTCCTCCCAG
CAGCAGGAGA CCAACGCCTC CATGGCGTAC GCGGGCAAGA GGCTGAAGGC CGGTTCGGCG
TCGGCGAAGG TCTACTTCGA CATCGGGCAC TCCGGATGGC TGTCCGCGTC CGAGGCCGGG
GCCCGGCTGA GGGCCGCGGA CGTCGCCAAC AGCGCCGACG GCATCTCCCT CAACGTCTCC
AACTACCGCT GGAGCTCCAC CGAGGTGGCG TACGCCAAGA GCGTCATCTC CGCCAGCGGC
GTGTCCCGGC TGCGCGCGGT GATCGACACC AGCCGCAACG GCAACGGCCC GCAGGGCGGC
GAGTGGTGTG ATCCGGGCGG CCGGGCGATC GGGACGTTGA GCACGACCGG CACCGGAGAC
TCGATGATCG ACGCGTTCCT CTGGATCAAG CTGCCCGGCG AGGCCGACGG CTGTATCGCC
GGCGCCGGGC AGTTCGTGCC GCAGCGGGCC TACGACCTGG CCATCGCGGC CCCGCCGCCC
ACCCCCACCC CCACCCCCAC CGTGACCCCC ACCCCGACCC CGACTCCCAC CGTCACCCCC
ACCCCGACCC CGACCGGCGG GAAGGCCTGC ACGGCCGCGT ACAAGCTGGT CGGCTCCTGG
CAGGGCGGCT TCCAGGCGGA GGTGACGGTG AAGAGCACCG GCGGCGCGGC CATCGCGGGC
TGGACGGTGA GCTGGTCCTT CCCGAACGGC CAGAGCGTCA CCCAGCTCTG GAACGGACGG
CACACCCAGA GCGGCGCCGA GGTCTCGGTA CGCAACGCCG ACCACAACGG CGCCCTCTCC
CCGGGTGCCT CGGCGTCCTT CGGCTTCACC GGCAACTGGT CCGGGACCAA CGGTGTGCCG
GCCTCGGCCG GCTGCGCCGC CGCCTGA
 
Protein sequence
MPRRSILATL CAALVVVTGA AVSAASGASA ADSPFYVDPE TSAAKWVAAN PGDSRTPVIR 
DRIAAVPQAR WFTTTNTSAV RGQVSAFVGA AASAGKTPIL VVYNIPNRDC SGASTGGAPT
HAAYRQWIDE LAAGLQGRPA TIVLEPDVLP IMTNCMSSSQ QQETNASMAY AGKRLKAGSA
SAKVYFDIGH SGWLSASEAG ARLRAADVAN SADGISLNVS NYRWSSTEVA YAKSVISASG
VSRLRAVIDT SRNGNGPQGG EWCDPGGRAI GTLSTTGTGD SMIDAFLWIK LPGEADGCIA
GAGQFVPQRA YDLAIAAPPP TPTPTPTVTP TPTPTPTVTP TPTPTGGKAC TAAYKLVGSW
QGGFQAEVTV KSTGGAAIAG WTVSWSFPNG QSVTQLWNGR HTQSGAEVSV RNADHNGALS
PGASASFGFT GNWSGTNGVP ASAGCAAA
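
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the first line of the gene sequence, so its GC fraction is not expected to match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGCCAAGAA GGTCGATTCT GGCCACGCTC TGCGCGGCCC TGGTGGTCGT GACCGGTGCC"
dna = clean_sequence(excerpt)

print(len(dna))                  # 60 bases in this excerpt
print(dna.startswith("ATG"))     # True: the CDS begins with an ATG start codon
print(f"{gc_fraction(dna):.2f}")
```

Pasting the full gene sequence into `excerpt` should give a length of 1347 and allow the listed GC content to be checked in the same way.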