Gene Sros_7521 details


Gene Information

Locus tag: Sros_7521
Symbol:
ID: 8670842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8318834
End bp: 8320279
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Betaine-aldehyde dehydrogenase
Protein accession: YP_003342945
Protein GI: 271968749
COG category:
COG ID:
TIGRFAM ID:
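
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, although they agree with the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 8318834
END_BP = 8320279
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 8320279 - 8318834 + 1 gives 1446 bp, and 1446 / 3 - 1 gives 481 residues, matching the listed gene and protein lengths.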


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0747664
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC TGTTCATCGG AGGTGAGTGG GGCCCCGCGG AGGCGGGCGC CACCTTCACG 
ACGCTGAATC CGGCCGACGG CTCGGTGTTC GAGGAGTGCG CCGAGGCGGG CGCCCCCGAC
GTGGACCGCG CGGTCGCCGC GGCCCGCTCC GCGCTCGCCG ACCCCGCCTG GGCCGGCATG
ACCGCCGCCG CGCGGGCGCG GATGCTGTGG CGGGTGGCCG ACCTCGTCGA GGAGCACGCC
GACGAGCTCG CCGAGCTGGA GACGCGGGAC AACGGACAGC CGCTGGCGGT GGCCGGGGGC
GTCACGGTCC CCGGCGCGGC CGAGCACTTC CGCTACTTCG CGGGGTGGTG CACCAAGATC
GAGGGATCGG TGGTGCCGGT CTCCTTCCCC GACACCCTCC ACTACACGCG GCGCGAGCCG
GTGGGCGTCT GCGCGCTGAT CACGCCGTGG AACTTCCCGC TGATGCTGGC CGCGTGGAAG
CTCGCCCCGG CCCTGGCCTG CGGCAACACC GTGATCATCA AGCCCGCCGA GCAGACCCCG
CTGAGCACGG TCATGCTCGT CGAGCTGATG GAGAGGGCCG GCTTCCCGCC CGGCGTGGTC
AACCTGCTGA CCGGCGGCCC GGCCACCGGG GCCGCCCTCG CCGAGCACTC CGGCGTGGAC
AAGGTCTCCT TCACCGGATC CACCGAGGTC GGCCGCAAGC TCGTGCACGC CAGCGCGGGC
AACCTCAAGC GCCTCACCCT CGAACTGGGC GGCAAGACGC CGAGCATCAT CGCCGCCGAC
GCCGACATCG ACGCCGCCGT GGCGGGAAAC GTGCAGGGCG CGCTGTTCAA CAGCGGCCAG
GTCTGCGCCG CCTACGCCCG CTTCTACGTG GACCGCCGGC GGGCCGACGA GTTCACCGAG
AAGATGGCCG CCGCTGCCGC CGCGCTGGTC CTCGGACCGG GCCTGGACCC GGCCTCGCAG
CTCGGGCCGC TGGTCAGCGA GGAGCACCTC GCCAAGGTCG ACTCCCATGT CCGCGGCGCC
CGCGCCGAGG GCGCGGAACT CGTGACCGGC GGGCGCCGGG CGGGCGGGAG GCTGGCCGAG
GGATTCTTCT ACGAGCCGAC CGTCTTCGCC GGGGTCACCG ACGAGATGGC GATCGCCAGG
GAGGAGGTGT TCGGGCCGGT CATCCCGGTG CTCGCCTACG ACGACCCTGA TGAGATCGTG
GAGCGTGCCA ACGACTCGGC GTACGGCCTG GCGGCCTCCG TCTGGACCCG CGACCTGTCC
ACCGCACACC GCCTGGCGGC GAAGGTCAGG GCCGGAGCCG TATTCATCAA CATGATCCAT
GTCCCGGACG CCGCGACCAC CTGGGGTGGT TTCAAGGCCA GCGGCTGGGG CCGGGAGATG
GGACCGTACG CCATCGACGC CTATACAGAG GTCAAAGGCG TCTGGACGCA CCTGGGAGGG
GCGTGA
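
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of one way to strip the layout and compute a GC fraction. It uses only the first displayed row (60 bases) as sample input, so its result is not the whole-gene GC content listed in the record. The function names are hypothetical.

```python
# First displayed row of the gene sequence, copied with its block spacing.
FIRST_ROW = "ATGACCGATC TGTTCATCGG AGGTGAGTGG GGCCCCGCGG AGGCGGGCGC CACCTTCACG"


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


seq = clean_sequence(FIRST_ROW)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a comparison with the GC content field of the record.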
 
Protein sequence
MTDLFIGGEW GPAEAGATFT TLNPADGSVF EECAEAGAPD VDRAVAAARS ALADPAWAGM 
TAAARARMLW RVADLVEEHA DELAELETRD NGQPLAVAGG VTVPGAAEHF RYFAGWCTKI
EGSVVPVSFP DTLHYTRREP VGVCALITPW NFPLMLAAWK LAPALACGNT VIIKPAEQTP
LSTVMLVELM ERAGFPPGVV NLLTGGPATG AALAEHSGVD KVSFTGSTEV GRKLVHASAG
NLKRLTLELG GKTPSIIAAD ADIDAAVAGN VQGALFNSGQ VCAAYARFYV DRRRADEFTE
KMAAAAAALV LGPGLDPASQ LGPLVSEEHL AKVDSHVRGA RAEGAELVTG GRRAGGRLAE
GFFYEPTVFA GVTDEMAIAR EEVFGPVIPV LAYDDPDEIV ERANDSAYGL AASVWTRDLS
TAHRLAAKVR AGAVFINMIH VPDAATTWGG FKASGWGREM GPYAIDAYTE VKGVWTHLGG
A
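
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts). It is an illustration under that assumption, not the database's own procedure, and `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and its final 6 bases.
print(translate("ATGACCGATC TGTTCATCGG AGGTGAGTGG"))
print(translate("GCGTGA"))
```

The first call yields MTDLFIGGEW, which matches the first block of the protein sequence. The second yields A, which matches its final residue and shows the gene ending in a TGA stop codon.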