Gene Sros_5222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5222 
Symbol 
ID8668516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5740059 
End bp5741543 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content67% 
IMG OID 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_003340736 
Protein GI271966540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.938507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.927599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTACT CCATAGACAC GTTGCTCATC GACGGGCGCC GGGTGCCGGC CGAGAGCGGT 
GAAGTGCTCA CAGTCGTCAA CCCGGCGACC GGTGAGGTCT TCGCCACCGC GCCGTCGGCT
TCCAGGCGCG ACGCCGTGGC CGCCATCACC GCCGCCCGCC GTGCCTTCGA GGAGGGGCGG
TGGCCACGGC TGTCATCGGC CGAACGCGGC CGCCTTCTCA AGCGGCTGGC CCTGGCCATG
GACCGGCGCC GTGAGGAACT GATCGATCTG GTGATCAAGG AGAGCGGATT CCCGAGGGCA
GTGGCCGACC GGGTCCACCT GCAGTCCTCC ATCGACTTCC TCTGCGACCT GTCCGACCGC
CTGCTTCCCG GGCTGGCGTT CACGACACCG TTGAACCCGC ACACCGGGAT CTCGATGACC
GGCCGGCCCC AGACGACGCA GGGCATGGTG GTCAAGGAGC CGATCGGGGT GGCGGCGTTG
ATCACGCCGT TCAACGCCGC GGTTCCTCTG ACCGTCCACA AGCTCGGCTG GGCGCTGGCC
GCCGGCTGCA CCACGGTGGT CAGGCCTTCC CCCTACACTC CCCTGCAGGT CCTGTTCCTG
GCCGACCTCA TCGAGGAAGC CGGCTTCCCG CCCGGCGTGG TCAACATCAT CACCGGTGAC
CTGGACGCCG GCCTCGAGAT GACCACCCAT CCCGACGTCG ACATCATCAG CTTCACCGGC
TCCGACGCGG TCGGCCGCAG GATCATGGCC CAGGCCGCGC CCACGCTGAA GAAGGTGGTC
TTGGAACTCG GCGGTAAGTC CGCCAACATC GTCTTCGCGG ACGCCGATCT GGATCGAGCC
GCCCTGGAAG TGATGGGCAA CATCGTCTCC AACGCCGGCC AGGGCTGCCT TCTCCTCACC
CGGACCCTGG TCGAAGAGCC GGTCCACGAC GAACTGCTCA GCAAGGTGGT CGCCCTGCTG
AGCACTGTGA CCGTGGGCGA TCCGGCCGAC CCGGCCACGG TCATGGGTCC TCTCATCAGT
GCAAGGGAAC GGGACCGGGT GGAGGCGATG ATCCACCGAG GGGTGGCCGA GGGCGCCACC
CTCGCCTACG GTGGTGGCCG CCCAGCAGGA CTGGGCCGGG GTTTCTTCCT GGAGCCGACA
CTGTTCACCG ACGTGGACAA CTCCATGAGC ATCGCGCAGC AGGAGTTCTT CGGGCCGGTG
AACACGGTGA TCGGCTTCAA GGATGACGCC GAAGCCGTAC GCATCGCCAA CGACAGCGAT
TTCGGCCTCA ACGCCGGAAT CTTCACCCAG GACTTCGAAC GGGCCTATGC GACCGCCGCG
CGGATCCGCA GCGGTACGGT CAACATCAAC GCTTCTTGGG GCACCAACCC CGACGCCCCG
TTCGGCGGCT ACAAGCAGAG CGGCCTGGGC CGGGAGGGTG GCGCGTACGG GATCGCGGAG
TTCCTGGAGG AGAAGTTCGT CTCCTGGCCG GTAGGCCGCC TGTGA
 
Protein sequence
MTYSIDTLLI DGRRVPAESG EVLTVVNPAT GEVFATAPSA SRRDAVAAIT AARRAFEEGR 
WPRLSSAERG RLLKRLALAM DRRREELIDL VIKESGFPRA VADRVHLQSS IDFLCDLSDR
LLPGLAFTTP LNPHTGISMT GRPQTTQGMV VKEPIGVAAL ITPFNAAVPL TVHKLGWALA
AGCTTVVRPS PYTPLQVLFL ADLIEEAGFP PGVVNIITGD LDAGLEMTTH PDVDIISFTG
SDAVGRRIMA QAAPTLKKVV LELGGKSANI VFADADLDRA ALEVMGNIVS NAGQGCLLLT
RTLVEEPVHD ELLSKVVALL STVTVGDPAD PATVMGPLIS ARERDRVEAM IHRGVAEGAT
LAYGGGRPAG LGRGFFLEPT LFTDVDNSMS IAQQEFFGPV NTVIGFKDDA EAVRIANDSD
FGLNAGIFTQ DFERAYATAA RIRSGTVNIN ASWGTNPDAP FGGYKQSGLG REGGAYGIAE
FLEEKFVSWP VGRL