Gene Sros_2261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2261 
Symbol 
ID8665543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2443520 
End bp2445106 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content70% 
IMG OID 
ProductMalate synthase 
Protein accessionYP_003337986 
Protein GI271963790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0633899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCG TTGAGATCAC CGGCCCCTCG CAGGACCGGT CAGACGAGAT CCTCACGCCG 
GAGGCCCTGG CCTTCGTGGC CGCCCTCCAG CGCGAGTTCG GCGCCCGGCG CCTTGAGCTG
CTGGAGGCGC GCCGGGAACG CCAGGCGGAG CTGTCGGCGG GCGGCACCCT CGACTTCCTG
CCCGAGACCA AGCACGTGCG CGAGTCCGAG TGGCGGGTCG CCCCGCCCGC GCCCGGCCTG
GAGGACCGCC GCGTGGAGAT CACCGGCCCG GTGGACCGGA AGATGACGAT CAACGCGCTG
AACTCCGGCG CCAAGGTCTG GCTGGCCGAC TTCGAGGACG CCAACGCCCC CACCTGGGAA
AACACCATCA ACGGCCAGCT CAACCTGCGG GACGCGCTGG ACCGGACCAT CGACTTCTCG
GCCGGGGAGA AGCACTACGC GCTCAAGCCC GACGAGGAGC TCGCCACCAT CGTGGTCCGC
CCGCGCGGCT GGCACCTGGA GGAGAAGCAC CTCACCCTCG ACGGCGCCCC CTTCTCCGCC
TCGCTCGTCG ACTTCGGCCT CTACTTCTTC CACTCCGCGC GGCGGCAGAT CGCCAAGGGC
AGGGGCCCCT ACTTCTACCT GCCGAAGATG GAGTCGCACC TGGAGGCCCG CCTCTGGAAC
GACGTCTTCG TCAGGGCTCA GGACCTGCTC GGCATCCCGC AGGGCACGAT CCGCGCGACC
GTCCTGATCG AGACCTACCC CGCGGCGTTC GAGATGGAGG AGATCCTCCA CGAGCTCCGC
GAGCACTCCG CGGGTCTCAA CGCCGGACGC TGGGACTACC TGTTCAGCGT GATCAAGAAG
TTCCGCACCC GGGGCCGGGA GTTCCTGCTG CCCGAGCGGA ACGCGGTCAC CATGACCGCC
CCCTTCATGC GCGCCTACAC CGAGCTGCTG GTCCGCACCT GCCACAAGCG CGGCGCCCAC
GCCATCGGCG GCATGGCCGC GTTCATCCCC TCCCGCCGCG ACCCGGAGGT CAACAAGGTC
GCCCTGGAGA AGGTCACCGC CGACAAGACC CGCGAGTCCG GCGACGGCTT CGACGGCTCC
TGGGTGGCCC ACCCCGACCT GGTCCCGATC TGCCGCGACG TCTTCGACGG TGTCCTCGGC
GACCGGCCGA ACCAGCTCGA CCGCCTGCGA GAGGACGTCT CGGTCTCCGC CGCCGACCTG
CTGTCGGTCT CCGAGACCCC GGGCGACATC ACCGAGGCGG GCCTGCGCAA CAACGTGGAC
GTGGCCCTGC GCTACCTGGC CGCCTGGATG GGCGGGCTGG GCGCGGTGGC GATCCACAAC
CTGATGGAGG ACGCCGCCAC CGCCGAGATC TCCCGCTCCC AGATCTGGCA GTGGATCCAC
AACGACATCG AGCTCGCCGA CACCGGGGCC GTGGTCACGA AAGAGCTCGT CGAGCGGATC
ATCGACGAGG AGCTCGCCAA GATCAAGGCG GAGCCCGGCT ACGACGAGGC CCTCTACGCC
CAGGCCACCG CCCTGTTCAA GGAGGTCGCC CTCGACGACG ACTTCGCGGA GTTCCTCACC
CTTCCCGCCT ACGCGCGCAT GCCGTGA
 
Protein sequence
MDGVEITGPS QDRSDEILTP EALAFVAALQ REFGARRLEL LEARRERQAE LSAGGTLDFL 
PETKHVRESE WRVAPPAPGL EDRRVEITGP VDRKMTINAL NSGAKVWLAD FEDANAPTWE
NTINGQLNLR DALDRTIDFS AGEKHYALKP DEELATIVVR PRGWHLEEKH LTLDGAPFSA
SLVDFGLYFF HSARRQIAKG RGPYFYLPKM ESHLEARLWN DVFVRAQDLL GIPQGTIRAT
VLIETYPAAF EMEEILHELR EHSAGLNAGR WDYLFSVIKK FRTRGREFLL PERNAVTMTA
PFMRAYTELL VRTCHKRGAH AIGGMAAFIP SRRDPEVNKV ALEKVTADKT RESGDGFDGS
WVAHPDLVPI CRDVFDGVLG DRPNQLDRLR EDVSVSAADL LSVSETPGDI TEAGLRNNVD
VALRYLAAWM GGLGAVAIHN LMEDAATAEI SRSQIWQWIH NDIELADTGA VVTKELVERI
IDEELAKIKA EPGYDEALYA QATALFKEVA LDDDFAEFLT LPAYARMP