Gene Sros_3288 details


Gene Information

Locus tag: Sros_3288
Symbol:
ID: 8666576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3582476
End bp: 3583807
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: 3-deoxy-7-phosphoheptulonate synthase
Protein accession: YP_003338970
Protein GI: 271964774
COG category:
COG ID:
TIGRFAM ID:
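
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3582476, 3583807)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both values agree with the Gene Length and Protein Length fields of the record.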


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.195174
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCA ACCTCGATTC CTGGCGGCAG CTGCCTGCGG CGCAGCAGCC CGAGTGGCCC 
GACCGCGGCG AGCTGGACAA GGTCGTCGCC GAACTGCAGG GGCTGCCGCC CCTGGTCTTC
GCAGGGGAGT GTGACAACCT CAAGGCGGAT CTGGCGGCGG TGGCGCGGGG CGAGGCCTTC
GTGCTGCAGG GCGGCGACTG CGCCGAGACG TTCGCCGGGG CCACCGCGGA CGACGTGCGC
AACAAGCTGA AGACGCTGCT GCAGATGGCG ATCGTGCTCA CCTACGCGGC GAAGGTACCG
GTCGTGAAGA TCGGCCGGAT GGCCGGGCAG TTCGCCAAGC CCCGTTCCAA GAACACCGAG
ACCCGGGACG GCGTGGAGCT GCCCGCCTAC CGGGGCGACA TGGTCAACGG CTTCGACTTC
ACCCCCGAGT CCCGCGTCCC CGACCCCTGG CGGCTGCTGC GCGCCTACCA CTCCTCCGCG
GTGACGCTGA ACCTGGCCCG CGCCTTCACC AAGGGCGGCT ACGCCGATCT GCGCCAGGTG
CACGCCTGGA ACCAGGACTT CGTGATCGAG TCCCCGGCCG GGAAGCGCTA CGAGCAGCTC
GCCCGGGAGA TCGACCAGGC GCTGGCGTTC ATGCGCGCCT GCGGGGCCGA GCCGGAGGAG
TTCCACAGCG TCGAGTTCTA CTCCTCGCAC GAGGCCCTGA TCCTCGACTA CGACCGCGCG
CTCACCAGGA TCGACTCGCG GACCGGCCAG CCGTACGACG TGTCGGCGCA CATGGTCTGG
ATCGGCGAGC GCACCCGCCA GCTCGACAGC GCGCACGTGG AGTTCTTCGC CCGGATCCGC
AACCCGATCG GCGTGAAGCT CGGCCCGACG ACCACGCCGG AGGACGCCCT CGCGCTGATC
GACAAGCTGA ACCCGGACAA CGAGGCCGGG CGGCTGACGT TCATCACCCG GATGGGCGCG
CCGAAGATCC GCGAGCACCT TCCCGCGCTG GTGGAGAAGG TCACCGCGAG CGGCGCCCAG
GTGGCGTGGA TCTGCGACCC CATGCACGGC AACACCTTCG AGGCGCCCAG CGGCCACAAG
ACCCGCCGCC TGGACGACGT GCTGAACGAG GTGGCGGGCT TCTTCGACGT CCACCGCGAC
CTCGGCACCC ACCCCGGCGG CATCCACATC GAGTTCACCG GTGACGACGT CACCGAGTGC
GTGGGCGGCG GCGCGGAGAT CGTCGAGGAC GACCTGGCCC TGCGCTACGA GACGGCGTGC
GACCCGCGCC TCAACCGGAG CCAGTCGCTG GACCTGGCCT TCCGGGTGGC GGAGCTCTAC
CGCTCGGTCT GA
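
A GC content figure like the one in the Gene Information fields is the fraction of G and C bases in a sequence. The helper below is a minimal sketch of that calculation, assuming the sequence is pasted in the space-separated 10-base blocks used here; `gc_content` is a hypothetical name, and the record does not say how its own value was computed or rounded.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_content("GTGAGCATCA"))  # 0.5
```

Applying the same function to the full gene sequence gives the gene-wide fraction.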
 
Protein sequence
MSINLDSWRQ LPAAQQPEWP DRGELDKVVA ELQGLPPLVF AGECDNLKAD LAAVARGEAF 
VLQGGDCAET FAGATADDVR NKLKTLLQMA IVLTYAAKVP VVKIGRMAGQ FAKPRSKNTE
TRDGVELPAY RGDMVNGFDF TPESRVPDPW RLLRAYHSSA VTLNLARAFT KGGYADLRQV
HAWNQDFVIE SPAGKRYEQL AREIDQALAF MRACGAEPEE FHSVEFYSSH EALILDYDRA
LTRIDSRTGQ PYDVSAHMVW IGERTRQLDS AHVEFFARIR NPIGVKLGPT TTPEDALALI
DKLNPDNEAG RLTFITRMGA PKIREHLPAL VEKVTASGAQ VAWICDPMHG NTFEAPSGHK
TRRLDDVLNE VAGFFDVHRD LGTHPGGIHI EFTGDDVTEC VGGGAEIVED DLALRYETAC
DPRLNRSQSL DLAFRVAELY RSV
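
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below shows one way to check this, assuming the standard amino-acid assignments (which translation table 11 shares) and treating an alternative start codon such as GTG as methionine in the first position. The start-codon set is a deliberately reduced, illustrative subset, and `translate_cds` is a hypothetical helper, not something provided by the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a simplified set of bacterial start codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as (optionally space-separated) DNA text."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAGCATCA ACCTCGATTC CTGGCGGCAG"))  # MSINLDSWRQ
```

The output matches the first ten residues of the protein sequence, and the final codons `CGCTCGGTCT GA` translate to `RSV` followed by a stop.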