Gene Sros_8038 details


Gene Information

Locus tag: Sros_8038
Symbol:
ID: 8671366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8847937
End bp: 8849514
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_003343436
Protein GI: 271969240
COG category:
COG ID:
TIGRFAM ID:
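
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(8847937, 8849514))  # (1578, 525)
```

Under these assumptions the result agrees with the listed gene length of 1578 bp and protein length of 525 aa.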


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACG ACCGCTTCCA TGTGTACGAC ACCACGCTCC GTGACGGTGC CCAGCAGGAG 
GGCCTCAACC TCACGGTGGT CGACAAGCTG GCCATCGCGC GCCACCTGGA CGGCCTCGGC
GTCGGGTTCA TCGAGGGCGG CTGGCCCGGT GCGAACCCGA AGGACACAGA GTTCTTCAGG
CGGGCTCAAA CAGAGCTCGA CCTGAAGCAC GCGCAGCTCG CCGCGTTCGG CGCGACCCGC
CGAGCCGGTG TGAAGGCAGC CGACGACCCG TTGGTGGCCG CCCTGCGCGA ATCCGGCGCG
CCGGTCGTGA CCCTTGTCGC CAAGAGTCAC GACCGGCACG TGGAGCTGGC CCTCCGCACG
ACTCTCCAGG AGAACCTCGC GATGATCCGC GACACGGTCT CCCACCTCCG TGCGGAGGGA
CAGCGGGTCT TCCTCGACGC CGAGCACTTC TTCGACGGCT ACGCCTCCAA CCCAGCCTAC
GCGCTGGAGG TCCTGCGCAC CGCCGCCGAG GCCGGAGCCT CGGTCATCGC CCTGTGCGAC
ACCAACGGCG GCATGCTCCC CGACGAGCTC GCCGAGGTCG TGCACGCGGC CGTGCAGACC
TCCGCCCGGG TCGGCATCCA CTGCCACGAC GACACCGGCT GCGCGGTGGC CAACACCCTG
GCCGCGGTGA AGGCGGGCGC CACCCACGTG CAGGGCTGCG CCAACGGGTA CGGCGAGCGG
TCCGGCAACG CCAACCTCTT CACCGTCGTG GCCAACCTCC AGCTCAAGCG GGGCTTCGAC
CTCGTCCCGC GGGAGGCGCT GGCCGACATG ACCCGGATCG CCCACGCGAT CACCGAGGTC
ACCAACGTCA CCCCGAACTC CCACGCCCCC TATGTGGGCG TCTCCGCCTT CGCGCACAAG
GCCGGGCTGC ACGCCAGCGC CATCAAGGTG GACCCCAACC TCTACCAGCA CATCGACCCG
GCGCGGGTCG GCAACGACAT GCGCATGCTC GTCTCCGACA TGGCCGGTCG CGCCTCCGTC
GAGCTCAAGG GCCGCGAGCT GGGCTACGAG CTGTCGCCGG AGATCTCGCG CGAGCTGGTC
AACCGGGTCA AGGACATGGA GTCCAAGGGC TACACCTTCG AGGCCGCCGA CGCCTCCTTC
GAGCTGCTGC TCCGCGACAC CGTGGCGGGC GAGCGCAGGC GCCACTTCGA GGTGGAGTCC
TGGCGGGTGA TCGTCGAGCG GACCCGCGGC GGCGAGCTGG TCAGCGAGGC CACGGTCAAG
CTGCACGCCA AGGGCGAGCG CATCGTCGCC ACCGGCGAGG GCAACGGCCC GGTCAACGCC
CTGGACAGGG CCGTCAGGCT GGGGCTGGAG AGGCTCTACC CCGAGCTGGC GGAGGTCGAG
CTGACCGACT TCAAGGTGCG GATCCTGGAG GGCACCCACG GCACCGACGC CATCACGCGC
GTCCTCATCA CCTCCAGCGA CGCGACCGGC GAGTGGGCCA CCGTCGGCGT CGACGAGAAC
ATCATCGAGG CCTCCTGGCA GGCCCTTGAG CAGGCCGTCA CCTATGGCCT GGTCCGGGCG
GGCCACGTCG CCGACTGA
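
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as displayed above.
fragment = "ATGCCCGACG ACCGCTTCCA"
print(len(clean_sequence(fragment)))  # 20
print(gc_percent(fragment))
```

Passing the full gene sequence text to `clean_sequence` should give a string whose length can be compared with the Gene Length field.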
 
Protein sequence
MPDDRFHVYD TTLRDGAQQE GLNLTVVDKL AIARHLDGLG VGFIEGGWPG ANPKDTEFFR 
RAQTELDLKH AQLAAFGATR RAGVKAADDP LVAALRESGA PVVTLVAKSH DRHVELALRT
TLQENLAMIR DTVSHLRAEG QRVFLDAEHF FDGYASNPAY ALEVLRTAAE AGASVIALCD
TNGGMLPDEL AEVVHAAVQT SARVGIHCHD DTGCAVANTL AAVKAGATHV QGCANGYGER
SGNANLFTVV ANLQLKRGFD LVPREALADM TRIAHAITEV TNVTPNSHAP YVGVSAFAHK
AGLHASAIKV DPNLYQHIDP ARVGNDMRML VSDMAGRASV ELKGRELGYE LSPEISRELV
NRVKDMESKG YTFEAADASF ELLLRDTVAG ERRRHFEVES WRVIVERTRG GELVSEATVK
LHAKGERIVA TGEGNGPVNA LDRAVRLGLE RLYPELAEVE LTDFKVRILE GTHGTDAITR
VLITSSDATG EWATVGVDEN IIEASWQALE QAVTYGLVRA GHVAD
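
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method: translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits, so the sketch uses the standard assignments and does not handle alternative start codons (this gene starts with ATG). The name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a whitespace-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and the last two, as displayed above.
print(translate("ATGCCCGACG ACCGCTTCCA TGTGTACGAC"))  # MPDDRFHVYD
print(translate("GGCCACGTCG CCGACTGA"))               # GHVAD
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence.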