Gene Sros_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1947 
Symbol 
ID8665229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2087063 
End bp2088364 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID 
Productaminotransferase 
Protein accessionYP_003337678 
Protein GI271963482 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACC TTCTCGCACG CCACCGGGCG GTAATGCCCA ACTGGCTGGC CCTCTATTAC 
AACGAGCCCA TCGAGATCGT CGGCGGCAAG GGCAACCGGG TCGTCGACGC GTCGGGGAAG
AGCTATCTGG ACTTCTTCGC GGGCATCCTG ACCAACATGA TCGGGTACGA CGTGCCCGAG
GTGCGCGAGG CCGTCGAGCG CCAGCTCGCC ACCGGGGTCG TCCACACCTC CACGGTCTAC
CTGCTCCGAG GCCAGGTCGA GCTCGCCGAG AAGATCGCCA AGCTGTCCGG TATCAAGGAC
GCGAAGGTCT TCTTCACCAA CTCCGGCACC GAGGCCAACG AGACCGCGCT GCTGCTGGCG
ACCTACGCGC GCAAGTCCGA CCAGGTGCTC GCCATGCGGC AGAGCTACCA CGGCCGCTCC
TTCGGCGCGG TCAGCGTGAC CAGCAACCGC GCATGGAAGA ACAACTCGCT GTCGCCGCTG
AACGTGCACT TCCTGCACGG CGCCGACCGG CACCTCACGC AGTTCCGCGG GCTGTCGGAC
GCCGACTACA TCAAGGCGTG CGTCGACGAC CTGCGCCACG TGCTCTCCAC GTCCGTCTCC
GACGACGTGG CCGCGCTCAT CGCCGAGCCG ATCCAGGGTG TCGGCGGGTT CACCATGGCC
CCCGACGGGC TGTTCGCCGC CTACAAGGAG GTCCTGGACG AGCAGGGCAT CCTGTTCGTC
TCCGACGAGG TGCAGACCGG CTGGGGCCGT ACCGGCTCGG CCTTCTTCGG CATCCAGAAC
CACGGCGTCA CGCCGGACAT GATGACCTTC GCCAAGGGGC TGGGCAACGG CTTCGCGGTC
GGCGGCGTCG TGGCCCGCGG CGACCTGATG GACAACCTGC ACGCGGTCGG TCTCGCGACC
TTCGGCGGCA ACCCGATCTC GATGGCCGCC GCCAACGCGA CGCTGGACTA CGTCCTCGAC
CACGACCTGC AGGCGAACGC CGCCCGGACC GGTGCCCTGA TCATCGACGG GCTCCGCGAG
GCCGCGCCGC GCCTGCCGAT CGTCGGCGAC GTCCGCGGGA AGGGCCTGAT GTTCGCCGTC
GAGCTCGTCG ACCCGGCGAC CGGCGAGCCC TCCCCGGCAC TCGCGGCCCG GTTCATGGAG
GAGACCAAGA AGGCCGGTCT GCTCGCGGGC AAGGGCGGCC TGTACGGCAA CGTGCTCCGC
ATGGCCCCGC CGCTCACCCT GACGCCGGAC GAGGCCGCCG AGGGCCTCGG GATCATCGTC
AACGCACTCG AAGTCATCAA CGCCGAGGTG GTCCCGTCGT GA
 
Protein sequence
MSDLLARHRA VMPNWLALYY NEPIEIVGGK GNRVVDASGK SYLDFFAGIL TNMIGYDVPE 
VREAVERQLA TGVVHTSTVY LLRGQVELAE KIAKLSGIKD AKVFFTNSGT EANETALLLA
TYARKSDQVL AMRQSYHGRS FGAVSVTSNR AWKNNSLSPL NVHFLHGADR HLTQFRGLSD
ADYIKACVDD LRHVLSTSVS DDVAALIAEP IQGVGGFTMA PDGLFAAYKE VLDEQGILFV
SDEVQTGWGR TGSAFFGIQN HGVTPDMMTF AKGLGNGFAV GGVVARGDLM DNLHAVGLAT
FGGNPISMAA ANATLDYVLD HDLQANAART GALIIDGLRE AAPRLPIVGD VRGKGLMFAV
ELVDPATGEP SPALAARFME ETKKAGLLAG KGGLYGNVLR MAPPLTLTPD EAAEGLGIIV
NALEVINAEV VPS