Gene Sros_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3761 
Symbol 
ID8667051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4173990 
End bp4176101 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content72% 
IMG OID 
Productpara-aminobenzoate synthase 
Protein accessionYP_003339426 
Protein GI271965230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0331515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.466461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCACGC TGATCATCGA CAACTACGAC TCCTACACCT ACAACCTGTT CCAGCTCATC 
GCGGAGACCT ACGGGGTCGA GCCGGTCGTC CTGGCCAACG ACGACGACAG CTGGCACAGG
ATGGACCTGG ACCGGTTCGA CGCGCTGATC ATCTCCCCCG GCCCCGGGCG TCCGCAGGTG
GAGCGGGACA TCGGGGCGTC CCTCGACGTG ATCCGCGAGT CCGGGCTGCC GGTGCTGGGG
GTGTGCCTGG GCCATCAGGC ACTCGGCTGG CTGTCGGGCG CCGACGTGGT GCCGGCCCCC
GCCCCGATCC ACGGGTACGT CGAGGAGATC CGGCACTGCG GGCGCGACCT GTTCCACGGC
CTGCCGCAGG GCTTCAACGC CGTGCGCTAC CACTCCCTGT GCGTGGCCAC CCCCGCCCCG
GACGACCTGG AGATCACGGC GTGGGCCGGC GACTCCGTCG TCATGGGGCT GCGGCACCGG
ACGATGCCGT GGTGGGGCGT GCAGTTCCAC CCCGAGTCGA TCTCGACGGA GTACGGCCCG
GCGCTGCTGC GCAACTTCCG CGAGCTGGCC CTGTCGGCGG GACGGAAGGA CGGCGGCGGG
GCCTGCCCGA CCCGGAGTCC CCGATGGTCG CTGGAGGTGG AGCGTGTCCC CGGCGCCGCC
GACGCCGAGA CGGTGTTCGC CGAGCTGTTC GGCGCGCAGC CGTACGCGTT CTGGCTGGAC
AGCAGCCGCA CCGGCCAGGG GGCAGCGCGG TTCTCCTTCC TCGGTGACGC GGGCGGCCCC
CACGGGGAGG TGCTGTCCTG CGGCGCCGGC TCGGGAAGCG TGCAGGTCCA CGACGCCTCG
GGCATCGGCA CGGCGATCTA CAACTCGATC TTCGACGTGC TGGACGACCG CGTGCGCTCG
CGCGCCGTGG CGGCCGACCC CGCGCTGCCG TTCGATCTGA ACAGCGGCTA CGTGGGGCAC
TTCGGCTACG AGCTGAAGCG CGACTGCGGC GCGGCCTCGC CCCATGTCTC CCCGCTCCCC
GACGCGGTCT GGATGTCGGC CACGCGCTTC GTCGCGATCG ACCACGAGAC GGAGGAGACG
TGGGCGGTGG CGCTCACCGA CGGCCGGCCC CGCAGCAGGA CCCTGGCCAG GCGCTGGGTG
TCCGGGGCGG CCGAGGCCGC CGAGGCGGGC CGGCGGCCGG TCGATGACCC CGTGGTGGCG
GACGGGCCGC CGGACGGGGA GGAGTTCGAT CCGGAGCCGT GGCTGGTGCG CTCGCGCGAG
GGCTACCTCG CCGACATCGA GGAGTGCCTG CGGCTGCTGC GGGCCGGGGA GAGCTACGAG
ATCTGCCTGA CCAACGTGGC GGAGATCCCG TTCGACGGCT CGGCGCACGC GCTCTACCTG
CGCCAGCGCC GTCTCAACCC CGCCCCGTAC GCGGCCTACC TGCGGCTGGG CGACCACGAG
GTACTGTGCT CGTCCCCCGA ACGTTTCCTC AAGGTCGGCA GGGACGGGAT CGCGGAGTCC
AAACCGATCA AGGGCACCGC CCCGCGCCAT CCCGAGCCGC GCAGCGACGA GGCGCTCAGG
GACGAGCTGG CGTCGTCGGA CAAGACCCGC GCGGAGAACC TGATGATAGT CGACCTGCTC
CGCAACGATC TCGGCAGGAT CTGCGACATC GGCACGGTCG ACGTGCCGCG GTTCATGGCC
GTGGAGTCCT ACACCACGGT CCACCAGCTG GTCTCCACGG TGCGCGGGCG GCTGCACGAG
GACATGAGCG CCGTGCTGGC CGCGCGCGCC TGCTTCCCCG GGGGGTCGAT GACCGGCGCG
CCCAAGCTGC GCACGATGGA GATCATCGAC AGGCTGGAGG GCAGGGCGCG GGGCGTCTAC
TCCGGGACGA TCGGCTTCTT CGGGCTCAAC GGCACCGCGG ACCTCAACAT CGTCATACGG
ACCGCGATCG CCCACTCCGG CAGGCTCACG GTCGGCGCGG GCGGCGCGAT CGTGCTGGAC
TCCGATCCCG TCGAGGAGTA CGAGGAGATG CTGCTGAAGG CCAGAGCTCC GCTGCGGGGC
CTGAACGGCC TCGCAGGGGC GCCCGTCAGC CGTGGAACGC CTGCCGCGGT TCGGGCAGCA
GACCCCGGTT GA
 
Protein sequence
MRTLIIDNYD SYTYNLFQLI AETYGVEPVV LANDDDSWHR MDLDRFDALI ISPGPGRPQV 
ERDIGASLDV IRESGLPVLG VCLGHQALGW LSGADVVPAP APIHGYVEEI RHCGRDLFHG
LPQGFNAVRY HSLCVATPAP DDLEITAWAG DSVVMGLRHR TMPWWGVQFH PESISTEYGP
ALLRNFRELA LSAGRKDGGG ACPTRSPRWS LEVERVPGAA DAETVFAELF GAQPYAFWLD
SSRTGQGAAR FSFLGDAGGP HGEVLSCGAG SGSVQVHDAS GIGTAIYNSI FDVLDDRVRS
RAVAADPALP FDLNSGYVGH FGYELKRDCG AASPHVSPLP DAVWMSATRF VAIDHETEET
WAVALTDGRP RSRTLARRWV SGAAEAAEAG RRPVDDPVVA DGPPDGEEFD PEPWLVRSRE
GYLADIEECL RLLRAGESYE ICLTNVAEIP FDGSAHALYL RQRRLNPAPY AAYLRLGDHE
VLCSSPERFL KVGRDGIAES KPIKGTAPRH PEPRSDEALR DELASSDKTR AENLMIVDLL
RNDLGRICDI GTVDVPRFMA VESYTTVHQL VSTVRGRLHE DMSAVLAARA CFPGGSMTGA
PKLRTMEIID RLEGRARGVY SGTIGFFGLN GTADLNIVIR TAIAHSGRLT VGAGGAIVLD
SDPVEEYEEM LLKARAPLRG LNGLAGAPVS RGTPAAVRAA DPG