Gene Sros_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2801 
Symbol 
ID8666087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3040836 
End bp3042266 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content68% 
IMG OID 
Productamino acid permease 
Protein accessionYP_003338502 
Protein GI271964306 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.764757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGATT CCTTTGGCGA TCGTCCACGC GGACGCACCG CCGCCGAAGA AGGAGTGATC 
CGCGACGACG CGGCCGAGGC CATAGCGTCC GAACACGGCA AACTCGGCCT CCCCGCGGCG
ACGTCACTCG TCGTCGGCAA CATCGTGGGC ACGGGGGTGT TCCTGCTGCC CGCCTCCCTC
GCCGCCTACG GCACGGTCAG CATCCTGGCG ATGGCCCTGG TGTCGATCGG CGCCATCGCG
CTCGCCGTCG TGTTCGGCAG GCTCGGCGCG CGGGTGCCCG CGGGCGGCGG GCCGTACGCC
TACGCCAAGG ACGCCTTCGG CGAGTTCCCC GGCTTCTGGA ACGCCTGGTC GTTCTGGCTG
ACCGCCTGGA TCGGCAACGC CGCGATCGCC GTCGTCTGGG TCAACTACGT CAACTACTTC
CTGCACTGGG ACTCCGCCGT CGCCCAGACC GCCCTGGCCT TCGTCGCCCT GTGGATCCCC
GCGCTGATCA ACCTGAGCGG CGTGCGGAAC ATCGGCGCCT TCACCCTCGT CACGACGGTG
CTGAAGTTCA TCCCGCTGAT CTTCGTCGCG GTGGTCGGCC TGTTCTTCGT CCGGAGCGCC
AACTTCGGCC CGTTCAACGC CACCGACGGC AACTGGATCG GCGCCGTGTC CACCGCCGGC
GCGCTCGCGC TGTTCATCTA CTCCGGCGTC GAGAGCGTCA CCATCGTGGC GGAGAAGATC
AAGAATCCGG CGCGCAACAT CGGCAGGGCC AGCGTGTACG GCGTGCTGAT CTGCGCCGCC
ATGTACATGC TCAGCACCGT CGCCATCTTC GGCACCGTCC CGCACGACGC CCTCGTCAAC
TCCCCCGCCC CGTTCGCCGA CGCGATCAAC AACATGTTCG GCGGCGGCAT CGGCGGCGGC
ATCATGGCGG CCTGCGCGGT CGTCTCCGGA ATCGGCGCCA TCAACGGCTG GACCATGCTC
GTGGCCGAGA TGCCGATGGC CGCGGCCAGG GACGGCCTGT TCCCGGAGAT CTTCACCAGG
GAGAACCGCC GCGGCGCCCC GTGGGTGGGC ATCGTCCTGG GCACCGCGCT GACCTCGCTG
GTCGCGGTCT ACAACTACTT CGGCACCACC GAGGGCTTCA ACAAGATCTT GCTGATCGCC
ACCTTCACCA CGGTCATCCC CTACTTCTTC TCCGCGTGCG CCCAGCTGTT CTGGCTGGTC
ACCGGGGCCA GAAAGGTCCG CGGAGCCCGC CTGGGCCGCG ACCTGACCAT CACCGCCGTG
GCCATCCTGT TCGCCTTCTG GATGACCTAC GGCGCCGGCA TGGAGGCAGT CTTCATCGGC
TTCCTGATGA TGCTCGTGGG CATCCCGGTC TACATCTGGA CCAAGGCGAA GCGTGGCGAG
TACGGCACCA GGGAGGGAGC ACCGGCCTCA CCCCCCGGAC GATCTCGTTA A
 
Protein sequence
MGDSFGDRPR GRTAAEEGVI RDDAAEAIAS EHGKLGLPAA TSLVVGNIVG TGVFLLPASL 
AAYGTVSILA MALVSIGAIA LAVVFGRLGA RVPAGGGPYA YAKDAFGEFP GFWNAWSFWL
TAWIGNAAIA VVWVNYVNYF LHWDSAVAQT ALAFVALWIP ALINLSGVRN IGAFTLVTTV
LKFIPLIFVA VVGLFFVRSA NFGPFNATDG NWIGAVSTAG ALALFIYSGV ESVTIVAEKI
KNPARNIGRA SVYGVLICAA MYMLSTVAIF GTVPHDALVN SPAPFADAIN NMFGGGIGGG
IMAACAVVSG IGAINGWTML VAEMPMAAAR DGLFPEIFTR ENRRGAPWVG IVLGTALTSL
VAVYNYFGTT EGFNKILLIA TFTTVIPYFF SACAQLFWLV TGARKVRGAR LGRDLTITAV
AILFAFWMTY GAGMEAVFIG FLMMLVGIPV YIWTKAKRGE YGTREGAPAS PPGRSR