Gene Sros_0430 details


Gene Information

Locus tag: Sros_0430
Symbol:
ID: 8663698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 432297
End bp: 433724
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative L-arabinose isomerase protein
Protein accession: YP_003336201
Protein GI: 271962005
COG category:
COG ID:
TIGRFAM ID:
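
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are inclusive and that the gene length includes the stop codon; neither is stated in the record, although the numbers are consistent with both. The function name `check_record` is hypothetical and only serves this illustration.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record itself):
    - start/end coordinates are inclusive;
    - the gene length counts the terminal stop codon.
    """
    # Inclusive coordinates span end - start + 1 bases.
    span = end_bp - start_bp + 1
    # An unspliced CDS should consist of a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # The last codon is the stop, which is not part of the protein.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_record(432297, 433724, 1428, 475)
print(result)
```

For this record all three checks pass: 433724 - 432297 + 1 = 1428 bp, and 1428 / 3 = 476 codons, that is 475 amino acids plus one stop codon.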


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCCG AGCTGTCCAG GATCACCGCT CCCCGTACCA AGGTGGGGCT CGTCGCCGGA 
GGCCTCGGAG CCTACTGGCC GCAGTTCCCC GACCTGCTGC CGCAGCTGCG GCATTCCTCC
GAGCGCGTCT CGGAGCGGAT GCGGGCCCTC GGCTGCGAGG TGGTGGACGT CGGTTTCATC
TCCGACGCCC AGGAGGGCGC CGTCGCGGCC GAGAAGCTCC GCGCGGCCGG CTGCGACATC
ATCGTCGGCT TCCTCACCAC CTACATGACC GCCACCATGC TGCTCCCGGT CGCGCAGCGC
AGCGGTGCGC CGGTGCTGCT GATCAACCTG CAGCCGACCG AGTCGATGGA CCACGCCACC
TTCGACACCG GCCAGTGGCT GGCCTACTGC GGGGCCTGCC CGCTGCCGGA GATGGCCAAC
ACCTTCATCC GCTGCGGCAT CCCGTTCCGC TCGGTCTCCG GTTACCTGGA GGACGAGCGG
GCCTGGGCGA AGATCGGTCG GTGGGTCAAG GCCGCCGGGG TGCGGGCCGC GCTCCGGCGC
GGGCGGCACG GCCTGATGGG CCACCTCTAC CCGGGCATGC TCGACGTGGC CACCGACCTC
ACCCTGGTCC CGGCGAACCT CGGCGGCCAT GTCGAGGTGC TGGAGTTCGA CGACCTGCGG
GTGCGGGTGG AGAAGGTGAC CGACGCCGAG GTCACGGAGC GCATGGACCT GGCCCGGGAC
GTCTTCGAGC TGGCCGACAC GGTCAACGGC GACGACTTCT CCTGGGCCGC CCGGGTATCG
GTCGGCCTGG ACCGGCTCGT CGAGGACTTC GCCCTCGACA GCCTGGCCTA CTACCACCGG
GGCCTGGACG GCGAGATCCA CGAGCGGCTC GGCGCCGGGA TGATCCTCGG CGCGTCGCTG
CTCACCGCCC GCGGCGTCCC GTCGGCCGGG GAGTACGAGC TGCGCACCTC GCTGGCCATG
CTCATCATGG ACCGCCTCGG CGGGGGCGGC TCGTTCACCG AGCTCCAGGC GCTCGACTTC
GCCCGCGGGC ACGTCGAGAT GGGCCACGAC GGGCCCGCCC ACCTGGCCAT CAGCTCGAAA
CGGCCGTTGC TGCGCGGGCT GGGCGTCTAC CACGGCAAGC GCGGCTGGGG CGTGTCGGTG
GAGTTCGACG TCACGCACGG CCCGGTCACC GCGTTCGGCC TGCTGCACCG GCCGGACGGC
CGGTTCGGCT TCGTCGTCTC CGAGGGCGAG GTCGTCGACG GGCCGCTGCT GCGGATCGGC
AACACCACCT CCCGGGTCGA CTTCGGCTGC GACCCCGGGG AGTGGACCGA CGCCTGGAGC
GCCACCGGCA TCTCCCACCA CTGGGCGCTG GGCACCGGCC ACCGCGCCGC CGAGCTGCGC
GCCGTGGCCG ACCTCCTCGG CGCCGACCTC ATCGAGGTGA AGCCGTGA
 
Protein sequence
MRAELSRITA PRTKVGLVAG GLGAYWPQFP DLLPQLRHSS ERVSERMRAL GCEVVDVGFI 
SDAQEGAVAA EKLRAAGCDI IVGFLTTYMT ATMLLPVAQR SGAPVLLINL QPTESMDHAT
FDTGQWLAYC GACPLPEMAN TFIRCGIPFR SVSGYLEDER AWAKIGRWVK AAGVRAALRR
GRHGLMGHLY PGMLDVATDL TLVPANLGGH VEVLEFDDLR VRVEKVTDAE VTERMDLARD
VFELADTVNG DDFSWAARVS VGLDRLVEDF ALDSLAYYHR GLDGEIHERL GAGMILGASL
LTARGVPSAG EYELRTSLAM LIMDRLGGGG SFTELQALDF ARGHVEMGHD GPAHLAISSK
RPLLRGLGVY HGKRGWGVSV EFDVTHGPVT AFGLLHRPDG RFGFVVSEGE VVDGPLLRIG
NTTSRVDFGC DPGEWTDAWS ATGISHHWAL GTGHRAAELR AVADLLGADL IEVKP
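
The relationship between the gene and protein sequences above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. Because this gene starts with ATG, the sketch makes no attempt to handle alternative starts. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the first codon is read as a normal codon, so an
    alternative start such as GTG would not be converted to M.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing above.
first_line = "ATGCGTGCCG AGCTGTCCAG GATCACCGCT CCCCGTACCA AGGTGGGGCT CGTCGCCGGA"
last_line = "GCCGTGGCCG ACCTCCTCGG CGCCGACCTC ATCGAGGTGA AGCCGTGA"

print(translate(first_line))  # MRAELSRITAPRTKVGLVAG
print(translate(last_line))   # AVADLLGADLIEVKP
```

The two fragments translate to the first 20 and last 15 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.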