Gene Sros_2485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2485 
Symbol 
ID8665771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2709961 
End bp2711307 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content71% 
IMG OID 
Productalpha-N-arabinofuranosidase 
Protein accessionYP_003338204 
Protein GI271964008 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0107043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGACCA CCGCCCGTCT CACGCTGGAC CCCGCCTTCC GGATCGGCCC GGTGGACCCC 
CGGCTCTTCG GCTCGTTCGT CGAGCACATG GGCCGCTGCG TCTACACCGG CGTCTTCGAG
CCCGGCCATC CCCTGGCCGA CGCCGACGGT TTCCGCACCG ACGTACTGGA GCTGACCCGC
GAGCTCGGGG TGACGCTGGT CCGTTACCCC GGAGGCAACT TCGTCTCCAA CTACCGCTGG
GAGGACGGCG TCGGCCCGGT GGAGGACCGG CCGGCCCGGC TGGAACTGGC CTGGCGGAGC
CTGGAGGGCA ACAGCTTCGG GCTCAACGAG TTCATGGCCT GGGCCGCCAA GGCCGGGGTG
GAGCCGATGA TGGCGCTCAA CCTGGGCACC CGCGGCGTGG CCGAGGCGCT GGAGCTGGTG
GAGTACGCCA ACTATCCCGG GGGCACGCGC CTGTCCGAGC TGCGCCGCGC GCACGGCGCC
GACAAGCCGC ACGACGTGCG GCTGTGGTGC CTGGGCAACG AGCTGGACGG CCCCTGGCAG
ATGGGCCACA AGACCGCCGG GGAGTACGGC CGGCTCGCCG CCGAGACGGC GCGGGCGCTC
AAACGCTTCG ACCAGGGGCT GTCCCTGGTG GCCTGCGGCA GTTCCAACAG CGGCATGCCG
ACGTTCGGCG CGTGGGAGGC GGAGGTCCTG GAGGCGACCT ACGAGATGGT CGACTACGTC
TCGCTGCACG CCTACTACGA TCCGTCCGAC GGTGACGTCG ACTCCTTCCT GGCCAGCGGC
GCCGACATGG AGCACATGAT CCGTTCGATC GCCGCCACCG CCGACCACGT GGGCGCGAAG
CTGCGCAGCG ACAAGAAGAT CAAGCTCTCC TTCGACGAGT GGAACGTCTG GTACCAGAGC
CGTTTCAACG GAGAGTCCTC GCTGGAGTGG ACCGAGCACC CCCGGCTGAT CGAGGACTCC
TACGACGTCA CCGACGCGGT GGTGGTCGGC AGCCTGCTCA TCACCCTGCT GCGCAACGCC
GACCGGGTCG GCGTCGCCTG CCAGGCGCAG CTGGCCAACG TGATCGCCCC GATCAGGACG
GAGCCCGGCG GCCCCGCCTG GCGGCAGACC ATCTTCCATC CGTTCGCGCT GACCGCCAGG
CACGCCCGCG GCGAGGTGCT CCGGGTGGAG CCCGAGTGCG CCACGATCCC CACCGCCAAG
TACGGCGAGG CCCCCGCGAT CTGGGCGACC GCCACCCACG ACGCGGCGAC CGGCGCGGTG
ATCGCCAGTC TGCCGGTGAT CGTGCTCTAC CTCGTCGCCC AGCGCTGGGT GATCGAGGGA
ATCTCCCGCT CGGGGCTCAA GGGATGA
 
Protein sequence
MATTARLTLD PAFRIGPVDP RLFGSFVEHM GRCVYTGVFE PGHPLADADG FRTDVLELTR 
ELGVTLVRYP GGNFVSNYRW EDGVGPVEDR PARLELAWRS LEGNSFGLNE FMAWAAKAGV
EPMMALNLGT RGVAEALELV EYANYPGGTR LSELRRAHGA DKPHDVRLWC LGNELDGPWQ
MGHKTAGEYG RLAAETARAL KRFDQGLSLV ACGSSNSGMP TFGAWEAEVL EATYEMVDYV
SLHAYYDPSD GDVDSFLASG ADMEHMIRSI AATADHVGAK LRSDKKIKLS FDEWNVWYQS
RFNGESSLEW TEHPRLIEDS YDVTDAVVVG SLLITLLRNA DRVGVACQAQ LANVIAPIRT
EPGGPAWRQT IFHPFALTAR HARGEVLRVE PECATIPTAK YGEAPAIWAT ATHDAATGAV
IASLPVIVLY LVAQRWVIEG ISRSGLKG