Gene Sros_6217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6217 
Symbol 
ID8669522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6825633 
End bp6827114 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content71% 
IMG OID 
ProductL-arabinose isomerase 
Protein accessionYP_003341689 
Protein GI271967493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.620258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGGTCT GGTTTCTCAC CGGCAGTCAG GGGCTTTACG GCGAGGACAC GCTGCGCCAG 
GTGGCCGAGC AGTCCCAGCG GATCGCCGCG GCCCTGGACG AGGCGCTCCC CTTCGAGGTC
GAGTGGGAGC CGGTGCTCAC CGACGCCGCG GCGATCCGCA GGATGTGCCT GGAGGCGAAC
TCCTCCGACG AGTGCGTCGG GCTGATCGCG TGGATGCACA CCTTCTCCCC GGCCAAGATG
TGGATCGCCG GCCTGGACGC CCTGCGCAAG CCGCTGCTGC ACCTGCACAC CCAGGCCGAC
CTGGAACTGC CGTGGAGCTC CATCGACATG GACTTCATGA ACCTGAACCA GGCGGCGCAC
GGTGACCGCG AGTTCGGCTA CATCCAGGCC AGGCTCGGAG TGCCCCGCAA GACCGTGGCC
GGCCACGTGA GCGACCCCTC CGTGGCGGCG CGGATCGAGG CGTGGGCCAG GGCGGCGGCA
GGCCGGGCCG AGGTGGGCTC GCTCAGGCTG GCCAGGTTCG GCGACAACAT GCGCGACGTG
GCGGTGACCG AGGGCGACAA GGTCGAGGCC CAGCTCCGGT TCGGCGTGTC GGTCAACACC
TACGGCGTCA ACGACCTGGT GGCCGCGGTC GACGCCTCCT CCGACACCGA GGTCGCCACG
CTGGTCAAGG AGTACGAGGA CCTGTTCCAG GTCGCCCCCG AGCTGCGCGC CGGCGGCGAG
CGGCACGACT CCCTGCGCTA CGCGGCCCGG ATCGAGCTGG GCCTGCGCCA CTTCCTGGAG
GCGGGCGGGT TCAAGGCGTT CACCACCAAC TTCGAGGACC TCGGCGGGCT GCGGCAGCTA
CCGGGCCTGG CCGTGCAGCG CCTGATGGCC GACGGCTACG GCTTCGGTGG CGAGGGCGAC
TGGAAGACCT CGGTGCTGCT GCGCACGCTG AAGGTGATGT CGGCCGGACT GCCGGGCGGA
ACCTCGTTCA TGGAGGACTA CACCTACCAC CTGACGCCGG GACAGCAGCT CATCCTCGGC
GCGCACATGC TGGAGGTCTG CCCGACGATC GCCTCGGGCG TCCCGTCGTG CGAGATCCAC
CCGCTCGGCA TCGGCGGCCG GGAGGATCCG GTCCGGCTGG TGTTCGACGC CGAGCCCGGC
CCCGCCGTCG TCGTCGGCCT GGCCGACATG GGGGAGCGGT TCCGGCTGGT CGCCAACGAG
GTCGACGTGG TCGCCCCGGT CGAGCCGCTG CCGAACCTGC CCGTGGCCAG GGCGGTCTGG
AGGCCCCGGC CCGACCTGCG CACCTCGGCC GAGGCGTGGC TCACCGCCGG CGCCCCGCAC
CACACCGTCC TGTCGGCCGC GGTCGGCGCC GAGGAACTCA CCGACTTCGC CGACATGCTC
GGTGTCGAAC TGCTCGTCAT CGACGCCGAC ACCACGCCAC GCGGGTTCGC CAAGGAACTG
CGCTGGAACC AGGCCTACTA CCGCCTCGCC CAGGGATTCT GA
 
Protein sequence
MKVWFLTGSQ GLYGEDTLRQ VAEQSQRIAA ALDEALPFEV EWEPVLTDAA AIRRMCLEAN 
SSDECVGLIA WMHTFSPAKM WIAGLDALRK PLLHLHTQAD LELPWSSIDM DFMNLNQAAH
GDREFGYIQA RLGVPRKTVA GHVSDPSVAA RIEAWARAAA GRAEVGSLRL ARFGDNMRDV
AVTEGDKVEA QLRFGVSVNT YGVNDLVAAV DASSDTEVAT LVKEYEDLFQ VAPELRAGGE
RHDSLRYAAR IELGLRHFLE AGGFKAFTTN FEDLGGLRQL PGLAVQRLMA DGYGFGGEGD
WKTSVLLRTL KVMSAGLPGG TSFMEDYTYH LTPGQQLILG AHMLEVCPTI ASGVPSCEIH
PLGIGGREDP VRLVFDAEPG PAVVVGLADM GERFRLVANE VDVVAPVEPL PNLPVARAVW
RPRPDLRTSA EAWLTAGAPH HTVLSAAVGA EELTDFADML GVELLVIDAD TTPRGFAKEL
RWNQAYYRLA QGF