Gene Smed_5583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5583 
Symbol 
ID5319885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp550099 
End bp551607 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content58% 
IMG OID640777330 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001314262 
Protein GI150377667 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.759858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.105411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGGCTC GGGTTACCGC GCATTCGCGC TATACAATCG CGGACATAGA TAAGCGGCTA 
TACGGATCGT TCCTGGAGCA CTTGGGTAGG GCTGTCTATA CAGGCATTTA CGAACCTGGC
CATCCGACGG CGCTCCCCAA CGGGATGCGC AAGGACGTGA TCGATCTCGT GCGGGAACTC
GATACGCCGA TTTGCCGCTA TCCAGGTGGC AATTTCGTGT CGGCCTACAA TTGGGAAGAC
GGCATTGGGC CGAAAGAGGG CCGTCCAACG CGCCTTGATC TCGCATGGCG CACCTCGGAA
TCTAACCAGG TCGGTATCCA TGAGTTCGCC GAATGGGCTG AGGCTGCTGG CACCGAAATG
ATGCTGGCGG TCAATCTGGG CTCGCGCGGC CTGGACGCCG CACGGAACTT CGTCGAGTAC
GTGAACCATC CCGGCGGCAG TCAGTGGTCG GACCTGCGAC GCCAGAACGG GCGCGCGGAG
CCTTGGAATG TCAAGCTGTG GTGCCTTGGC AACGAAATGG ACGGGCCGTG GCAAGTGGGT
CATAAAAGCG CCGCCGAGTA CGGCCACCTT GCAAACGAGA CCGCAAAGGC CATCCGCGCC
TTCGACGACA AGCTTGAACT TGTCGTTTGC GGATCATCCC ACTCCGATAT GGCGACCTAT
CCTCGGTGGG AGGCAACGGT CCTCGACGCG ACATACGACC AGGTCGACTA CATCTCCCTC
CACATGTATT TCGAGAACTA CGAGAAGAGA ACTTCAGAAT TTCTCGCTCT TCCGGAAAAG
CTCGATCGCT ACATCGGCAC GGTCAGTGGT GTTATCGATT TTGTGAAGGG CAGCAAACGG
TCCAATCGAG ACGTCAAGAT TTCCTTCGAC GAATGGAACG TCTGGTATCA CGAGCGCAAG
AACGACGCCA AGCGGATGCG AGAATGGAAT TGGCCTCATG CGCCGGTTCT CCTCGAAGAC
ATATATAACT TCGAGGATGT GCTCCAGGTC GGGTGCATCA TCAACACCTT CATCCGCCGC
TCGGACGTCG TGCGAGTCGC CTGTATAGCG CAGCTGGTCA ACGTGATCGC GCCGATCATG
ACCGAGCCAG GCGGTAGTTC CTGGCGTCAG ACGATCTACC ATCCCCTCCA TCTGGCATCT
CGCTACGGTC GAGGCACCGC GCTGCAACTC GACGTCGACT GTGCGACTTA TGTCTCCAAC
GTTTCCGAAG CTGTGTCCTA TCTCGATATC GCCGGCGTTC ATGACGCCGA CGCGGGCACT
CTCACCTTCT TTGCGGTCAA TCGCCACTCG TCCGAAGCGG CAAGCATCAA GCTCGCCGTG
GAGCGTTTCG GAACCATCAG GGGCGTCGAG CACACCCTGA TCAAGCACGA CGATCTCGAG
GCCCGGAATA CCAAGGACAA TCCGGACAAC GTCTCACCGC GCAAGACGTC CGATGCGGTC
ATGGACGGAA ACAGCGTCAG TGTTTCCGTT CCGCCATACT CGTATTCGAT GATCAGAATT
CGACTTTAA
 
Protein sequence
MEARVTAHSR YTIADIDKRL YGSFLEHLGR AVYTGIYEPG HPTALPNGMR KDVIDLVREL 
DTPICRYPGG NFVSAYNWED GIGPKEGRPT RLDLAWRTSE SNQVGIHEFA EWAEAAGTEM
MLAVNLGSRG LDAARNFVEY VNHPGGSQWS DLRRQNGRAE PWNVKLWCLG NEMDGPWQVG
HKSAAEYGHL ANETAKAIRA FDDKLELVVC GSSHSDMATY PRWEATVLDA TYDQVDYISL
HMYFENYEKR TSEFLALPEK LDRYIGTVSG VIDFVKGSKR SNRDVKISFD EWNVWYHERK
NDAKRMREWN WPHAPVLLED IYNFEDVLQV GCIINTFIRR SDVVRVACIA QLVNVIAPIM
TEPGGSSWRQ TIYHPLHLAS RYGRGTALQL DVDCATYVSN VSEAVSYLDI AGVHDADAGT
LTFFAVNRHS SEAASIKLAV ERFGTIRGVE HTLIKHDDLE ARNTKDNPDN VSPRKTSDAV
MDGNSVSVSV PPYSYSMIRI RL