Gene Smed_1710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1710 
Symbol 
ID5322568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1787907 
End bp1789532 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content62% 
IMG OID640790648 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001327380 
Protein GI150396913 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.783601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CAATCCGCAA TCCGGTTCTG CCGGGCTTCA ATCCAGACCC ATCCATCTGC 
CGGGTCGGTG ACGATTATTA CATCGCGACC TCGACATTCG AATGGTATCC AGGCGTACAG
ATCCACCATT CGCGCGACCT CGTGAACTGG CGGCTGGTGC GTCGTCCGCT CGAGCGGGCA
AGCCAGCTCG ACATGCGCGG CAATCCCGAT AGCTGCGGCG TGTGGGCGCC GTGCCTCTCC
TATTGCGACG GGCTGTTCTG GCTGGTCTAT ACCGACGTCA AGCGCCTGGA CGGCAACTTC
AAGGATGCGC ACAACTATAT CGTGACGGCG GAGGCGGTGG AGGCCACCTG GTCCGACCCG
GTCTATGTGA ACTCGTCGGG TTTCGATCCG TCGCTCTTCC ACGATGATGA CGGCCGCAAG
TGGTTTCTCA ACATGCAGTG GAATCACCGC ACCGAAAGCT TCGGCGGCTC GCCCAAGAGC
CCCGCATTCG ACGGTATTCT GCTGCAGGAA TGGGACGAGC GGACCCGCAA GCTCGTGGGA
CCGGTGAAAA ACATCTTCGC CGGCAGTCCG CTCGGCCTTG TCGAAGGTCC GCATCTCTTC
AAACGCGACG GCTGGTATTA TCTCACGGTC GCGGAGGGAG GAACCGGTTA CGACCACGCC
GTTGGAATGG CACGATCCCG TACGATCGAT GGCCCCTATG AGATGCACCC GAACGTTCAT
CTCATCACTT CCAAGGATCA CCCGGAGGTG GCTCTGCAGC GTGCGGGGCA CGGCCAATAT
GTCGAGACGC CGGACGGGCA GGCCTATCAT ACGCATCTAT GCGGCCGTCC GCTGCCGCCG
CTGCGCCGCT GCACGCTGGG GCGCGAGACA GCGCTGCAGA AATGCGTCTG GCGGGAGGAT
GGCTGGCTCT ATCTGGAAAA TGGCGGTCCG GTGCCGGAGC TAACCGTGCC GGCGCCCGGA
GCCACTGGAA CCGTCGCTGA GCGCGTCGTG AACGAAACTG ACTTCGACGA GCTGGCGCTT
CCACCGGAGT TTCAGTGGCT GCGTACCCCT GTTCCCTCGC GCATTTTCTC GTTGACCGCC
CGGACCGGTC ATCTCCGGCT CTTCGGCCGC GAGAGCATCG GCAGCTGGTT CGAGCAGGCG
CTCGTCGCCC GGCGGCAGGA GCATCATTCT TTCCGCGCCG AAACAGTCAT CGACTTTGCG
CCCGATACAT ACCAGCAGGT GGCGGGGCTC ACGCATTACT ATAACCGGCA AAAGTTTCAT
GCTCTCGGCA TCACACATCA CGAGACGCTT GGGCGCGTTA TCACCATACT CTCCTGTCCG
GGGGATTTCC CGAACGGACG GTTGGCGTAC CCGATCGGGA GCGGTGTCGC CTTAGCGGAT
GGCCCCGTTC AGCTTGCCAT GGAGGTGAGG GACAACGATC TCCGGTTCTT CTGGCGGGGA
ACGGCTCAGG CCGAGTGGGC GGCCATCGGT CCTGTGCTCG ACGCCGGCGT CATTTCGGAC
GAAGGCGGCC GCGGCGAGCA TGGTTCCTTT ACCGGTGCGT TTACGGGAAT GTTCGCTTTC
GACATCTCGG GCAGAGCGAT CCCTGCGGAC TTCGACCGCT TCCGGTATCA GGCGCTCACC
GTATGA
 
Protein sequence
MTATIRNPVL PGFNPDPSIC RVGDDYYIAT STFEWYPGVQ IHHSRDLVNW RLVRRPLERA 
SQLDMRGNPD SCGVWAPCLS YCDGLFWLVY TDVKRLDGNF KDAHNYIVTA EAVEATWSDP
VYVNSSGFDP SLFHDDDGRK WFLNMQWNHR TESFGGSPKS PAFDGILLQE WDERTRKLVG
PVKNIFAGSP LGLVEGPHLF KRDGWYYLTV AEGGTGYDHA VGMARSRTID GPYEMHPNVH
LITSKDHPEV ALQRAGHGQY VETPDGQAYH THLCGRPLPP LRRCTLGRET ALQKCVWRED
GWLYLENGGP VPELTVPAPG ATGTVAERVV NETDFDELAL PPEFQWLRTP VPSRIFSLTA
RTGHLRLFGR ESIGSWFEQA LVARRQEHHS FRAETVIDFA PDTYQQVAGL THYYNRQKFH
ALGITHHETL GRVITILSCP GDFPNGRLAY PIGSGVALAD GPVQLAMEVR DNDLRFFWRG
TAQAEWAAIG PVLDAGVISD EGGRGEHGSF TGAFTGMFAF DISGRAIPAD FDRFRYQALT
V