Gene Rleg_6803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6803 
Symbol 
ID8022733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp243378 
End bp244886 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content60% 
IMG OID644833669 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_002984803 
Protein GI241666719 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.905258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGACGA ACGTTGTGGT TCACCGCGAT TTCCGCATCG CGACCATTGA TTCCAGGCTC 
TACAGTTCAT TTCTAGAGCA TCTCGGCAGG GCGATCTACG GGGGCATTTA TGAACCCGGT
CACCCGACGG CCGATGAGGA CGGATTCCGC CAGGATGTCC TTGATCTCGT CCGTGATCTC
GACACGCCCT ATTGCCGCTA TCCCGGCGGC AACTTCGTCT CGGCCTATAA TTGGGAAGAC
GGCGTCGGCC CGCGTGCCGA GCGCCCGGTG CGCCTCGACC TTGCCTGGCG CACCCGCGAA
GCCAACCAGA TCGGCGTCAA TGAATTCGTC GACTGGTGCA AGAAGGCAAA TACCAAGCCG
ATGCTCGCCG TCAATCTCGG ATCACGCGGC CTGGATGCGG CTCGCAATTT CCTCGAATAT
TGCAATCATC CCGGCGGCAC CTACTGGTCG GATCTGCGCC GCAAGCACGG CTGGTCTAAT
CCGCACGATG TCAAATTGTG GTGCCTCGGC AACGAGATGG ATGGCCCCTG GCAGGTCGGC
CACAAGTCGG CCTATGAATA TGGCCGGCTG GCGGATGAGA CGGCCAAAGC CATGCGCGGC
TTCGACAAGT CGCTTGAGCT CGTGGTCTGC GGCTCGTCCA ATTCCGACAT GAAGACTTAT
CCCGAGTGGG AAGCCCAGGT CCTCGAGCAG TGCTATGACA GCGCCGACCA TATCTCGCTG
CATATGTATT TTGCCAACCG CGAGAAAAAT ACCCTCAACT ATCTTGCCCG CGCGACGAAG
CTCGACCGCT ACATCACCAC GATCGGCGGC GTGATCGACT ACATCAAGGC GAAAAAACGC
TCGAAGAAGA CGATCGGCAT TTCCTTCGAC GAATGGAATG TCTGGTATCA TTCCAACCAG
CAGGACAAGG AGATCCTGGC GCGCGACGAA TGGCCGGATG CGCCGCATCT CTTGGAAGAC
ATCTATAATT TCGAAGACGT GCTGCAGGTC GGCGGCATCC TCAACACCTT CATCCGACGT
TCCGACCGGG TGCGCATCGC CTGCATCGCG CAGCTCGTCA ACGTCATTGC CCCGATCATG
ACCGAGGACG GCGGTGCGGC GTGGCGCCAG ACCATCTATT ACCCGTTCTA TTACGCCTCC
AGATATGGCC GTGGAACGGC ACTGCAGCTG GTCGTCGATG GCCCGACCTA TGACAGCGAC
GAGGAGAACG ACGTCCCCTA TCTCGACGTA TCGGCAGTCC ATTCCGAAGA CGGCAAGACG
CTGACCTTCT TTGCCGTCAA CCGCCATCCG AGCACGGCGC TCGATCTCGA TGTGCGACTG
GAAGGCTTCG GAAATGCCCG GGTGGTCGAG CAGGTGGAGA TGACCCACGG CGACCTGGAA
GCCGTCAATA CGGCGGTGCG GCCGAAGACG GTCGCCCCCG TCAACGTCGA GAGTGGAAAG
ATCGAGGATG GACGCCTGCG GGCGGCGCTG AAACCGTTCT CCTACAACGT CATCCGGCTG
TCGGTGTGA
 
Protein sequence
MKTNVVVHRD FRIATIDSRL YSSFLEHLGR AIYGGIYEPG HPTADEDGFR QDVLDLVRDL 
DTPYCRYPGG NFVSAYNWED GVGPRAERPV RLDLAWRTRE ANQIGVNEFV DWCKKANTKP
MLAVNLGSRG LDAARNFLEY CNHPGGTYWS DLRRKHGWSN PHDVKLWCLG NEMDGPWQVG
HKSAYEYGRL ADETAKAMRG FDKSLELVVC GSSNSDMKTY PEWEAQVLEQ CYDSADHISL
HMYFANREKN TLNYLARATK LDRYITTIGG VIDYIKAKKR SKKTIGISFD EWNVWYHSNQ
QDKEILARDE WPDAPHLLED IYNFEDVLQV GGILNTFIRR SDRVRIACIA QLVNVIAPIM
TEDGGAAWRQ TIYYPFYYAS RYGRGTALQL VVDGPTYDSD EENDVPYLDV SAVHSEDGKT
LTFFAVNRHP STALDLDVRL EGFGNARVVE QVEMTHGDLE AVNTAVRPKT VAPVNVESGK
IEDGRLRAAL KPFSYNVIRL SV