Gene Rleg_6839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6839 
Symbol 
ID8022422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp284287 
End bp285513 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID644833705 
Productprotein of unknown function DUF1228 
Protein accessionYP_002984839 
Protein GI241666755 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0480421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.134894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAC ACAGCCCCAG CCCATATGTG GTCGTGCTCG CGGGATCGGC GTCCCTCGCC 
ATCGCCATGG GCGTCGGTCG GTTCGCATTC ACTCCAATTC TGCCGATGAT GCTACACGAC
GGCGTCGTTG ATCTTTCGCG AGCCGGAGGA CTCGCGACTG CCAACTACGT CGGCTACCTG
GTCGGTGCAT TGGCAGCGAT GGCAGTACCC AAGAGGTGGG ACCATACCTT CGTCATACGC
CTGACACTGG TGGCCACGGT TCTGCTTACG GCTCTGATGT CAGTGCCCTA TGCCGAGGCC
TGGGTCGCTC TTCGGTTCTT GGCGGGCGTC GCTTCGGCGA TCGGATTCGT CTTCACCTCC
GGTTGGTGTC TCGCCCAGCT TTCGGGAACC GGCAGCTCGA TCGGAAGCGC CATATTCACG
GGACCCGGCG CAGGCATCGC CGTGTCCGGG CTGGCCGCCA GCGGGATGAC CATTCTCGGG
CTGTCCGGCC ACACCGCCTG GCTGATATTC GCCGCGATCT CCGCCACGAT CAGCGGGATC
ATCTGGAAGA CCTTCGGCGA GAGCGCGAAG CCATCCGACG CCTATTCGGT GGGGGCGCCG
ACGCGCGCCT CCGGCAAGGT GCCGAAATCG GAAATGCCTC TATTTGCGAT CGCATACGGG
CTGGCCGGCT TCGGCTACAT CGTCACGGCG ACCTATCTTC CGGTGATCGC GAAGAACAGC
ATTCCTGGTT CACCCTTGCT CACCGTCTTC TGGCCGCTTT TCGGGGTCGC GGCAGTCGTC
GGATCGCTGC TGGCGGCGCG CGTTCCGCAT AGCGCCGACG TGCGGCTCCA TCTGATTGCC
GCATACCTCG TGCAGGCGGT CGGGGTGGGG CTGTCGGTTA TCTGGCAGGA CGCTTTCGGC
CTTGCACTCA GCAGCGTTCT CGTAGGCCTG CCGTTCACTG CGATCAGCTT CTTCGCCATG
AACGAAGTTA GGCGGATCAG ATCGAGCCAC CACGCGCGTT ACATGGGACT GCTGACGGCG
GTGTTCGCGA TCGGACAGAT CATGGGGCCG CCTGCCGTAG GAGTGATCAT GAGGCATGTG
GTGAACGTAG ACGCCGGGTT CGATCTTGCG CTTGCCGTCG CCAGCATCGC GCTCGTAGTC
GGCGCCGCGA TCTATGTTGC GATGATCCTG CTGTTTCCAA GCGAGCGGAA TGCGAGGACC
GCAGGGCGCT CGGTCCGTCC CACTTGA
 
Protein sequence
MSKHSPSPYV VVLAGSASLA IAMGVGRFAF TPILPMMLHD GVVDLSRAGG LATANYVGYL 
VGALAAMAVP KRWDHTFVIR LTLVATVLLT ALMSVPYAEA WVALRFLAGV ASAIGFVFTS
GWCLAQLSGT GSSIGSAIFT GPGAGIAVSG LAASGMTILG LSGHTAWLIF AAISATISGI
IWKTFGESAK PSDAYSVGAP TRASGKVPKS EMPLFAIAYG LAGFGYIVTA TYLPVIAKNS
IPGSPLLTVF WPLFGVAAVV GSLLAARVPH SADVRLHLIA AYLVQAVGVG LSVIWQDAFG
LALSSVLVGL PFTAISFFAM NEVRRIRSSH HARYMGLLTA VFAIGQIMGP PAVGVIMRHV
VNVDAGFDLA LAVASIALVV GAAIYVAMIL LFPSERNART AGRSVRPT