Gene Rleg2_4778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4778 
Symbol 
ID6977872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp411300 
End bp412874 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content62% 
IMG OID643393942 
ProductABC transporter related 
Protein accessionYP_002278760 
Protein GI209546842 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACG GCGTCTCAGC CGTCCGGATG ACCGGAATAT CGAAGGCCTT CGGCGGCGTT 
CGCGCGCTCG AAGGCGTCGA TTTCGAAGTC CGGCCAGGCG AGATCCACGC ACTTCTCGGC
GGCAATGGCG CGGGTAAATC GACGATCCTC AAGATCCTCA ACGGCGTGCA CAAGCCGGAT
AGGGGAAACA TCGAGGTCGC GGGACGAAAC CTGAGCGCCC ATACGCCGGA GGAGTCGCGT
GCGGCGGGGA TCGCGATGAA TTACCAGGAA ATGAGCCTGG TACCGACCTT GACGGTGGCG
CAGAACATCT TCCTGACGCG TGAAAGCCGC AACGGTATGG GCCTGATCGA CGATGGCGAG
GCCGAACGTA AGGCCGCCGA GCTGTTTGCG ATGCTGGAGG TTTCCGTCGA CCCGCGCGCC
ATTGTCGGCG ATATCGGTGC GGGGCAGAAG CAGCTGACGG AGATCGCCAA GGCGATTTCG
CAGGACGCGA AGATTCTGGT CCTGGACGAA CCATCGACAG CCCTGGCCGT GTCCGACGTC
GAACGCCTTT TCGCCTTCCT AAGGAAGCTC AAGGACAAGG GTGTCGCGAT AATCTATGTC
AGCCATCGCA TGGACGAGAT CGCCCGCATT TCTGATCGCG CCACGATCCT GCGCGACGGG
CGGCACGTGA TCACGGCGCC GATCAGCGAA TTGCCGATCG ACACGATGAT CGAACACATT
GTCGGACGTC GCTCAATGGG ACTGTCCGAT GTGTTGCGCG GGACTGCATC GCGCGGCGAC
GTGGTGCTCG AGGTTACGAA ACTGAGTGGC ACGCACAAAC CGAGGGACGT ATCGTTCGCA
CTGCATAGCG GCGAGGTTCT CGGACTGGCA GGGCTCCTCG GCTCCGGCCG ATCTTCGCTC
GCGAGAGTGC TTGCCGGGAT AGAGCCGGCC TCAAGCGGTA CAATCCGCGT CCACGGCTCC
GACGCTTCCA TCAGGACGCC CAAACAGGCG ATCGACGCCG GGCTGGCACT TGTTCCGGAA
GCGCGCGCCA CCCAGGGTAT CATCCCTGCC CATTCCGTTT CCGCAAACAT GGTGATGGCG
GTGATCGGAC GGCTGTCGCG GGCAGGCTTC ATGGACGCAG CCGCGGAACG GAAGCTGACG
GACGACCAGA TCCAGCGGCT GGCGGTCAAG ACGGCAAGCC GCGATCATGC CGTATCGACG
CTGTCTGGCG GCAACCAGCA GAAGGTGGTC ATCGGCAAAT GGCTGGCGAC CGACCCGGAT
ATCCTGATCC TGGATGAACC GACCGCCGGC ATCGATATCG GCTCCAAGGC CGAGATCATA
AAACTTGTGC GCGAACTCGC GGCCGCGGGC AAAGCGATCA TTATGGTCTC ATCCGAACTG
TCGGAATTGC TCACGGCATG CGACCGGATC CTCGTCATGA CTGAGGGGCG TGTTCATCAG
GACCTGCCAC GGGAGGCTTT CGACGATCCG TCCCTCCCCG CCACCGATCT TGCACACCGG
CTCCAGGCCG CCGAGCAGCG CCTTCAAATC GAGATTCAGC GGGCCTTGAG GCTCCAGGAG
ACTTCAGATG CCTAA
 
Protein sequence
MNNGVSAVRM TGISKAFGGV RALEGVDFEV RPGEIHALLG GNGAGKSTIL KILNGVHKPD 
RGNIEVAGRN LSAHTPEESR AAGIAMNYQE MSLVPTLTVA QNIFLTRESR NGMGLIDDGE
AERKAAELFA MLEVSVDPRA IVGDIGAGQK QLTEIAKAIS QDAKILVLDE PSTALAVSDV
ERLFAFLRKL KDKGVAIIYV SHRMDEIARI SDRATILRDG RHVITAPISE LPIDTMIEHI
VGRRSMGLSD VLRGTASRGD VVLEVTKLSG THKPRDVSFA LHSGEVLGLA GLLGSGRSSL
ARVLAGIEPA SSGTIRVHGS DASIRTPKQA IDAGLALVPE ARATQGIIPA HSVSANMVMA
VIGRLSRAGF MDAAAERKLT DDQIQRLAVK TASRDHAVST LSGGNQQKVV IGKWLATDPD
ILILDEPTAG IDIGSKAEII KLVRELAAAG KAIIMVSSEL SELLTACDRI LVMTEGRVHQ
DLPREAFDDP SLPATDLAHR LQAAEQRLQI EIQRALRLQE TSDA