Gene Rleg_4008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4008 
Symbol 
ID8014817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4086157 
End bp4087155 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content58% 
IMG OID644826577 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_002977788 
Protein GI241206692 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.182946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGAAT TCAATATCAT GCGGGCTCAC GCCGGTCGCC TTTCCATGTC CACCATCGCC 
GGCATCATGT TGCTTTGCAC CGGCCAAGCG AATGCCGAAA CTCTTCGGCT CGCCCACGCA
TCGAGCTCGA AGAGCCTCAT TCAGGAGGCT GTCGTCATGT TCGCCGACAA GCTCGCCGGC
GAAACCAAGG GCGGCCTTAC CGTCCAGATT TTTCCAGATG GTCAGTTGGG CGATGAGGGA
CCGATCGCCG ATGGTGTCGG CTCCGGCTCC ATCGATATCG GGTTAGGCGG CGTTGCCGAT
GCGATCGATC CGAAGCTCAA CGTCGTCACC TTGCCGTTCT TGTTTTCCGA TGCAAACGCA
GCGCACACCT TTCTCGACGG ACCAGTCGGG AAGAAGGTCT TCGACACGGG TGCCGACAAC
GGCTTCAAGA TGCTCGGCGC GCTTGATTCC GGTTTCCGCC AATTTGCAAC TGTCAGCAAA
TCAATCGCGA CGCCGGAGGA TATCAAGGGT CTGAAGCTGC GCACGCCGCC GAACCCCGTC
ATTCTCGCAA CCATCGAACA GCTGGGTGCC CTGCCGCAAT CGATTCCATT CGGGGAGGTC
TATACCTCGC TGCAATCGCA TGTGGTCGAC GGCGTGGAGC CGGAAATACG CGATTTCGCG
GATCAGAAAT GGTACGAAAG CGCGAAGTTC CTATCGGTCT CGAACTATAT CTGGACGCCG
AATTACTGGT TCATGAACAA GGAGCGCTTC GACGCTCTGA GCCCGGAAAA CCAGGCTGCG
GTGACCAAGG CAGTCGAAGA GACGACGATC TGGTACCGCG GAAAACTCGA CGAAGTCTAT
GCCCAGGTCA TTGAGGACCT CAAGTCGAAG GGCGTCACCG TAACGACGGT GGACACGACA
CCCTTCCGTG CGATGGTTGA TCCTGTCTAT GTGAAATTCG GGGCGGAATG GGGCGACGAT
CTGGTGTCGT CCGTGCGCTC GGCAGCAGCC GGAAAATAG
 
Protein sequence
MLEFNIMRAH AGRLSMSTIA GIMLLCTGQA NAETLRLAHA SSSKSLIQEA VVMFADKLAG 
ETKGGLTVQI FPDGQLGDEG PIADGVGSGS IDIGLGGVAD AIDPKLNVVT LPFLFSDANA
AHTFLDGPVG KKVFDTGADN GFKMLGALDS GFRQFATVSK SIATPEDIKG LKLRTPPNPV
ILATIEQLGA LPQSIPFGEV YTSLQSHVVD GVEPEIRDFA DQKWYESAKF LSVSNYIWTP
NYWFMNKERF DALSPENQAA VTKAVEETTI WYRGKLDEVY AQVIEDLKSK GVTVTTVDTT
PFRAMVDPVY VKFGAEWGDD LVSSVRSAAA GK