Gene Rleg_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2027 
Symbol 
ID8013060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2019689 
End bp2020777 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content64% 
IMG OID644824614 
Productcitrate synthase 2 
Protein accessionYP_002975845 
Protein GI241204749 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACG GCTTGGAAGA TGTCATTGCT GCCGAAACGC AGCTTTCGGA TGTCGATGGC 
GAAGCGGGAC GACTGATCAT CCGCGGTGTA TCGCTGGATC ACCTGGTTGC AGACGGCACC
TATGAAGGCG TCGCCGCCCT GTTGCTCGAT GGGCTGATGG AAAAAAGCTT CGACGAAGCG
GAATTGCGCG ACTGGTTGGC GCAGGCGCGA ACGAGGATTT TCGGCCATAT CAAGGCCGCC
GATGCCGCCC TGCTCGCTTT GCCTCCTGTT GATGCGATGC GGGCGCTGAT CGCCCGCCTG
CCCGACGGCG AGGATTTCGA TACTGCGCTC AGCCTTCTGG CTGCGCCCGC AGTTTTCCTG
CCGGCGATCC TTCGCATGCA AAGCGGCAAA AGACCGATCG CGCCCGACGC CTCGTTGCCG
CAGGCGGCCG ATATCCTGCG TATGCTGACC GGAAAATTGC CGACCAGGGA GCAGACGGCG
GCACTCGACA CCTATCTCGT GACGATATCA GACCATGGCC TCAATGCCTC GACCTTCGCA
TCACGCGTCA TTGCCTCGAC GCAGGCCGGC CTCACTTCTT CCGTGCTCGC CGCACTGAGC
GCGCTGAAAG GGCCACTGCA CGGCGGCGCG CCCGGTCCTG TGCTCGACAT GCTGGATGCG
ATCGGAACGG CGGAGAATGC TTACTCGTGG CTCGGCGAAG CGCTCGACCG CGGCGAAAGG
CTGATGGGCT TCGGCCACCG CATCTATCGC GTCCGCGATC CGCGCGCCGA TGCACTGAAG
GGAGCGCTGA AGCCGCTGAT ATCAACCGGA CAGGTAAACA GCGCCCGCGG TACATTGGCC
GAAGCCGTGG AGGCCTCTGC ATTGGCCATC TTGAAGGCGC GCAAGCCGAA CCGGCCGCTC
GACGTCAATG TCGAGTTCTA CACTGCGCTT CTGCTCGAAG CGCTCGGCTT TCCCCGCGAG
GCCTTCACCG GCGTCTTCGC GATCGGCCGC ACCGTCGGAT GGCTGGCGCA TGCCCGCGAA
CAGGCGCTCG ACGGCCGGCT GATCCGTCCA CGTTCGGTCT ATATCGGGCC GCTGCCCGCC
GCTGCCTGA
 
Protein sequence
MKNGLEDVIA AETQLSDVDG EAGRLIIRGV SLDHLVADGT YEGVAALLLD GLMEKSFDEA 
ELRDWLAQAR TRIFGHIKAA DAALLALPPV DAMRALIARL PDGEDFDTAL SLLAAPAVFL
PAILRMQSGK RPIAPDASLP QAADILRMLT GKLPTREQTA ALDTYLVTIS DHGLNASTFA
SRVIASTQAG LTSSVLAALS ALKGPLHGGA PGPVLDMLDA IGTAENAYSW LGEALDRGER
LMGFGHRIYR VRDPRADALK GALKPLISTG QVNSARGTLA EAVEASALAI LKARKPNRPL
DVNVEFYTAL LLEALGFPRE AFTGVFAIGR TVGWLAHARE QALDGRLIRP RSVYIGPLPA
AA