Gene Rleg2_3701 details

Gene Information

Locus tag: Rleg2_3701
Symbol:
ID: 6982463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3828671
End bp: 3829957
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 66%
IMG OID: 643398423
Product: hypothetical protein
Protein accession: YP_002283190
Protein GI: 209551273
COG category: [S] Function unknown
COG ID: [COG4223] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
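
The length fields in this record are consistent with the coordinates if the start and end positions are read as 1-based and inclusive. That convention is an assumption here, not something the record states. Under it, gene length is end minus start plus one, and protein length is the codon count minus one terminal stop codon. A minimal Python sketch of that check, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3828671, 3829957)
print(length, protein_length(length))  # 1287 428
```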


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATCGG GAAACCCGCC ACGCCATTCG AAGAGCGCCG ACGAGCCGGT CACGATCGAC 
CTCGAAGGAC AGGATTTCGC CTCTGCAGCC GATACCGAAA AGCCGGTTGA GAAGGATGTC
GGCGACGCCG ACAACAGCAC CGCCGATGCC GGCGTGACGC CCGAAACCGA GGCTGCGCCG
CAGTTCGAAC AGGCACGAAC CGAGCAGGAA CAAGCCGAAC ACGAACCAGC CGAACAAGAG
GACCAGCCTG CAACGGATGC GCCGGAGGAG GAGCCGGCAG CCCCGGAGCC TGCCTTCGCG
CCGCCTCCCG AACAGCCGCG GCGCGCCGCC ACCTCCGGTC TGATCGCGGC CGGCATCTTT
GGCGGCCTGG TGGCGCTGCT TGGCGCCGGC GCCATTCAAT ATGCCGGCTA TCTCCCGGGT
TCCTCCGCGC CGCAGGCGAC ATCGCCTGAC ATCGCCGATC TTTCCGGCGA GATCGATGGC
TTGAAACAGA CCGTTGCCAA TCTTGCCGCC AATCCAGCGA GTACAGATGA CGGCGCGCTT
GAAAAGCGCA TCGCCGCGCT GGAAACGACT GCCAAGGCGC CCGCAGCCGC CGCCCCGGCC
GATTCGGCAA ATGTCGAGGC ACTCAACCAG AAGATTGCCG AGCTGACCGG CCAGGTCGAC
CAACTGCGTG CCACCCTGGC CCAGTCTTCC GAGCAACAGA CGACGAGCGG CGCCGATATC
GCCAAACGTC TCGACGAGGC CGAAAAGAAG CTGAACGAGC CGCGCGAGGA TGTCGCCGTC
GCCCGGGCGA TCGCGGCGGC CGCCCTCAAG GCGGCGATCG ATCGCGGCGG GCCGTTCCTG
GCCGAACTCG ATACTTTCGC CGGCGTCGCC CCCGACGATC CCGCAGTCGC CGACCTTCGA
GCCTTTGCCG AAACCGGCAT TCCCTCGCGC GCCGAACTCA TGCGTCAGGT TCCCGATGTC
GCCACGGCGA TCGTCGAAGC CGTCAACCAG CCGGATCCAA ACGAGAGCTG GTCGGACCGG
TTGATGTCGA GCGCCAAGTC GCTGGTATCG GTCCGTCCCG TCGGCAATAT CGAGGGCGAC
AGCGTAGAAG CCATCGCCGC CCGCATGGAG GACAAGGTGA AGAGCGGCGA TTTGCCGGGC
GCTTCCGCCG AATGGAACAA CCTGCCGGCT CCCGGCAAGC AGGCGTCCGC CGCCTTCAAG
CAATCGCTCG AAGCGCGTAT CCGCGTCGAG GAACTGGTCG GCGGGGCGCT GTCGAAAGCG
GTTTCCGGCA CCGGCAAGGA GGGATGA
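
The record does not say how its GC content figure was derived. The usual definition is the fraction of G and C bases over the total sequence length, and the sketch below assumes that definition. The function name `gc_content` is illustrative, and the input may contain the spaces and line breaks used in the listing above.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, as a small example.
print(gc_content("ATGGTATCGG GAAACCCGCC"))  # 0.6
```

Passing the full gene sequence and rounding to a whole percent would give a value comparable to the GC content field.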
 
Protein sequence
MVSGNPPRHS KSADEPVTID LEGQDFASAA DTEKPVEKDV GDADNSTADA GVTPETEAAP 
QFEQARTEQE QAEHEPAEQE DQPATDAPEE EPAAPEPAFA PPPEQPRRAA TSGLIAAGIF
GGLVALLGAG AIQYAGYLPG SSAPQATSPD IADLSGEIDG LKQTVANLAA NPASTDDGAL
EKRIAALETT AKAPAAAAPA DSANVEALNQ KIAELTGQVD QLRATLAQSS EQQTTSGADI
AKRLDEAEKK LNEPREDVAV ARAIAAAALK AAIDRGGPFL AELDTFAGVA PDDPAVADLR
AFAETGIPSR AELMRQVPDV ATAIVEAVNQ PDPNESWSDR LMSSAKSLVS VRPVGNIEGD
SVEAIAARME DKVKSGDLPG ASAEWNNLPA PGKQASAAFK QSLEARIRVE ELVGGALSKA
VSGTGKEG
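
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the standard NCBI base ordering (TCAG), relying on translation table 11 sharing the standard table's amino acid assignments. Table 11's alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; trailing bases that do not fill a codon are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTATCGG GAAACCCGCC ACGCCATTCG"))  # MVSGNPPRHS
```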