Gene Rleg_4633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4633 
Symbol 
ID8015377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4755971 
End bp4757068 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID644827208 
Productprotein of unknown function DUF917 
Protein accessionYP_002978408 
Protein GI241207312 
COG category[S] Function unknown 
COG ID[COG3535] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.951403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.119918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGCA TACTCGTTGA GAAGGACGTG GAAGCTGCCG TCAAGGGCGG CTCCGTCTAT 
GCCGCCGGCG GCGGCGGCTG GGCCGATCAC GGGCGGATGC TTGGTTATGC CGCCGTCAAT
GTCGGCAAGC CGGAGCTGGT CTCGATCGAC GAATTGCGGG ACGAGGACTG GATCGCGACT
GCGGCTGCGA TCGGCGCGCC GGCCTCCACC ACGCCCTGGG AAATGCAAGG CATCGACTAT
GTGAAGGCGG TGCAATTGCT GCAGGAGGCG CTGGGCGAAA AGCTTTCCGG GCTGATCATC
GGCCAGAACG GCAAGTCCTC GACGCTGAAC GGCTGGCTGC CCTCGGCGAT CCTCGGCACC
AAGGTAGTCG ACGCCGTCGG CGATATCCGC GCACATCCGA CGGGCGACAT GGGCTCGATC
GGCATGGCCG GTTCGCCCGA GCCGATGATC CAGACCGCTG TCGGCGGTAA TCGCGCCGAG
AACCGTTACA TCGAACTGGT GGTGAAGGGG GCGACGGCGA AGATCTCGCC GGTGCTGCGC
GCCGCAGCCG ACCAATCCGG CGGCTTCATC GCCAGCTGCC GCAATCCGCT CCGCGCCTCC
TATGTCCGCA GCCATGCAGC ACTCGGCGGC ATATCGATGG CGCTTGCGCT CGGCGAAGCG
ATCATCGCGG CGGAGAAGCG CGGCGGATCT GATGTCATCG ACGCGATCTG CAAGACGACG
GGCGGACATA TCCTTGCCGA AGGCGTCATC ACCCGCAAGG ACGTCGTCTA TACCAAGGAA
GCCTTCGACA TCGGCACGAT CACCGTCGGC GCAGGCGAAA CGTCGGTGAC GCTGCATGTG
ATGAACGAAT ATATGGCGGT GGACGATGCG GATGGCGGGC GGCTAGCGAC CTTCCCCGCG
GTGATCACCA CGCTTTCACC AGAGGGCGAG CCGCTGAGTG TCGGCCAGCT CAAGGAGGGC
GTGCATGTGT TCATCCTGCA TGTGCCGATG GATATCATTC CGCTGTCGGC AAGCGTGCTC
GATCCGACCG TCTATCCCGT CGTCGAAAAG GCGATGGGGA TCGAGATCGC ACGCTATGCA
CTGGCGACGA AGGCCTGA
 
Protein sequence
MGRILVEKDV EAAVKGGSVY AAGGGGWADH GRMLGYAAVN VGKPELVSID ELRDEDWIAT 
AAAIGAPAST TPWEMQGIDY VKAVQLLQEA LGEKLSGLII GQNGKSSTLN GWLPSAILGT
KVVDAVGDIR AHPTGDMGSI GMAGSPEPMI QTAVGGNRAE NRYIELVVKG ATAKISPVLR
AAADQSGGFI ASCRNPLRAS YVRSHAALGG ISMALALGEA IIAAEKRGGS DVIDAICKTT
GGHILAEGVI TRKDVVYTKE AFDIGTITVG AGETSVTLHV MNEYMAVDDA DGGRLATFPA
VITTLSPEGE PLSVGQLKEG VHVFILHVPM DIIPLSASVL DPTVYPVVEK AMGIEIARYA
LATKA