Gene Rleg_5016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5016 
Symbol 
ID8007607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp401499 
End bp402773 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content64% 
IMG OID644821931 
ProductFAD dependent oxidoreductase 
Protein accessionYP_002973191 
Protein GI241113356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.208645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.555113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGTTCG AATCCTACTG GCACGACACG GCACCAGCCT TTGCCGGCGG CGCTCAGGGC 
CCCGTCGAAG GACATTATGA CGTCGCCATC ATCGGCGCCG GCTTCACCGG GCTTGCCGCG
GCGCGCCAGC TTGCAAAGGC CGGCGTCAAG GTCGTCGTGC TGGAGGCTGA ACGGGTCGGC
TGGGGGGCAT CGGGGCGCAA TGGCGGCCAT CTCAACAACG GGCTCGCCCA CAGTTTCCTG
TCTGCCAAAT CAGCACTCGG CGTGGAGCGG GCCGTCGCCC TCTACAAGGC GTTCGACGAT
TCCATCGACA CGATCGAGGC GATTATCGCC GAAGAGGAGA TCGACTGCAG TTTCCGCCGC
GCCGGCAAGC TGAAACTTGC CTCCAAGCCG CAGCATTTCG ACGCCATTGC CCGCAATTTC
GAGGCTGTTC ACAGAGAAGT CGATCCCGAC ACGGCACTTC TGACGGCGAG TGACCTGAAA
AGCGAGGTTG GGTCGCCCTT CCATGGCGCC ATGCTCTCGA AGAAAAGCGC GATGATGCAT
ATGGGCTGCT ATGTCGTGGG GCTCGCCACG GCGGCCGCGC GTCACGGCGC GACCATCTTC
GAAAAGGCGG CCGTGACCGC CCACCGGCAG GGCAACGGCC GGCACAGCCT GACGACGGCA
CGCGGTACCG TCACCGCAGA CCACGTGCTG GTCGCGACAG GCGCCTATAC GCCGTCGCTT
TTTAATTATT TCCGCCGCCG GATTATTTCC GTCGGCAGCT TTCTGATCGC CACCCGTCCG
CTGACCGACG CCGAAATCGC CGCTACGATG CCGGGCAACC GGACCTGCGT GACGTCGATG
AACATCGGTA ATTATTTCCG GCTATCGCCG GACAAACGGC TGATCTTCGG CGGCCGTGCG
CGATTTTCCG CCACGTCGGA TCAGCGCTCG GACGCAAGGA GCGGCGATAT TCTGCGCGCC
AGCCTTGCTG AAATCTTCCC GCAGCTTGCC GGCGTCGAAA TCGACTATTG CTGGGGCGGG
CTCGTCGACA TGACGAAGGA CCGCTATCCG CGCGCCGGCT ATGTCGACGG TGTCTGGTAT
GCCATGGGCT ATTCCGGCCA CGGCGCCCAG CTCTCCACCC ATCTCGGCAT GATCACGGCC
GACGCCATCC TCGGCAAGGC CGACCTTAAT CCGATCAAGG GGCTCGACTG GCCCGCCGTC
CCCGGCCATT TCGGCAAGCC GTGGTTCCTG CCGCTCGTCG GGCTCTATTA CAAGACGCTC
GATCGCTTCC AGTAA
 
Protein sequence
MWFESYWHDT APAFAGGAQG PVEGHYDVAI IGAGFTGLAA ARQLAKAGVK VVVLEAERVG 
WGASGRNGGH LNNGLAHSFL SAKSALGVER AVALYKAFDD SIDTIEAIIA EEEIDCSFRR
AGKLKLASKP QHFDAIARNF EAVHREVDPD TALLTASDLK SEVGSPFHGA MLSKKSAMMH
MGCYVVGLAT AAARHGATIF EKAAVTAHRQ GNGRHSLTTA RGTVTADHVL VATGAYTPSL
FNYFRRRIIS VGSFLIATRP LTDAEIAATM PGNRTCVTSM NIGNYFRLSP DKRLIFGGRA
RFSATSDQRS DARSGDILRA SLAEIFPQLA GVEIDYCWGG LVDMTKDRYP RAGYVDGVWY
AMGYSGHGAQ LSTHLGMITA DAILGKADLN PIKGLDWPAV PGHFGKPWFL PLVGLYYKTL
DRFQ