Gene Rleg2_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3503 
Symbol 
ID6982257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3621716 
End bp3622966 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID643398221 
ProductHI0933 family protein 
Protein accessionYP_002282996 
Protein GI209551079 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.849047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AGCGAATTGC CATCATCGGC GGCGGCCCGG CGGGCCTGGC GGCCGCCGAA 
CTGCTTTCGC TCTCCGGCCA TGCGGTGACG GTCTACGACG CCATGCCGAC TTTCGCCCGC
AAGTTCCTGC TCGCCGGCAA ATCGGGTCTG AACATCACCC ATTCCGAGGA TTATGCCCGT
TTCGCCACGC GCTTCGGCCC GGCCTCCGCC CGCCTGCGCC CCGCCCTGGA TGCCTTTACC
CCTGTCGATA TCAGGGACTG GGCGGCAGGG CTCGGGACGG AGACCTTCGT CGGTTCGTCC
GGCCGGGTCT TCCCGATGGT GATGAAAGCC TCTCCCTTGC TGCGCGCCTG GCTCAAGCGA
TTGGAGGCGC AGGGTGCTGT GCTCCGCACC CGCCACCGTT GGATCGGTTT TGCCGATGAG
GGCTATGTTT TCGAAACGCC GGAAGGGCGC AGCATCGTCC ATTGCGACGC CGCCCTGCTG
GCGCTCGGCG GCGCAAGCTG GCCGCGCCTC GGCTCGGATG CGGGCTGGCT GCCGTGGCTA
TCGGAGAAGG GTGTCGAGAT CGACGCCTTC CAGCCCGCCA ATTGCGGCTT CGTCGTCGGC
TGGAGCGAAA ACTTCCGCGA GCGTTTCGCC GGCGAGCCGG TGAAATCGGT CACCGCCACC
TCCGAAGCCG GCACTTTTCC CGGCGAATTC GTCATCACCA CAACCGGCAT CGAGGGCAGC
CTGGTCTACG CTCATGCGGC AAGCCTCCGC GACCGGCTGC TGGACCGCGG CAGCGCGGCC
CTGACGCTCG ACCTCGCCCC GGGCCGGACA GTCGAAAGGC TGGCCCGCGA TCTTGCGCGG
CAGGACGCCA AATCGAGCTT TTCAACCCGC CTGCGCAAGG GCGCCGGCCT CGACGGCGTC
AAGGCGGCCT TGCTGCGGGA ACTCGCTCCC GAGCGCGACA GAGCCGATCC CGGCCGTCTC
GCCGGCCTGA TCAAGGCCCT GCCGGTGCCG GTTCTCGAGA CAAGGCCGAT CGGCGAGGCG
ATCTCCTCGG CCGGCGGCAT CCGCTGGAGC GGCATCGACG ACGGCTTCAT GTTGACGGCG
CTGCCGGGCA CCTTCGTCGC CGGCGAGATG CTTGACTGGG AGGCGCCGAC CGGCGGCTAC
CTCCTCACCG CCTGCCTTGC GACCGGCCGG GCCGCTGCGC GCGGCATTGA GGCTTGGCTG
CACGGATACG GGCGCTCGCC GGCACTGAAC GACAAACAGG ACCTTCCCTG A
 
Protein sequence
MSQKRIAIIG GGPAGLAAAE LLSLSGHAVT VYDAMPTFAR KFLLAGKSGL NITHSEDYAR 
FATRFGPASA RLRPALDAFT PVDIRDWAAG LGTETFVGSS GRVFPMVMKA SPLLRAWLKR
LEAQGAVLRT RHRWIGFADE GYVFETPEGR SIVHCDAALL ALGGASWPRL GSDAGWLPWL
SEKGVEIDAF QPANCGFVVG WSENFRERFA GEPVKSVTAT SEAGTFPGEF VITTTGIEGS
LVYAHAASLR DRLLDRGSAA LTLDLAPGRT VERLARDLAR QDAKSSFSTR LRKGAGLDGV
KAALLRELAP ERDRADPGRL AGLIKALPVP VLETRPIGEA ISSAGGIRWS GIDDGFMLTA
LPGTFVAGEM LDWEAPTGGY LLTACLATGR AAARGIEAWL HGYGRSPALN DKQDLP