Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3503 |
Symbol | |
ID | 6982257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3621716 |
End bp | 3622966 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643398221 |
Product | HI0933 family protein |
Protein accession | YP_002282996 |
Protein GI | 209551079 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.849047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AGCGAATTGC CATCATCGGC GGCGGCCCGG CGGGCCTGGC GGCCGCCGAA CTGCTTTCGC TCTCCGGCCA TGCGGTGACG GTCTACGACG CCATGCCGAC TTTCGCCCGC AAGTTCCTGC TCGCCGGCAA ATCGGGTCTG AACATCACCC ATTCCGAGGA TTATGCCCGT TTCGCCACGC GCTTCGGCCC GGCCTCCGCC CGCCTGCGCC CCGCCCTGGA TGCCTTTACC CCTGTCGATA TCAGGGACTG GGCGGCAGGG CTCGGGACGG AGACCTTCGT CGGTTCGTCC GGCCGGGTCT TCCCGATGGT GATGAAAGCC TCTCCCTTGC TGCGCGCCTG GCTCAAGCGA TTGGAGGCGC AGGGTGCTGT GCTCCGCACC CGCCACCGTT GGATCGGTTT TGCCGATGAG GGCTATGTTT TCGAAACGCC GGAAGGGCGC AGCATCGTCC ATTGCGACGC CGCCCTGCTG GCGCTCGGCG GCGCAAGCTG GCCGCGCCTC GGCTCGGATG CGGGCTGGCT GCCGTGGCTA TCGGAGAAGG GTGTCGAGAT CGACGCCTTC CAGCCCGCCA ATTGCGGCTT CGTCGTCGGC TGGAGCGAAA ACTTCCGCGA GCGTTTCGCC GGCGAGCCGG TGAAATCGGT CACCGCCACC TCCGAAGCCG GCACTTTTCC CGGCGAATTC GTCATCACCA CAACCGGCAT CGAGGGCAGC CTGGTCTACG CTCATGCGGC AAGCCTCCGC GACCGGCTGC TGGACCGCGG CAGCGCGGCC CTGACGCTCG ACCTCGCCCC GGGCCGGACA GTCGAAAGGC TGGCCCGCGA TCTTGCGCGG CAGGACGCCA AATCGAGCTT TTCAACCCGC CTGCGCAAGG GCGCCGGCCT CGACGGCGTC AAGGCGGCCT TGCTGCGGGA ACTCGCTCCC GAGCGCGACA GAGCCGATCC CGGCCGTCTC GCCGGCCTGA TCAAGGCCCT GCCGGTGCCG GTTCTCGAGA CAAGGCCGAT CGGCGAGGCG ATCTCCTCGG CCGGCGGCAT CCGCTGGAGC GGCATCGACG ACGGCTTCAT GTTGACGGCG CTGCCGGGCA CCTTCGTCGC CGGCGAGATG CTTGACTGGG AGGCGCCGAC CGGCGGCTAC CTCCTCACCG CCTGCCTTGC GACCGGCCGG GCCGCTGCGC GCGGCATTGA GGCTTGGCTG CACGGATACG GGCGCTCGCC GGCACTGAAC GACAAACAGG ACCTTCCCTG A
|
Protein sequence | MSQKRIAIIG GGPAGLAAAE LLSLSGHAVT VYDAMPTFAR KFLLAGKSGL NITHSEDYAR FATRFGPASA RLRPALDAFT PVDIRDWAAG LGTETFVGSS GRVFPMVMKA SPLLRAWLKR LEAQGAVLRT RHRWIGFADE GYVFETPEGR SIVHCDAALL ALGGASWPRL GSDAGWLPWL SEKGVEIDAF QPANCGFVVG WSENFRERFA GEPVKSVTAT SEAGTFPGEF VITTTGIEGS LVYAHAASLR DRLLDRGSAA LTLDLAPGRT VERLARDLAR QDAKSSFSTR LRKGAGLDGV KAALLRELAP ERDRADPGRL AGLIKALPVP VLETRPIGEA ISSAGGIRWS GIDDGFMLTA LPGTFVAGEM LDWEAPTGGY LLTACLATGR AAARGIEAWL HGYGRSPALN DKQDLP
|
| |