Gene Information |
Locus tag | Rleg_3887 |
Symbol | |
ID | 8014707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3955888 |
End bp | 3957198 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826457 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002977669 |
Protein GI | 241206573 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4536] Putative Mg2+ and Co2+ transporter CorB |
TIGRFAM ID | |
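The length fields above are consistent with the coordinates if Start bp and End bp are 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3955888, 3957198)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch gives 1311 bp and 436 aa, matching the Gene Length and Protein Length fields.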
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0724448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00619914 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGTCG AAGGCGCTCT GGCATTTCTT TCGACATATT GGCCGGAGAT CCTTTCGATC ACGGCGCTCG TGCTCATGTC CGCCTTCTTT TCCGGCTCCG AGACCGCGCT GACCGCCGTT TCGCGCAGCC GTATCCATAC GCTCGAGGTC AACGGCGACG AACGCGCCGG CCTCGTCCGG CAGTTGATCG AACGGCGCGA CCGGCTGATC GGTGCGCTGC TCATCGGCAA CAATCTCGCC AATATCCTGT CCTCCTCGAT CGCCACCAGC CTCTTCCTCG GGCTGTTCGG CAGTTCCGGC GTGGCGCTGG CGACGCTCGC GATGACCGTC ATCCTGGTGA TCTTCGCGGA AGTGCTTCCG AAGAGCTGGG CGATTTCGGC GCCTGAGCGC TTCGCACTCG CCATCGCGCT GCCGGCCAGG CTGTTCGTTG CCGTCGTCGG CCCGGTTTCC TCCTTCGTCA ATGCGATCGT GCGGCAGATT CTTTCGCTGT TCGGCATCAA TCTCTCACGA GAGACATCGA TGCTGACGGC GCATGAGGAA CTGCGCGGTG CCGTCGATCT GCTGCACCGC GAGGGATCGG TGGTGAAGGC CGACCGCGAC CGCCTCGGCG GCGTGCTCGA TCTTAGCGAG CTCGAACTGT CCGACATCAT GGTCCACCGC ACCGCGATGC GGGCGATCAA CGCCGACGAT GCGCCGGAAG CGGTGGTGCG GGTTATCCTC GAAAGCCCCT ATACGCGCAT GCCGCTGTGG CGTGGCACGA TCGACAACAT CATCGGCGTC GTCCATGCCA AGGATCTGCT GCGGGCGCTT GCCGAGCCGA ACATGGAGCC GCAGAACCTC GATATCGTGA AGATCGCGCA GAAGCCGTGG TTCGTGCCCG ACAGCACCAA CCTCGAGGAC CAGCTCAACG CCTTCCTGCG GCGCAAGCAG CATTTCGCCG TCGTCGTCGA CGAATATGGC GAGGTGCAGG GCATCGTCAC GCTGGAAGAT ATTCTCGAGG AAATCGTCGG CGACATTTCC GACGAACACG ATATCGAAAT ACAGGGCGTG CGTCAGGAGG CTGACGGCTC CGTCGTCGTC GACGGCGGCG TTCCGATCCG CGACCTGAAC CGCGCGCTCG ACTGGAACCT GCCCGATGAG GAGGCGACGA CGATCGCCGG CCTCGTTATC CACGAATCGA TGACCATCCC GGAAGAGCGC CAAGCCTTCA CCTTCTACGG CAAGCGTTTC GTCGTCATGA AGCGGGAGAA GAACCGCATC ACCAAGCTGC GCATCCGCCC GGCCGGAGAA GACGGCGCAA AGCCAGCCTG A
|
Protein sequence | MSVEGALAFL STYWPEILSI TALVLMSAFF SGSETALTAV SRSRIHTLEV NGDERAGLVR QLIERRDRLI GALLIGNNLA NILSSSIATS LFLGLFGSSG VALATLAMTV ILVIFAEVLP KSWAISAPER FALAIALPAR LFVAVVGPVS SFVNAIVRQI LSLFGINLSR ETSMLTAHEE LRGAVDLLHR EGSVVKADRD RLGGVLDLSE LELSDIMVHR TAMRAINADD APEAVVRVIL ESPYTRMPLW RGTIDNIIGV VHAKDLLRAL AEPNMEPQNL DIVKIAQKPW FVPDSTNLED QLNAFLRRKQ HFAVVVDEYG EVQGIVTLED ILEEIVGDIS DEHDIEIQGV RQEADGSVVV DGGVPIRDLN RALDWNLPDE EATTIAGLVI HESMTIPEER QAFTFYGKRF VVMKREKNRI TKLRIRPAGE DGAKPA
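The gene sequence above begins with ATG and ends with TGA, so it appears to be listed in coding orientation even though the gene lies on the minus strand. The sketch below, offered under that assumption, translates such a sequence and computes a GC fraction. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here. `translate`, `gc_fraction`, and `clean` are hypothetical helper names, and how the page rounds its GC content value is not stated.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base groups of the gene sequence above.
print(translate("ATGTCGGTCG AAGGCGCTCT GGCATTTCTT"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MSVEGALAFL, which matches the first block of the protein sequence in the record.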