Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3374 |
Symbol | |
ID | 8014254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3387197 |
End bp | 3389305 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825933 |
Product | protein of unknown function DUF187 |
Protein accession | YP_002977160 |
Protein GI | 241206064 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000681904 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.108912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG CCAGAAAACA GAACACCCAC ACGCTGAGGA CGCCGGACTG GTTCAAGACC GCGACCCGCT GGACCCAGCT GACCTTCGTG GAAGACGACC CGGAGAAATA CGACCCGGCA TTCTGGATCG ATGTCTTCAA ACGCACGAAA TCGAACGCGG TCTGCCTCAG CGCAGGCGGC TATATCGCCT ATTATCCGAG CGAAGTGCCC TACCACTATG TGAGCAAATA TCTCGGCGAC AAGGATATCT TCGGCGCGCT CGTCGACGCC GCCCGCAAGC TCGACATGCA TGTCATGGCC CGCGTCGACC CGCATGCGAT CCATGATGAT GCCGCCAAAG CCCATCCGGA ATGGGTGATG ATCAATGCCG ACGGCACGCC ACGCCGCCAC TGGGCCTATC CCGATGTCTG GGTGACCAAT GCCTATGGCG ACTACAACAG CGTCTTCATG CCTGAGGTGG TCAAGGAGAT CGTCCGCAAA TACGATATCG ATGCGGTCTT CGCCAATCGC TGGCAGGGCC ACGGCGTCGA TTATAGCGAA GACAGCGCCC GCCGCTTCAA GGATATGTCC GGCCACGCCC TGCCTGTAAA ACCCGATGCC GAGGATCCGG CCTGGCAGGC TTGGGTGCAA TGGCGCCGCC GCGTGCTCAC CGACATGATC GCGCAATGGG ACGATGCCGT CAAAGCGATC CGCCCGCATG CGAGCTTCAT TCCGAACATG GGCGGCGCGT CGCTGATGGA ATTCGACCTC TCGGTCATCG CCAGGCACTG CCCCTTCCTC GTCGTCGACC ATCAGGGCCG CAAGGGCCTG GAGCTCGGCT GGTCGGCTGG CCGTAACGGC AAGCGCATCC GCGCCACCTT CCCCGACCGC CCGGTTGTGC TGATCACCTC GATCGGCCCG GAGGAGGAAT ATCGCTGGAA GGATGCCGTC ACCTCGGGTG AGGAGATGCA GCTTTGGATC AACGACGGCA CTGCCCACGG CCTCTACGCC TGGTTCACCA AATTCAACGG TGTCGTGCCC GACAAACGCT GGGTCGAGCC GGTGGCCGAC GCATTCGGCC TGCAGGCAGC CGTCGAGCCC GTTCTGGAAA GCATGAAGCC GACCGCCGAA ATCGCTGTCG TCGATCCCTC GACGACGCTG CGCCATTGGG CGCCGGAAGA GCGGCATTCC GCCGAGAAGC ACGATCTCGG TCTCTATCAC GCCCTCGTCG AAGCCCGACT GCCCTTCGAG CTGCTCTCGG ACCAGGTGCT GACCGAGGAA ACCCTCGACC GCTTCAAGCT GATCATTCTC GCCAACGCCT CCTGCCTTTC GGATGCGCAG AACGCGGCGA TCCGCGCATA TGTCGATCGC GGCGGCAGCG TGATTGCCTC TTACGAGACG TCGCTGCGTG ACGAATTTGG CAAGAAGCGT GACGAATTCG GTCTGGCCGA CGTGCTCGGC GCCAGATACG TTTCCGGCCC GCGCGGCATC GTCAAAAACA CCTATGTCGC CCTTTCCGGC AATCACCCGA TCAATCGGGG TTTCGACGGC GCCGAACGCA TCATGGGCGG CACCCGCCTG ATCCACGCCG AACCGTCGGC CGATGCGAAG ACGCCCTTCC TCTATATCCC CGATTTTCCC GACCTGCCGA TGGAAGAGGT CTATCCGCGC GAGGCCCCGA AAGGCGCTGC CGTCATCGCT CGTGAGACCG GCAAGGGTGG CCGCACGGTC TATATTCCCT GGAATATCGG CGAGATCTTC TGGGAGGTCT TTGCCGTCGA TCATGCGCGG CTCATCGCCA ACACCGTCCA TTGGGCGCTC GGCAAGACGC CGCGCGTCAC CGTCAAAGGC AAGGGCGTCG TCGATCTGGC GCTGCGCGAA AACGGTGAGG GTCTGGCGCT CAGCCTCTTC AATCTCACCA ATCCGATGAT GATGAAAGGC CCGATCCGCG ACAATTACCC TCTGGCAGCG CAGACCGTTT CGGTGGAGAT TCCGGAGGGC CGATCGGTGG CGAAGGCGTG GCTCGTTGTT GCCGACCGCG CCGCAAGCTT CAGCCTCGGG AATGGCCGCG CCGAGGTGGA GGTGCCAGGT ATCGATCGGC TTGAGGTCCT GCATCTCACC TGGAAATGA
|
Protein sequence | MLDARKQNTH TLRTPDWFKT ATRWTQLTFV EDDPEKYDPA FWIDVFKRTK SNAVCLSAGG YIAYYPSEVP YHYVSKYLGD KDIFGALVDA ARKLDMHVMA RVDPHAIHDD AAKAHPEWVM INADGTPRRH WAYPDVWVTN AYGDYNSVFM PEVVKEIVRK YDIDAVFANR WQGHGVDYSE DSARRFKDMS GHALPVKPDA EDPAWQAWVQ WRRRVLTDMI AQWDDAVKAI RPHASFIPNM GGASLMEFDL SVIARHCPFL VVDHQGRKGL ELGWSAGRNG KRIRATFPDR PVVLITSIGP EEEYRWKDAV TSGEEMQLWI NDGTAHGLYA WFTKFNGVVP DKRWVEPVAD AFGLQAAVEP VLESMKPTAE IAVVDPSTTL RHWAPEERHS AEKHDLGLYH ALVEARLPFE LLSDQVLTEE TLDRFKLIIL ANASCLSDAQ NAAIRAYVDR GGSVIASYET SLRDEFGKKR DEFGLADVLG ARYVSGPRGI VKNTYVALSG NHPINRGFDG AERIMGGTRL IHAEPSADAK TPFLYIPDFP DLPMEEVYPR EAPKGAAVIA RETGKGGRTV YIPWNIGEIF WEVFAVDHAR LIANTVHWAL GKTPRVTVKG KGVVDLALRE NGEGLALSLF NLTNPMMMKG PIRDNYPLAA QTVSVEIPEG RSVAKAWLVV ADRAASFSLG NGRAEVEVPG IDRLEVLHLT WK
|
| |