Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1944 |
Symbol | |
ID | 8012984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1933972 |
End bp | 1935837 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644824533 |
Product | hypothetical protein |
Protein accession | YP_002975765 |
Protein GI | 241204669 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.69459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAG CCGTTAGAAT AAACGATATT ATTCGTTCCT TTGGAATTGA TACGCATATC GACTACACAG ATGGAAAATA TTCCAACGTC GGAGAAGTTG TTAAAGCACT CGACTATCTT GGCCTTGATA CAGTTCGCGA TCACGCCCCC AACTCCGCTT CCGATCCCAA CGGCCAAACG CATCTCGGCG ATGCTGCCGA GGCCGGCGTG CAATTCGTCT TCAGCGCCCA ACGGGAAGTC GACCCCGCCA CTGTCGCCCA GCGGCTGCAT TCCTTCGTGC AGGCCCATCC AGGATCGGTC GTCGGTATCG AAGGTCCGAA CGAAGTCAAC AACTGGCCAG TCAGCTATCA CGGCCTGAGC GGCCAAGCCG CAGCGCTCGC CTATCAGAAG GACCTGTCTG CCGACGTCAA CGCCGATCCC TTGCTGAAAA ATATCCCCGT TCTCGGCTTT ACCGGATATA CCGTGGCTTC CGCCTCCGAC TACACGACGA TCCACACCTA TGCGAAGGAT GGCGACCAGC CATATTCATG GCTCTCCCGA GAATCCGGCG TGCAGCGCGC TGCCGATCCG GGCAAGCCGC TGGCGATCAC CGAGACCGGC TACCACACCT CGCTGACCGC CGACACCAAT GGCGGCTGGG AAGGCGTCAG TGAAGCGACG CAGGCAAAGC TCCTGCTCAA TACGCTGATG GACGGCGCCG CACTCGGATC AAAACAGACG TTCATCTACG AGCTGCTGGA CGCCTATTCC GATCCGCAGG GCACCAATCA GGAAAAGCAT TTCGGCCTTT TTCATCTCGA CTATTCGGCC AAGCCGGCTG CGACGGCGAT CCACAATCTG ACCGAAATCC TTGCGGATGA CGGCGCCGCG AAGGCAAGCT TCAGCGCGGG GACCCTCAAT TATTCGATCG ACGGCCTGCC GTCCTCGGCC CGGAGCCTGC TGACGGAAAA ATCGGACGGA AGCTACCAGA TCATCGTCTG GAACGAGCCC GATATCTGGA GCCAGTCCTC CGACACGGTT ATTCAGGCCA CGACAACAGC CGTCAAAGTC AATCTCGGGG CCTCGTTTGG CTCCGTTAAG GTCTTCGACC CGGTGACAGG AACGACGGCG ATCAAAAGCC TCAGCAACGT GTCGTCGCTG CCGCTCGATG TCGTCGACCA TCCCTTGATC ATCGAGGTAG CAGGCACCGG CGCCAGCACA CCGCCGCCGG CCACCAACCA TCTCTATGGC GGCACCGGTA ACGACACCTT CACCGTGACC AATGCAAATC AAATCGTCGA CGAAAGCCGG GGCGGTGGAA CAGATACCGT CAAGGCTTCG ATCTCCTTCA GCCTGGCCGA TCAGAAGCAT ACGGTCGGAA CGATCGAAAA CCTCACTTTG ACCGGGACGG GCAATCTCAG CGCGACGGGC AACAATACGG CCAACATTCT CACCGGCAAC GACGGAAACA ATTCCCTCAA CGGCGGGAAA GGAAACGACC GATTGATCGG CGGGCTCGGA AACGACAAGC TGATCGGCAA GGCCGGTGCT GACGTTCTCA CCGGCGGCGG CGGCAGCGAT TCCTTCGTCT TCGATGTGAA GCCCGACAAT ACCAGCGTCG ACAAGATCCG GGATTTTTCC TCCGCGGCGG GCGACAAGCT GATGCTCGAT CATTCGATTT TCGCCGAGCT TAGCCTATCC GGATTTTCGG ATGAGAATTT CGTTTTGGGA AGGAAAGCGC TCGAGGCTGA TGACAAGCTG ATCTACGATC AGGCGAGCGG CATTCTATCC TATGACGCGG ATGGAAGCGC GGCGGGCGCG GCCATCCATG TTACGGATCT CGATAATTCC GCAGCACTTC ACTTCAAAGA CTTCCTGCTT GTCTGA
|
Protein sequence | MAQAVRINDI IRSFGIDTHI DYTDGKYSNV GEVVKALDYL GLDTVRDHAP NSASDPNGQT HLGDAAEAGV QFVFSAQREV DPATVAQRLH SFVQAHPGSV VGIEGPNEVN NWPVSYHGLS GQAAALAYQK DLSADVNADP LLKNIPVLGF TGYTVASASD YTTIHTYAKD GDQPYSWLSR ESGVQRAADP GKPLAITETG YHTSLTADTN GGWEGVSEAT QAKLLLNTLM DGAALGSKQT FIYELLDAYS DPQGTNQEKH FGLFHLDYSA KPAATAIHNL TEILADDGAA KASFSAGTLN YSIDGLPSSA RSLLTEKSDG SYQIIVWNEP DIWSQSSDTV IQATTTAVKV NLGASFGSVK VFDPVTGTTA IKSLSNVSSL PLDVVDHPLI IEVAGTGAST PPPATNHLYG GTGNDTFTVT NANQIVDESR GGGTDTVKAS ISFSLADQKH TVGTIENLTL TGTGNLSATG NNTANILTGN DGNNSLNGGK GNDRLIGGLG NDKLIGKAGA DVLTGGGGSD SFVFDVKPDN TSVDKIRDFS SAAGDKLMLD HSIFAELSLS GFSDENFVLG RKALEADDKL IYDQASGILS YDADGSAAGA AIHVTDLDNS AALHFKDFLL V
|
| |