Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5411 |
Symbol | |
ID | 6978505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1055648 |
End bp | 1057168 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643394513 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002279331 |
Protein GI | 209547413 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.253312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00154474 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACTGC ATTTGGTAGC TGCCTGCTTT TCAACCGCGA CAATCGGGCT GAGTATCGGC TCGGCTCACG CTGAAGATGC CAAGAGCAAC GTCACTGTTG TGCTTGCCGA AACCGTCGAT GTCGTCGAGC CCTGCATGGC AGCGCGCCAG GATGTCGGCC GGGTCATTTC CGAAAACGTC AACGAGATGC TGGTGGAATT CGATTACGTC AATGGCGGCC TCAAACCCCG CCTGGCGACG GAATGGTCGA AGATCGACGA CGACACCTGG GAGTTCAAGC TGCGCCCGAA TGTCAAATGG CACGATGGCA AACCGTTCAC CGCCAAAGAC GTTCAGTTCA CGATCGAGCG CAACAAGAAC AAGAAGCTCA GCTGCGAGAC CGGCGGCAAA TATTTCGGCG GCACGGAATT CAGCTTCGAG ACGCCTGATG CCAACACCAT CCGCATTACG ACAAAACCGG CGCAGCCGAT TCTTCCGTTG CTGATGACGG TGATGGCCGT CGAATCGGCC GAAGCGACAC CTGCAGACGA ATTTACCCGC AAGCCGATCG GCACCGGCCC CTATACATTC GACAAATGGG AGATCGGCCA GTCGATCGAG CTGAAGCGTA ATCCGGACTA TTGGGGGGAC AAGCCGCAGG TGGAGCAGGC GACCTATCTG TTCCGCTCGG ATAGTGCTGT CGCGGCGGCG ATGGTCGATG CCGGCGAAGC CGATATCGTT CCGGCCGTGT CGGTGCAGGA TGCCACCAAC AAGGAAACCG ATTTCGCCTA TCCGAATTCG GAGACGACGT CGCTGCGCAT CGACACCCGC GCAGCACCGC TCAACGATCG GCGCATACGC GAAGCGATGA ACCTCGCCAT CGATCGTCAG GCGATGCTCG GAACGCTGTT TCCCGAACAG GCAAAGATCG CCACGCAACT CGTCGTACCC ACCACGATCG GCTACAATGC CGATATCCCC GCCTGGCCCT ATGATCCCGA AAAAGCAAAG GAACTGGTCA AAGCGGCAAA AGCCGACGGC GTGCCGGTCG ATCAGCAGAT CCGCATCATC GGCCGTAACG GGCAATATCC CAACGCCACC GAAGCGATGG AAGCGATGAT GGCCATGCTT CAGGACGTCG GCTTGAACGT CAAACTCGAC ATGTACGATG TTTCCGTGTG GAACGGCTAT TTCGTTGCAC CCTTCGTTGC CGATTCCGGT CCGACATTGA CCCAGTCGCA GCACGACAAT GCCACCGGCG ATCCCGTCTT CACCGCATTC GTGAAGTACG CCACCGACGG CTCCCATTCC ATGGTTCGGG ATCCCGCGGT CGACGCCCTT ATCGCCAAGG CGACCTCAGC CACCGGCGAC GAGCGCAAGA AACTCTGGAA GGAGCTTTTC GCCAAGGTGA ACGCCGAGAT CATCGCCGAT ATTCCGATGT TCCATATGGT CGGTTTCACC CGCGTTTCGC CGCGTCTTGA CTTCAAGCCG ACGATCGCGA CGAATTCCGA GCTGCAGCTG TCGCAGATCC GCTTCAAGTA A
|
Protein sequence | MKLHLVAACF STATIGLSIG SAHAEDAKSN VTVVLAETVD VVEPCMAARQ DVGRVISENV NEMLVEFDYV NGGLKPRLAT EWSKIDDDTW EFKLRPNVKW HDGKPFTAKD VQFTIERNKN KKLSCETGGK YFGGTEFSFE TPDANTIRIT TKPAQPILPL LMTVMAVESA EATPADEFTR KPIGTGPYTF DKWEIGQSIE LKRNPDYWGD KPQVEQATYL FRSDSAVAAA MVDAGEADIV PAVSVQDATN KETDFAYPNS ETTSLRIDTR AAPLNDRRIR EAMNLAIDRQ AMLGTLFPEQ AKIATQLVVP TTIGYNADIP AWPYDPEKAK ELVKAAKADG VPVDQQIRII GRNGQYPNAT EAMEAMMAML QDVGLNVKLD MYDVSVWNGY FVAPFVADSG PTLTQSQHDN ATGDPVFTAF VKYATDGSHS MVRDPAVDAL IAKATSATGD ERKKLWKELF AKVNAEIIAD IPMFHMVGFT RVSPRLDFKP TIATNSELQL SQIRFK
|
| |