Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5140 |
Symbol | |
ID | 8007000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 540988 |
End bp | 541926 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644822053 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002973313 |
Protein GI | 241113478 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATG CGGGCGTCTA TGTTTCGACG GACGAACTGG TCGCGCTCGA AGCGAGAGCC CGAGATCTGA GCTTCGTCCA GAAGGCGCGC AGCCATCAGC AGCTTGCAGG CCGCATGCAA TCGGCGATGC GCGGCCGGGG ACTGATCTTC GAAGAACTGC GCGACTATCT GCCCGGCGAC GACATCCGCT CCATCGACTG GCGCGTCACC GCGCGAACCA GCAGACCGGT GGTCCGCATC TATTCCGAGG AAAAGGAGCG GCCCGCGCTG ATCATCGTCG ACCAACGGAT CAACATGTTC TTCGGCAGCA GGCGATCGAT GAAATCGGTC ACGGCAGCGG AAGCCGCGAT GCTCTGCGCC TGGCGCATAC TGGGTTCCGG CGACCGGGTC GGCGGCTTCG TCTTCGGCGA AAGCGCAACG AGCGAGGCAA AACCGCATCG CAGCCGTAAT GCGGTGATTG CCTTTGCGGA ACAAATCGCA CGGCAGAACG CGAGCTTGCG CGCAGACAGC AAAAGCGAGC CTGACCCGCA GGCGTTGGAC ACGGTTTTGT CGGCGGTCGC AAATATCGCC CACCACGACC ATCTCGTGGT CGTGGTCTCC GACTTCGACG GCCATACCGC GACGACGCAA GACATCCTGC TGAGGCTCTC GAGCCGCAAC GACGTGATCT GCCTGTTGAT CTACGACCCC TTTCTACTGG ACCTGCCGAC CTCGGGCGAC ATCGTCGTCA GCGGCGGCGG CCCGCAGGCC GAGCTGGCTC TGCGGACACC AAGCGTCCGA TCGTCGATCG ACGCGTTCGC CCGCAACCGC GGCCGCGAGC TGAGAGCGTG GCAGCGCCGG CTCGGGCTTC CGATACTGCC CATATCGGCC GCCGAGGAAA CCGCGCCGCA GCTCAGGCGT CTGCTGGAGC AGTCTGCGTG GCGGCAACGG AGGCGTTGA
|
Protein sequence | MSDAGVYVST DELVALEARA RDLSFVQKAR SHQQLAGRMQ SAMRGRGLIF EELRDYLPGD DIRSIDWRVT ARTSRPVVRI YSEEKERPAL IIVDQRINMF FGSRRSMKSV TAAEAAMLCA WRILGSGDRV GGFVFGESAT SEAKPHRSRN AVIAFAEQIA RQNASLRADS KSEPDPQALD TVLSAVANIA HHDHLVVVVS DFDGHTATTQ DILLRLSSRN DVICLLIYDP FLLDLPTSGD IVVSGGGPQA ELALRTPSVR SSIDAFARNR GRELRAWQRR LGLPILPISA AEETAPQLRR LLEQSAWRQR RR
|
| |