Gene Information |
Locus tag | Rleg_2174 |
Symbol | |
ID | 8013187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2169163 |
End bp | 2170218 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644824760 |
Product | putative FHA domain containing protein |
Protein accession | YP_002975990 |
Protein GI | 241204894 |
COG category | |
COG ID | |
TIGRFAM ID | |
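The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch, not part of the record itself: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_length, protein_length = cds_lengths(2169163, 2170218)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the Gene Length (1056 bp) and Protein Length (351 aa) fields listed above.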
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00685953 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0299055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGATG ATGTTGTCGG AGCTTATGGC AGCCGTTTCC TGCTTGCCGC CGGTGGCGTC GGTCTGGCGC TTCTCCTGCT CATCATCGTG CTCTGGGTGA TCCGCAGCCG GGCGCCCTCG CCCTTCGTGC GCGGCGGCCG CAACCGGCAG CCCCGGCTGC AGGTGCTGGA TGCCGCAGCC GTCGATGCCC GTCGCCGGCT GGTGCTGGTG CGCCGCGACG ACGTCGAACA CCTGATCATG ATCGGCGGCC CGAGCGATAT CGTCATCGAA AGCCGTATCC TGCTCGCAGC GGCCGAACAG CCGGAAAGCG TCAGCGGCAC GCAGCAGCCC GCCGAGCAGC GTCCGATATC GGTCGCGCGG CCGGAAACAC CGCCGGTCTC TCCACCACGC CCGCCGGTCG CAGCCCGTGT CGAGCCGGCC GCCGAGCCCA CTTTCTCTGC GCCGGTTTCG CCAGAGCCGC GCCCGCGCCC AGAACCACCG GCCCAACCGC CGTCTCAGCC TGCCGTGGCA CCGCCGGTGG TCACGAGCCC TCTTCCGGCA GAGCCCGTGA CAGCTCCGCT GTCGGCCGAA CGCGACAATC CTCTGCGCGC CGTCCCGCCC CAGCCGCGCC CGCAGGAGCG TCCCGCCGCT CCCCCAGCCG CGCAGCCCGC ACCGTTTCAC GATGCCTCAA GTGCCGCCGA GATCCTCGAT GCCGCCCGCC AGCGCGTGCT GCCGCAGCAG CGCATCGAAC CCGAGGTCTC CGCCCCGCCT GTCCGGGACA TGCCGGCGGC CGCGCGCGCC GCACCAGGTA GCGCCGAAGA CGAGGCGGCT GCGCAGTCGG CAGCGGCGAT CCGTCATGAT TTCCAGCGGG TGCTGGAAGA GGAAATGTCG AACAATCTGA CGGCCGAACG CATCGTGCCG GCGCCGGCAA ACCAGGCGCC TCGCCAAGCG GTGTCGCAGC CGGCCAACCT GCCACGCCGT GATCCCGAGC TTGCCCCGAT CACCGGCGCC GATACCGAGC TGCAGAAGGA AGTCGCCCGC ATCTTCGGCG AAATGAGTGT CAATCGCGAC AAGTGA
|
Protein sequence | MLDDVVGAYG SRFLLAAGGV GLALLLLIIV LWVIRSRAPS PFVRGGRNRQ PRLQVLDAAA VDARRRLVLV RRDDVEHLIM IGGPSDIVIE SRILLAAAEQ PESVSGTQQP AEQRPISVAR PETPPVSPPR PPVAARVEPA AEPTFSAPVS PEPRPRPEPP AQPPSQPAVA PPVVTSPLPA EPVTAPLSAE RDNPLRAVPP QPRPQERPAA PPAAQPAPFH DASSAAEILD AARQRVLPQQ RIEPEVSAPP VRDMPAAARA APGSAEDEAA AQSAAAIRHD FQRVLEEEMS NNLTAERIVP APANQAPRQA VSQPANLPRR DPELAPITGA DTELQKEVAR IFGEMSVNRD K