Gene Information |
Locus tag | Rleg2_3702 |
Symbol | |
ID | 6982464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3829962 |
End bp | 3831572 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643398424 |
Product | HemY domain protein |
Protein accession | YP_002283191 |
Protein GI | 209551274 |
COG category | [S] Function unknown |
COG ID | [COG3898] Uncharacterized membrane-bound protein |
TIGRFAM ID | |
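The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3829962, 3831572)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed gene length of 1611 bp and protein length of 536 aa.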
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCC GCCTTGTCGC CTTCGCCCTC TTCGTGCTGC TCCTCGCCTA TGGCTTCTCC TGGCTCGCCG ATCGTCCCGG CGACCTCTCG CTGATCTGGG AGGGCCAGAT CTACCAGACG AAACTGATCG TCGCCGCCAG CGCGATCATC GCTCTCATCG CCGCCGTGAT GATTGCCTGG TGGTTCGTCC GCCTGGTCTG GACCTCGCCG CATTCGGTGA CGCGGTATTT CCGTGCCCGC AAGCGAGACC GCGGTTATCA GGCGCTGTCG ACCGGCCTGA TCGCCGCCGG CGCCGGCAAT GCGCTGCTCG CCCGCAAGAT GGCCGCCCGC TCGCGCGGCC TCATCCGCGC CGATCAGGAG CCGCTGATCA GCCTACTCGA AGCCCAGGCC GCTCTGATCG AAGGCCGCTA TGATGAGGCC CGTGCCAAGT TCGAGGCCAT GGCCAACGAT CCCGAGACAC GCGAACTCGG TCTGCGCGGC CTCTATCTGG AGGCCCGCCG CCTCGGCGCT AACGAGGCCG CCCGGCAATA TGCCGAGAAG GCCGCCGACA ATGCGCCTTA TCTGCCCTGG GCCGCACAGG CGACCCTCGA ATATCGCAGC CAGGCCGGCC GCTGGGACGA TGCGATCCGC CTGCTCGAAC AGCAAAAGGC CGCCCGTGTC GTCGAAAAGG CCGACGCCAA CCGTCTGCAC GCGGTCCTGT TGACGGCGCG CGCCGGCGAG AAGCTGGAAG GCAATCCGGC CGGCGCCCGC GACGACGCGC AACAGGCGCT GAAGCTCGCC GCCGATTTCA TTCCGGCAGC CCTCGTTGCC GCAAAGGCGC TGTTTCGCGA AGGCGGGGTG CGCAAGGCCG CCTCGATCCT CGAACAGGCA TGGAAATCGG CACCGCATCC CGAAATCGGC CAGGCCTATG TCCGGGCCCG CAGCGGCGAT TCCACGCTCG ACCGGCTGAA GCGCGCCGAA CGGCTGGAGG GGCTGCGCCC GAACAACGTC GAATCCCTTC TCGTCGTCGC CCAAGCCGCA CTCGACGCGC AGGAATTCGC CAAGGCCCGC GCCAAGGCGG AAGCGGCAGC CCGCATGCAG CCGCGTGAAG CCGCCTTCCT GCTTTTGGCC GACATCGAGG AAGCCGAGAC CGGAGACCAG GGCCGCGTGC GCCACTGGCT GGCCCAGGCG CTGAAAGCCC CGCGCGATCC CGCCTGGGTG GCGGATGGTT TCGTGTCGGA CAAATGGCTG CCGGTATCGC CGGTAACCGG TCGCCTCGAC GCCTTCGAAT GGAAGGCGCC CTTCGGCCAG ATCGAGGGTC CGCTCGAAGA CGGCTCAGTG CCGCCCATCG AAACCGCTCT GAAAACCTTG CCGCCGCTGC GCGACGTCAG GCCGGAAAGC CCGGTCAATG ACCATCGGAT CATCGAACTG GAACGCGCCG CGACGATCGC CGAGGCTGTG CGCCCGACGC CGGCGCCGGC ACCGGCAAAA TCGAAGCCCG TCGAACCCGT CAGCGGCAAA ACGCCCGCGC CGGGCGAAGC GAAACCTTTC TTTGGCGGCC TGCCGGATGA TCCCGGCGTT CGCGACCCCA GGGTGGACCC GGAACCCAAG ACGCGGCTCC GCCTTTTCTG A
Protein sequence | MLIRLVAFAL FVLLLAYGFS WLADRPGDLS LIWEGQIYQT KLIVAASAII ALIAAVMIAW WFVRLVWTSP HSVTRYFRAR KRDRGYQALS TGLIAAGAGN ALLARKMAAR SRGLIRADQE PLISLLEAQA ALIEGRYDEA RAKFEAMAND PETRELGLRG LYLEARRLGA NEAARQYAEK AADNAPYLPW AAQATLEYRS QAGRWDDAIR LLEQQKAARV VEKADANRLH AVLLTARAGE KLEGNPAGAR DDAQQALKLA ADFIPAALVA AKALFREGGV RKAASILEQA WKSAPHPEIG QAYVRARSGD STLDRLKRAE RLEGLRPNNV ESLLVVAQAA LDAQEFAKAR AKAEAAARMQ PREAAFLLLA DIEEAETGDQ GRVRHWLAQA LKAPRDPAWV ADGFVSDKWL PVSPVTGRLD AFEWKAPFGQ IEGPLEDGSV PPIETALKTL PPLRDVRPES PVNDHRIIEL ERAATIAEAV RPTPAPAPAK SKPVEPVSGK TPAPGEAKPF FGGLPDDPGV RDPRVDPEPK TRLRLF
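The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before the sequence is used elsewhere. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence listed above.
example = "ATGCTGATCC GCCTTGTCGC"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the complete Gene sequence field to `gc_percent` gives a value that can be checked against the GC content listed in the Gene Information table.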