Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5064 |
Symbol | |
ID | 6412758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5448294 |
End bp | 5449430 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714949 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_001994028 |
Protein GI | 192293423 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCACCC GACGCACGAT CGCTTGCACC GCCCTCGGAC TGATCGCTCT GTGCTGGAGC GTGCAAGCTG GCGCGCAGAA CCGCTGCACC GGCCCGACAT CCGGCGTTCC GCCTGAGACG ATCGCAGCGC TGGCCCGCGG CGTGAACGTC TCAAACTGGA CCGAGAATGC GCGCCTGCAA GGACCAAGTC CAGAATTGCT CCGCGCAGTG CGCGGCGCCG GCTTCACCCA TGTTCGCCTA CCGGTCGCCG GCGATCTGGT GATGCGCGCC TTCAGTCCGC CGGCCACCAT CGAACGACAG CTCGCAGGCG TCGATGCCGC GCTCACCGAG CTGATTGGCC TCGGCTTCTC TGTTTCGATC GACCTGCATC CGGGCGATGG TTTCGGTCGC CTCCACCGCG ACGACCCCAA GGCCTCGATG GCGGCGCTGA CCGATGCGTG GGGCAATCTA GCGACGGTGA TCCGCAAGCA TCCGGCCGAG CGCGTGTTCG CCGAGCTGTT GAACGAGCCT GATATTGCGC CGGATCGTTG GCAGAGCGAG GTCGAACAGC TCGCCTTCTT CGTGCGTGGG CTGTTGCCGA AGACTACGCT GATCGTCGGG CCGACCTATT GGCAGCGCGC CGACTCGCTC CCGCAGTTCA GGCCGCTCGC CGACCGCAAC GTCGTCTATG CGCTGCATTT CTATGATCCG ATGGTGTTCA CCCATCAGGG GCATTGGGAT CCGGCTAATC CGCTGAGCCG GGTTCGCGAC CTGCCATTCC CGCTGGTCGC CAACGATCCC GCCGTCGAGC GGCTGCGCAG CGAGCTGACC TTGCGCGGCG ATAATGAAGC GCTGCAGGAG CTGGACAAAG CGATCGTTCA GGCCGGCTCG CCGAGCTATG TGGCCCGGCA GTTGATTCCG GCCGTCGCCT GGCAGGAGCG CTACGCACGA CCGCTGATCA TCAATGAGTT CGGCGTGTTT AAGCCGGCGG CGCCGCGGGA CAGCAGGCTG CGCTGGCTGG AATCGGTGGT CGATTCGGCC GAAGCCAATT GCTGGGGCTG GACCTATTGG GAGTTGGATC AGGGATTTGG TCTGGCCGAT CCGCGCACCG GCAGGTTGGA CGCTGGCGTA ATCGATGCGC TGATGCATCC GCGTTGA
|
Protein sequence | MLTRRTIACT ALGLIALCWS VQAGAQNRCT GPTSGVPPET IAALARGVNV SNWTENARLQ GPSPELLRAV RGAGFTHVRL PVAGDLVMRA FSPPATIERQ LAGVDAALTE LIGLGFSVSI DLHPGDGFGR LHRDDPKASM AALTDAWGNL ATVIRKHPAE RVFAELLNEP DIAPDRWQSE VEQLAFFVRG LLPKTTLIVG PTYWQRADSL PQFRPLADRN VVYALHFYDP MVFTHQGHWD PANPLSRVRD LPFPLVANDP AVERLRSELT LRGDNEALQE LDKAIVQAGS PSYVARQLIP AVAWQERYAR PLIINEFGVF KPAAPRDSRL RWLESVVDSA EANCWGWTYW ELDQGFGLAD PRTGRLDAGV IDALMHPR
|
| |