Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0382 |
Symbol | |
ID | 6408029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 405710 |
End bp | 406990 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642710293 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_001989418 |
Protein GI | 192288813 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.126645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAACA CCACCAAGAT CGTGTTTCTC GGCGCCTCCA GTGCGTCGTT CGGTCTCAGC ATGTTTCGTG ACCTGTTCGC TTCGCCGGTG CTGGCCGGTT CGACGCTGAC GCTGGTCGGA CGCAATGCGG AAACGCTCGG CAGAATGACC GAGCTGGCGA AGCTGATGAA CGCAAGGTCC GGTGCCGGAC TGATCATCGA GCAGACCACC GACCGCCACG CCGCGCTGGA CGGCGCCGGC TTCGTCATCA ACGCCACCGC GATCGATCGC AACCGGCTGT GGAAGCAGGA CTTCGAGGTG CCGAAGAAGC ACGGCATCCG TCACACCCTC GGCGAGAACG GCGGCCCGGG CGGATTGTTC TTCACGCTGC GCACGCTCCC CTTGGTGTTC GACTTCATCC GCGACATCGA GGAGCTGTGC CCGAACGCGC TGTTTCTCAA CTACTCCAAT CCCGAAAGCC GGATCATTCT GGCGCTCGGC CGCTACACCA AGGTGCGCCA TATCGGACTG TGTCACGGCA TCTTCATGGG CCGCGACGCG GTCGCCTACA TCATGCAGAT GCCGCGCGAA GAGATCGAAG TGTGGGGCGC CGGGCTCAAT CACTTCCAGT GCTTGACCGA GATCCGCCAC CGTGACACCG GCGAGGATCT GTATCCGCGG TTTCGCGCCG CCGAGCAGAG CTTTGATCCG GATGCGTGGC GGTTCACGCG ACGGCTGTAT CGCGCGTTCG GCTATTGGCT GACCTGCAGC GATGATCATC TCGGCGAGTA TCTGCCGTAT GGCTGGGAAG CCGGCGAGAA GGGCTACGAT TTCGACCAGG ACGAACGCTG GCGCGGCGAA TTCCTCACCC AGCTGAATGG CGTGCTCGGC GGAACCATGC CGATCCCGCG GTGGTGGACC GAACCGTCGG GCGAGCGCGG CGCCGCCGTG ATCGCCGCGA TGCTGCACAA CCAGAAGCGT TTCATCGAAT CCGGCATCGT GCTCAATCGC GGCGTGATCC CCAACCTGCC GGCGGAGCTC GCGGTCGAAG TCCCGGTGAC TGTAGATGCG GCCGGCGTGC ATCCGGTGTC GCTCGGCCCG TTGCCCGACC CGATCGCCAA GCTGATGCTG ATGCAGGCCA GCGTGCAGCA GCTCGCGGTC GAGGCGGCCG TCCACGCCTC GAAGGAACTG GCCCTGCAGG CGCTGCTGAT CGATCCGGTG GTCAACTCGG CGGTCGCGGC CGAAAAGATC CTGGACGAAT TGTGGGAGAT CAACCGGCCG TATATTCGGA AGTGTGTGTA G
|
Protein sequence | MTNTTKIVFL GASSASFGLS MFRDLFASPV LAGSTLTLVG RNAETLGRMT ELAKLMNARS GAGLIIEQTT DRHAALDGAG FVINATAIDR NRLWKQDFEV PKKHGIRHTL GENGGPGGLF FTLRTLPLVF DFIRDIEELC PNALFLNYSN PESRIILALG RYTKVRHIGL CHGIFMGRDA VAYIMQMPRE EIEVWGAGLN HFQCLTEIRH RDTGEDLYPR FRAAEQSFDP DAWRFTRRLY RAFGYWLTCS DDHLGEYLPY GWEAGEKGYD FDQDERWRGE FLTQLNGVLG GTMPIPRWWT EPSGERGAAV IAAMLHNQKR FIESGIVLNR GVIPNLPAEL AVEVPVTVDA AGVHPVSLGP LPDPIAKLML MQASVQQLAV EAAVHASKEL ALQALLIDPV VNSAVAAEKI LDELWEINRP YIRKCV
|
| |