Gene Information |
Locus tag | RPB_0445 |
Symbol | |
ID | 3910001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 491316 |
End bp | 492596 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882331 |
Product | glycoside hydrolase family protein |
Protein accession | YP_484067 |
Protein GI | 86747571 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
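
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the convention that reproduces the stated gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(491316, 492596)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the sketch prints `1281 426`, matching the Gene Length and Protein Length rows.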
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.482345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGAA CGACCAGGAT CGTGTTGCTC GGCGCCAGCA GTGCGTCATT CGGCCTCAGC ATGTTGCGCG ATCTGTTCGC CACGCCGGAG CTGCGCGGGT CGACGCTGGT GATGGTCGGG CTCGATGCGG CGAGGCTCGC GACCATGGCC GAGCTGGCGA AGCTGCTGAA CGCGACGACC GGCGCCGGCT TCGTCATCGA ACACACCACC GACCGCCGCG CCGCGCTGGA CGGCGCAAGC TTCGTCATCA ACGCCACCGC GATCGATCGC AACCGGCTGT GGAAGATGGA TTTCGAGGTG CCGAAGAAGC ACGGCATCCG GCATCCGCTG GGTGAGAACG GTGGCCCCGG CGGATTGTTC TTCACGTTGC GGACGCTGCC GCTGGTGTTC GATTTCATCC GCGACATCGA GGAGCTTTGC CCCGAGGCGC TGTTTCTCAA CTACTCCAAT CCGGAAAGCC GCATCGTGCT GGCGCTCGGG CGCTATTCGA AGGTGCGCTG CATCGGCCTG TGTCACGGCA TCTTCATGGG CCGCGACGCC GTCGCCGACA TCATGGGACT GCCGCGCGAG CGCGTCGAGG TGTGGGGCGC GGGGCTCAAT CACTTCCAGT GCCTGCTGCA GATCCGCGAC CGCCTCACCG GCGAAGACCT CGCGCCGCGG CTGCGCGCGG CGGAGCAGAG CTTCGATCCC AATGCCTGGC GCTTCACCCG GCGGCTGTAT CGCGCCTTCG GCCACTGGCT GACCTGCAGC GACGATCATC TCGGCGAGTA TCTGGCTTAC GGCTGGGAGG CCGGCGAGCG CGGCTATGAT TTCGCCGGCG ACGACCGCAG CCGCGTCGAG ACCCTGGCGC AGATCGACGC CGTGCTGGCC GGGACGATGC CGATCCCACA TTGGTGGACC GAGCCCTCGG GCGAGCGCGG CGCCGCGGTG ATCGCCGCGA TGCTGCACGA CCAGAAGCGC TTCATCGAAT CCGGCATCGT GATGAACCGC GGCGTCATCC CCAATCTGCC GGCGGAGCTC GCCGTCGAGG TGCCGGTGAC GGTCGACGCC GCCGGGGTGC ATCCGGTGTC GCTCGGTCCA CTACCAGATC CGATCGCCAA GCTGATGCTG ATGCAGGCCA GCGTGCAGCA ACTCGCGGTC GAGGCGGCGG TGCACGCCTC GAAAGAACTC GCGCTGCAGG CGCTGCTGAT CGACCCGGTG GTCAACTCAG CGGTCGCCGC GGAAAAGATC CTCGACGAGC TGTGGGAGAT CAACCGGCCC TATATCAGGG CGTGCGTGTA G
Protein sequence | MARTTRIVLL GASSASFGLS MLRDLFATPE LRGSTLVMVG LDAARLATMA ELAKLLNATT GAGFVIEHTT DRRAALDGAS FVINATAIDR NRLWKMDFEV PKKHGIRHPL GENGGPGGLF FTLRTLPLVF DFIRDIEELC PEALFLNYSN PESRIVLALG RYSKVRCIGL CHGIFMGRDA VADIMGLPRE RVEVWGAGLN HFQCLLQIRD RLTGEDLAPR LRAAEQSFDP NAWRFTRRLY RAFGHWLTCS DDHLGEYLAY GWEAGERGYD FAGDDRSRVE TLAQIDAVLA GTMPIPHWWT EPSGERGAAV IAAMLHDQKR FIESGIVMNR GVIPNLPAEL AVEVPVTVDA AGVHPVSLGP LPDPIAKLML MQASVQQLAV EAAVHASKEL ALQALLIDPV VNSAVAAEKI LDELWEINRP YIRACV
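
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch makes no claim about how the database itself derived the GC content value it reports.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to evaluate the whole gene.
example = clean_sequence("ATGGCCAGAA CGACCAGGAT")
print(len(example), gc_percent(example))
```

Applying the same two helpers to the complete gene sequence field gives a value that can be compared with the GC content row of the record.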