Gene Information |
Locus tag | RPB_1010 |
Symbol | |
ID | 3909134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1157846 |
End bp | 1159168 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882903 |
Product | glycoside hydrolase family protein |
Protein accession | YP_484631 |
Protein GI | 86748135 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
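The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1157846
END_BP = 1159168


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1323
print(protein_length(length_bp))  # 440
```

Both results match the Gene Length (1323 bp) and Protein Length (440 aa) fields of the record.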
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCG CGGTGCCGAG CCATCTCGTC GATCCGCCGG CGATCGTCCG GGACGCCGCC GATCTGCGGC CGGCGGGGCG CGGCGGCCAG TTGCTGCCGC CCGGCTATCT CGGCACGCGG GGCAGCCAGA TCGTCGACGT GACCGGCCGG CCGGTGCGGA TCGCCTCGAT CGGCTGGAAC GGCACCGAGG GCCCGCCCGG CGCAGCACCC TCGGGGATCT GGAAGGTCAG CTACCGGACC GTTCTCGACT CGATCGTCGC CGCGGGCTTC AACACCGTGC GGATTCCATG GACGGATATC GGCCTCGACA CGCCGCTGAA CGGCTACAGC GACCGGCTCG GCTGGATCAA CACCACGCTC AATCCCGACC TGCTGGCATC CGACACGCCC GACGCCAACG GGCGCTATCG CTACGTCACC ACGCTGGTGG CGTTTCAGCG CATCGTCGAC TACGCCGGCG ACATCGGCCT GAAGGTGATT TTCAATCACC ACACCAATCA GGGCACCGCG GGGCAGCAGC GCAACGGGCT GTGGTTCGAT CTCGGCCCAG GCACCGACAA CACCGACGGC ATCAAGCCGG GCCGGTTCAC CGCGCAGGAC TTCAAGCAGA ACTGGCTGCG GGTGGCGCGG ACCTTCGCCG GCAATCCGAC CGTGATCGGC TACGATCTGC ACAACGAGCC CAACGGCGAC CGCGGCGCCA TCACCTGGGG CGGCGGCGGG CCGACCGACA TCAAGGCGAT GTGCGAGGAC GTCGGCTCGG CGATCCAGGA CGTCAGCCCC GACGTGCTGA TCATCTGCGA GGGGCCGGAG ACCTACAAGC CGCCGCCGGA ATCGTCGGGG ATGGACCCGC GCCACGCCGC GCCCGCGGGC AATCTCACCG CGGCGGGCGC CAATCCGGTG CGGCTCAAGA TCCCGCACAA GCTGGTGTAT TCGATCCATG AATATCCGGA GGAGATCGCC GACACCAAGC GCTGGGGCAT TCCGGAGACC GGCAAGGGCT TCATCGACCG GATGAACACC ACCTGGGGCT ATCTGGTGCG CGACGACATC GCGCCGGTGT GGATCGGCGA GATGGGCGCA TCATTGCGGA CGCCCGAGAC GCGCGAATGG GCGCGCAATC TGATCGACTA CATGAACGGC AAATACGGCA GCGAGGGCGG CCCGGCCTTT TCGGGCGATC AGCAGCCGAT CAGCGGCAGC TGGTGGCTGA TCGGTCCGTC GAACGATCCG CCCTATGGGC TGCAGACGGA CTGGGGCGTC GGCCACTATC GACCGGACCA GATCGCGATC ACCGACCAGA TGCTGTTTCG GCCGCGCAAG TAG
|
Protein sequence | MDAAVPSHLV DPPAIVRDAA DLRPAGRGGQ LLPPGYLGTR GSQIVDVTGR PVRIASIGWN GTEGPPGAAP SGIWKVSYRT VLDSIVAAGF NTVRIPWTDI GLDTPLNGYS DRLGWINTTL NPDLLASDTP DANGRYRYVT TLVAFQRIVD YAGDIGLKVI FNHHTNQGTA GQQRNGLWFD LGPGTDNTDG IKPGRFTAQD FKQNWLRVAR TFAGNPTVIG YDLHNEPNGD RGAITWGGGG PTDIKAMCED VGSAIQDVSP DVLIICEGPE TYKPPPESSG MDPRHAAPAG NLTAAGANPV RLKIPHKLVY SIHEYPEEIA DTKRWGIPET GKGFIDRMNT TWGYLVRDDI APVWIGEMGA SLRTPETREW ARNLIDYMNG KYGSEGGPAF SGDQQPISGS WWLIGPSNDP PYGLQTDWGV GHYRPDQIAI TDQMLFRPRK
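The relationship between the gene sequence, the translation table, and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the sketch does not handle alternative start codons (this gene begins with ATG, so none is needed).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid assignments
# are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGGACGCCG CGGTGCCGAG CCATCTCGTC"
print(translate(prefix))  # MDAAVPSHLV
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the Protein sequence and GC content fields of the record.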