Gene Information |
Locus tag | Rpal_5274 |
Symbol | |
ID | 6412975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5688105 |
End bp | 5689433 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642715164 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001994236 |
Protein GI | 192293631 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
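The coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(5688105, 5689433)
print(length, protein_length(length))  # 1329 442
```

Both results agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields listed above.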
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGAA TTTTGGGAAG ACTCGCGCAG ATCGCCGCCG CCGTGCTGGT TTGCGCCACC GCTGGTATCG TCCCCGCTGG CGCCGCCACC GAAATTGCCT GGTGGCATGC GATGTCCGGT GAGCTCGGCC GGCAGCTTGA GAAACTGGCG GCGGATTTCA ATGCGTCGCA ATCCGATTAC CGCGTGGTCC CGACCTACAA GGGCAATTAC ACCCAGACGG TGACCGCGGC GATCTTCGCG TTTCGCTCCT CCAGCCAGCC GACGATCGTG CAGGTCAACG AGATCGCCAC CGCCACCATG ATGGCGGCCA AGGGCGCGGT CTATCCGGTG TACGAGCTGA TGCGCGACGA GAGCGAGGTG TTCTCGCCGG CCGACTATCT GCCGGCCGTC ACCGGGTACT ACACCGATCT TAGCGGCAAC ATGCTGTCGT TTCCGTTCAA CGCGTCGACA CCAATTCTGT ACTACAACAA GACGCTGTTT CGTCGCGCTG GGCTCGATCC GGAGGTGCCG CCGCCGACTT GGCCGGAAGT CGGGACGATG GCGAAGCGGC TGATCGACGC CGGCGCAGCG TGCGGCTTCA CCACCTCGTG GCCGTCCTGG GTGCATATCG AGAACTTTTC CGCCTATCAC AACCTGCCGC TGGCGACCCA GTCGAACGGG CTGGGCGGGC TTGATGCCGA ACTGGTGTTC AACAATCCGG CGGTGGTGCG CCATATCGCG CAGCTTGCCG ATTGGCAGAA GACCAAGACC TTCGATTACG GCGGCCGCGC CACCGCGGCC GAACCGCGCT TCCAGCAGGG TGACTGCGGC ATCTTCATCG GCTCATCGGC AACGCGGGCC GACATCCTGG CCAACGCCAA GTTCGATGTC GGCTACGGCC GGCTGCCGTA TTGGCCGGAC ATCGCCGGCG CGCCGCAGAA CACCATCATC GGCGGTGCCA CACTATGGGT GCTGCGCGGC CATTCGGCGG GCGAATACAA AGGCGCCGCC AAGTTCTTCG CCTACCTGTC GAAGCCGGAA GTTCAGGCGG CCTGGCATCA GCACACCGGC TACCTGCCGA TCACAAAGGC GGCCTACGAT CTCACCCGCG CCCAGGGCTT CTACGACCGC AATCCCGGCA CCGCGATCTC GATCGAACAG ATCACGCTGA AGCCGCCGAC CGAGAATTCG CGCGGGCTGC GGCTCGGCTC GTTCGTGCTG GTGCGGGCGG CGATCGAAGA CGAGATCGAA CACGCGGTGC GGGGCGATAA GCCGGCGAAA GAGGCGATGG ACGCGGCGGT CGAGCGCGGC AACAAGCTGC TGCGGCAGTT CGAACGCACC AAGCCGTAA
|
Protein sequence | MAGILGRLAQ IAAAVLVCAT AGIVPAGAAT EIAWWHAMSG ELGRQLEKLA ADFNASQSDY RVVPTYKGNY TQTVTAAIFA FRSSSQPTIV QVNEIATATM MAAKGAVYPV YELMRDESEV FSPADYLPAV TGYYTDLSGN MLSFPFNAST PILYYNKTLF RRAGLDPEVP PPTWPEVGTM AKRLIDAGAA CGFTTSWPSW VHIENFSAYH NLPLATQSNG LGGLDAELVF NNPAVVRHIA QLADWQKTKT FDYGGRATAA EPRFQQGDCG IFIGSSATRA DILANAKFDV GYGRLPYWPD IAGAPQNTII GGATLWVLRG HSAGEYKGAA KFFAYLSKPE VQAAWHQHTG YLPITKAAYD LTRAQGFYDR NPGTAISIEQ ITLKPPTENS RGLRLGSFVL VRAAIEDEIE HAVRGDKPAK EAMDAAVERG NKLLRQFERT KP
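The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical and only the first three blocks of the gene sequence are used as a sample.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence shown above, used as a short sample.
sample = clean_sequence("ATGGCCGGAA TTTTGGGAAG ACTCGCGCAG")
print(len(sample), round(gc_percent(sample), 1))
```

Applied to the full gene sequence, `clean_sequence` should return a 1329-character string if the record is internally consistent, and `gc_percent` can then be compared against the reported GC content of 65%.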