Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3865 |
Symbol | |
ID | 6411545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4152623 |
End bp | 4153615 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642713747 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_001992838 |
Protein GI | 192292233 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAAT CACTTCTGCT GGCCGCCACG CTCACGGCTG CGCTCTCCAC CGTCAGCGGC GCGCAGGCGG CCTGCGACAA GATGGACAAG GTCACCGCAG CCTGGCTGCC GATCATGCAG ACCACCGCGT ACTACGTCGC GCTCGATCAG AAGCTGTTCG AGAAGGCCTG CATCGAGATC GACTCCGCCA AGATGGAATC GCCGAACCAG ATCATCGACG CGCTGATCGC CGGCCGCGCG GACTTCGGCC CGCCCGGCGC TGCTGCCGGC ATCGCGATGA TCGCGGAGTC GAAATTTCCC GGCAAGCTGA AGATCTTCGG GCTGCAGGGC GGCGGCATCA AGGTCGACCG CATCAATGAC GGGCTGATCG TCAAGCCGGA CAGCACCATC AAGAGCTTCG CCGATCTCAA GGGCAAGACG CTCGGCCACG TGCCCGGCAT CCAATGGCGG ACGATTTCCC GCCACATGGT GAAGGCGGCG GGCCTCGATC CGGATAAGGA CGTCAAGCTG GTCGATCTCG CCGTCGCCAT GCAGGTGCCG GCGGTGGTCG GCGGCACGGC CGACGCGACG CTGTCGCTGG AACCGGTAGG TTCGATCGCG GTCGCTTCCG GCAAGGCCAA GCGCGCAATG ACCAATCCGG TCGCCAGCGT AATCGCCGAT CCGTTCTATT CCGGCGCCTC GGTGATGACG ACCAAATTCA TGACCGAGCG CCCCGATGTC GCCCGCCGCG TGGTGGCGGT GATCGACCAG GCCACCGACC TCGTCAACGC CGACTTCAAC AAATACAAGG CGGTGTTGCC GGCCTACACG CCGATCAAGG CCGATCAGCT CGATCTGGTG GCGCAGCCTT ACTTGCGCGG CTTCAAGGAT CTCAACGACA CCGACGTCAA ATCCTATCAG GCGCTGGTCG ACGTGTTCGT TGCCGAAGGC GTGGTGCCAG GGCCGATCAA CGTTCGCGAG AAGCTGCTGA CCAAGGCGGA TATCGGCGAA TGA
|
Protein sequence | MSKSLLLAAT LTAALSTVSG AQAACDKMDK VTAAWLPIMQ TTAYYVALDQ KLFEKACIEI DSAKMESPNQ IIDALIAGRA DFGPPGAAAG IAMIAESKFP GKLKIFGLQG GGIKVDRIND GLIVKPDSTI KSFADLKGKT LGHVPGIQWR TISRHMVKAA GLDPDKDVKL VDLAVAMQVP AVVGGTADAT LSLEPVGSIA VASGKAKRAM TNPVASVIAD PFYSGASVMT TKFMTERPDV ARRVVAVIDQ ATDLVNADFN KYKAVLPAYT PIKADQLDLV AQPYLRGFKD LNDTDVKSYQ ALVDVFVAEG VVPGPINVRE KLLTKADIGE
|
| |