Gene Information |
Locus tag | Rpal_2255 |
Symbol | |
ID | 6409915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2447554 |
End bp | 2448576 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642712139 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_001991251 |
Protein GI | 192290646 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
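The coordinate and length fields in the gene record above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single stop codon at the end of the CDS (both are assumptions, although they agree with the values listed here). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length_bp = gene_length(2447554, 2448576)
print(length_bp)                  # 1023
print(protein_length(length_bp))  # 340
```

Both results match the Gene Length (1023 bp) and Protein Length (340 aa) fields.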
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.387928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCCG GCTTTCGACA CTTCGCTGCG GCGCTTGCCG CGATCCTGCT CACCGCCGGG GCTGCGCAGG CGCAGAGCAA GGTGACGATC GCGATCGGCG GCGGCGCCTG CCTGTGCTAT CTGCCGACCG TGCTGGCCAA GCAGCTCGGC GAGTACGACA AGGCCGGACT CGCCGTCGAA CTGGTCGATC TCAAGGGCGG CTCGGATGCG CTCAAAGCCG TGCTCGGCGG CAGCGCCGAC GTCGTCTCCG GCTATTTCGA CCACACCGTC AACCTTGCCG CGAAGAAGCA GGAGATGCAG AGCTTCGTGG TCTACGACCG CTATCCCGGT CTGGTGCTGG CGGTGTCGCC GGGCCACACT GCGGAGATCA AGTCGATCAA GGAACTCGCC GGCAAGAAGG TTGGCGTGAG CGCACCGGGC TCGTCGACCG ATTTCTTTCT CAAGTACCTG CTGAAGAAGA ACGGCGTCGA CCCGAATAAT GTCGCGGTGG TCGGCGTCGG CCTCGGCGCC ACCGCGGTGG CGGCGATGCA GCAGGGCCAG ATCGACGCGG CGGTGATGCT CGATCCGGCG GTGACGATCC TGCAGGCGGC GCACGCAGAC CTCCGCATCC TCAGCGACAC CCGCACCGAG CACGACACGC GCGAGGTGTT CGGCGGCGAC TATCCGGGCG GCGCGCTGTA CGCCACCACG GCCTGGATCA AGGCACATCC GAACGAGGCG CAGGGGCTCA CCAAGGCGAT CCTCGGCACG TTGAATTGGA TCCATGCGCA TTCGGCCGAG GAAATCGCCG ACAAGATGCC GGCCAACATC GTCGGCAAGG ACAAGGCGCA ATACGTTGCC GCGCTGAAGA ACACGATCCC GATGTACTCG ACCACAGGCC TGATGGACCC GAAGGGGGCG GATGCGGTGC TGGCGGTTTT CAGCACCAGC TCGCCGGACG TGGCGAGGGC GAATATCGAC GTCACCAAGA CCTACACCAA CGCCTTCGTC GAACAGGCGA AGGCGTCTGG TGCCGCGAAA TAA
|
Protein sequence | MKAGFRHFAA ALAAILLTAG AAQAQSKVTI AIGGGACLCY LPTVLAKQLG EYDKAGLAVE LVDLKGGSDA LKAVLGGSAD VVSGYFDHTV NLAAKKQEMQ SFVVYDRYPG LVLAVSPGHT AEIKSIKELA GKKVGVSAPG SSTDFFLKYL LKKNGVDPNN VAVVGVGLGA TAVAAMQQGQ IDAAVMLDPA VTILQAAHAD LRILSDTRTE HDTREVFGGD YPGGALYATT AWIKAHPNEA QGLTKAILGT LNWIHAHSAE EIADKMPANI VGKDKAQYVA ALKNTIPMYS TTGLMDPKGA DAVLAVFSTS SPDVARANID VTKTYTNAFV EQAKASGAAK
|
| |
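The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a few basic checks on a coding sequence. The helper names `ungroup` and `check_cds` are hypothetical, and the stop-codon set is the one used by translation table 11, as listed in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def ungroup(seq: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def check_cds(seq: str) -> dict:
    """Basic sanity checks on a grouped nucleotide CDS string."""
    dna = ungroup(seq)
    return {
        "length_bp": len(dna),
        "whole_codons": len(dna) % 3 == 0,
        "start_codon": dna[:3],
        "ends_with_stop": dna[-3:] in STOP_CODONS,
    }


# Toy input (not the full gene): the first four codons of the gene
# sequence above, followed by its stop codon.
toy = "ATGAAGGCCG GCTAA"
print(check_cds(toy))
```

Passing the full Gene sequence field to `check_cds` in place of the toy string would apply the same checks to the whole record.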