Gene Information |
Locus tag | Rpal_4228 |
Symbol | |
ID | 6411912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4538075 |
End bp | 4539235 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642714110 |
Product | putative nitrate transport protein |
Protein accession | YP_001993199 |
Protein GI | 192292594 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
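The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 4538075
end_bp = 4539235

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is complete and its final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1161 0 386
```

Under these assumptions the result agrees with the listed Gene Length (1161 bp) and Protein Length (386 aa).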
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0904389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC GGCTGCGCAT CGGATTCATC CCGCTGGTCG ACGCTGCCGC GCTGATCGTC GCCGCCGACA AGGGCTTCTG CGCCGCCGAG GGCCTCGACG TCGAGCTGGT GCGCGAGATC TCCTGGGCCA ACGTCCGCGA CAAGTTCAAC ATCGGCCTGT TCGACGCCGC GCATCTGCTG GCGCCGCTGG CGGTCGCCTC CAGCCTCGGC ATCGGTCACG TCAAGGTGCC GGTGATTTCC GGCTTCGGCC TGGGCGTCAA CGGCAACGCC ATTACGGTAT CGCCGGACCT GCACGCCGCG ATCGTCACGA TGGCCGATGG CGACGTCGCC GATCCGCTGG TGTCGGGACG CGCGCTGGCG CGGGTCGTCG CCGAGCGACG CGCCAAGGGC CAGGAGCCGC TGACCTTCGG GATGACCTTC CCGTTCTCCA GCCACAATTA CGATTTGCGG TTCTGGATGG GCGCCGGCGG CGTCGATCCG GACGAGGACG TGCGGCTGGT GGTGCTGCCG CCGCCGTTCA TGGTGGAGAG TCTCGCCAGC AAGCATCTCG ACGGCTTCTG CGTCGGCGCG CCGTGGAATT CGGTTGCGAT TGATCTGGGC ATCGGAACCA TCCTGCATTT CACCAGCGAG CTGTTTCAGC GCGCCGCCGA GAAGATGCTG ACGGTGCGGG CGACCTGGGC CGCGCAGCAC CCCGAGGTGC TGCAAGCGCT GATCCGCGCC CACGTCCGCG CCGCCGACTA CATCGAAGAC GTCGCCAACC GCGACGAGGT TTGCGCCCTG CTCGCTGCGC CGGGCCGAAT CGAGGTGACG CCGGAGCTGA TCCGCCGCAC CCTCGACGGC CGACTGAAGG TCGCGGCCGA CGGCACGCTG CGCACCAGCG ACCGCTATCT GCTGGTCGGG CGCGAAGCCG CGGCGCGGCC CGATCCGGTG CAGGGCGCGT GGAACTACGC CCAGATGGTA CGGTGGGGAC AGGCGCCGCT GTCGACCGAT CTGCTCGCCG CCGCCAAAGC GGTGTTCCGG CCGGACCTGT ATGACGCAGC GGTCGGCACG GCTCCTGCGT TGCCGAGCGC GCCCGCCGAC GGCATCGGCG AGTGCACCGG CGCGCCGTTC GATCCCGACG ACATCGCCGG CTATCTGGCG CGGATGACGA TCCGGCGCTG A
|
Protein sequence | MSERLRIGFI PLVDAAALIV AADKGFCAAE GLDVELVREI SWANVRDKFN IGLFDAAHLL APLAVASSLG IGHVKVPVIS GFGLGVNGNA ITVSPDLHAA IVTMADGDVA DPLVSGRALA RVVAERRAKG QEPLTFGMTF PFSSHNYDLR FWMGAGGVDP DEDVRLVVLP PPFMVESLAS KHLDGFCVGA PWNSVAIDLG IGTILHFTSE LFQRAAEKML TVRATWAAQH PEVLQALIRA HVRAADYIED VANRDEVCAL LAAPGRIEVT PELIRRTLDG RLKVAADGTL RTSDRYLLVG REAAARPDPV QGAWNYAQMV RWGQAPLSTD LLAAAKAVFR PDLYDAAVGT APALPSAPAD GIGECTGAPF DPDDIAGYLA RMTIRR
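The sequences above are shown in space-separated blocks of ten characters. As a minimal sketch of how the gene and protein sequences relate, the code below strips that spacing, computes GC content, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard NCBI amino-acid ordering; translation table 11 assigns the same amino acids and differs from the standard table only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three blocks of the gene sequence in this record.
first_blocks = "ATGAGCGAGC GGCTGCGCAT CGGATTCATC"
last_blocks = "CGGATGACGA TCCGGCGCTG A"

print(translate(first_blocks))  # prints: MSERLRIGFI
print(translate(last_blocks))   # prints: RMTIRR
```

The two printed fragments match the first and last blocks of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare with the reported GC content.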