Gene Rpal_4228 details


Gene Information

Locus tag: Rpal_4228
Symbol:
ID: 6411912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4538075
End bp: 4539235
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 70%
IMG OID: 642714110
Product: putative nitrate transport protein
Protein accession: YP_001993199
Protein GI: 192292594
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
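
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The short sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4538075, 4539235)
print(length_bp, protein_length(length_bp))  # 1161 386
```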


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0904389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGC GGCTGCGCAT CGGATTCATC CCGCTGGTCG ACGCTGCCGC GCTGATCGTC 
GCCGCCGACA AGGGCTTCTG CGCCGCCGAG GGCCTCGACG TCGAGCTGGT GCGCGAGATC
TCCTGGGCCA ACGTCCGCGA CAAGTTCAAC ATCGGCCTGT TCGACGCCGC GCATCTGCTG
GCGCCGCTGG CGGTCGCCTC CAGCCTCGGC ATCGGTCACG TCAAGGTGCC GGTGATTTCC
GGCTTCGGCC TGGGCGTCAA CGGCAACGCC ATTACGGTAT CGCCGGACCT GCACGCCGCG
ATCGTCACGA TGGCCGATGG CGACGTCGCC GATCCGCTGG TGTCGGGACG CGCGCTGGCG
CGGGTCGTCG CCGAGCGACG CGCCAAGGGC CAGGAGCCGC TGACCTTCGG GATGACCTTC
CCGTTCTCCA GCCACAATTA CGATTTGCGG TTCTGGATGG GCGCCGGCGG CGTCGATCCG
GACGAGGACG TGCGGCTGGT GGTGCTGCCG CCGCCGTTCA TGGTGGAGAG TCTCGCCAGC
AAGCATCTCG ACGGCTTCTG CGTCGGCGCG CCGTGGAATT CGGTTGCGAT TGATCTGGGC
ATCGGAACCA TCCTGCATTT CACCAGCGAG CTGTTTCAGC GCGCCGCCGA GAAGATGCTG
ACGGTGCGGG CGACCTGGGC CGCGCAGCAC CCCGAGGTGC TGCAAGCGCT GATCCGCGCC
CACGTCCGCG CCGCCGACTA CATCGAAGAC GTCGCCAACC GCGACGAGGT TTGCGCCCTG
CTCGCTGCGC CGGGCCGAAT CGAGGTGACG CCGGAGCTGA TCCGCCGCAC CCTCGACGGC
CGACTGAAGG TCGCGGCCGA CGGCACGCTG CGCACCAGCG ACCGCTATCT GCTGGTCGGG
CGCGAAGCCG CGGCGCGGCC CGATCCGGTG CAGGGCGCGT GGAACTACGC CCAGATGGTA
CGGTGGGGAC AGGCGCCGCT GTCGACCGAT CTGCTCGCCG CCGCCAAAGC GGTGTTCCGG
CCGGACCTGT ATGACGCAGC GGTCGGCACG GCTCCTGCGT TGCCGAGCGC GCCCGCCGAC
GGCATCGGCG AGTGCACCGG CGCGCCGTTC GATCCCGACG ACATCGCCGG CTATCTGGCG
CGGATGACGA TCCGGCGCTG A
 
Protein sequence
MSERLRIGFI PLVDAAALIV AADKGFCAAE GLDVELVREI SWANVRDKFN IGLFDAAHLL 
APLAVASSLG IGHVKVPVIS GFGLGVNGNA ITVSPDLHAA IVTMADGDVA DPLVSGRALA
RVVAERRAKG QEPLTFGMTF PFSSHNYDLR FWMGAGGVDP DEDVRLVVLP PPFMVESLAS
KHLDGFCVGA PWNSVAIDLG IGTILHFTSE LFQRAAEKML TVRATWAAQH PEVLQALIRA
HVRAADYIED VANRDEVCAL LAAPGRIEVT PELIRRTLDG RLKVAADGTL RTSDRYLLVG
REAAARPDPV QGAWNYAQMV RWGQAPLSTD LLAAAKAVFR PDLYDAAVGT APALPSAPAD
GIGECTGAPF DPDDIAGYLA RMTIRR
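
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is one way to check this and to recompute GC content from the space-separated 10-base blocks shown above. It is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle alternative start codons or ambiguous bases. All names in it are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; 0.0 for an empty sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAGCGAGC GGCTGCGCAT CGGATTCATC"))  # MSERLRIGFI
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and the GC content listed in the record.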