Gene Rpal_2403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2403 
Symbol 
ID6410065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2592412 
End bp2593800 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content62% 
IMG OID642712282 
ProductABC transporter nitrate-binding protein 
Protein accessionYP_001991392 
Protein GI192290787 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0310148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACGT TCGACAATCC GTTCGATCCC AATCGCCGGC TGCACACCAC GGGTTGTAGT 
TGTGGTCGTC ACGCTACCGA GGCTGAGCAC GCTGCCGAGC AGGCCGCCGC GTTGCAGGGC
ACCGTGATGC AGGGCGAAGA GAAGCGGTTC GAAGGCGTCG TCGCGTCCGC GGTGATGCGC
GCAATGTTTC CGCAGGATGC CTCGCGGCGC GCCTTTCTGA AGTCGGTCGG CGCTGCCACG
GCACTCGCCG CGGTGTCGCA GTTTTTCCCC CTGCAGACCG CAACCGATGT GTTCGCCTCG
GGCGGTCCGC TGGAAAAGAC CGACCTCAAG GTCGGTTTCA TTCCGATCAC CTGCGCTACG
CCAATCATCA TGGCGCATCC GATGGGCTTC TATGCGAAGT ACGGCCTCAA CGTCGAAGTG
ATCAAGACCG CAGGCTGGGC GGTGATCCGC GACAAGACGC TGAACAAGGA ATACGACGCC
GCGCATATGC TGTCGCCGAT GCCGCTGGCG ATCACGATGG GCGTCGGCTC CAATCCGATC
CCGTACACCA TGCCGGCGGT CGAGAACATC AACGGCCAGG CGATCACCCT GGCGATGAAG
CACAAGGATC GCCGCAATCC GAAGGACTGG AAGGGCTTCA AATTCGCCGT TCCATTCGAC
TATTCGATGC ACAATTACCT GCTCCGCTAC TATTTGGCCG AGCATGGTCT CGACCCCGAC
GTCGACGTCC AGATCCGCGC CGTGCCGCCG CCGGAAATGG TCGCCAACCT GCGTGCCGAC
AATATCGACG GCTATCTCGC GCCCGATCCG ATGAACCAGC GTGCGGTCTA TGACGGAGTC
GGCTTCATCC ACATCCTCAC CAAGGAAATC TGGGACGGCC ACCCGTGCTG CGCCTTCGCC
GCGTCGAAGG AATTCGTCAC CTCGATGCCG AACACCTACG GCGCGCTCCT GAAGTCGATC
ATTGAGGCGA CCGCCTACGC GCACAAGCCG GAGAACCGCA AGGAAATCGC CCAGGCGATC
TCGCCGGCGA ACTACCTGAA CCAGCCGGCG ATCGTACTCG AACAGATACT CACCGGCACC
TATGCGGACG GCCTCGGCAA CGTCGTCAAG CAGCCGAACC GGATCGATTT CGATCCGTTC
CCGTGGCAGT CGTTCGCGAT CTGGATCATG ACCCAGATGA AGCGCTGGGG GCAGATCAAG
GGCGACGTCG ACTACAAGAC GATCGCCGAG CAGGTCTATC TGGCGACCGA CACGGCGAAG
CTGATGAAGG AAGCAGGCCT CGCGGCCCCC GACACCACGT CGCGATCGTT CTCGGTGATG
GGCAAGCCGT TCGATGGCTC CAACCCGGAT CAGTATCTCG CCAGCTTCAA GATCAAGAAG
GCCTCGTAA
 
Protein sequence
MSTFDNPFDP NRRLHTTGCS CGRHATEAEH AAEQAAALQG TVMQGEEKRF EGVVASAVMR 
AMFPQDASRR AFLKSVGAAT ALAAVSQFFP LQTATDVFAS GGPLEKTDLK VGFIPITCAT
PIIMAHPMGF YAKYGLNVEV IKTAGWAVIR DKTLNKEYDA AHMLSPMPLA ITMGVGSNPI
PYTMPAVENI NGQAITLAMK HKDRRNPKDW KGFKFAVPFD YSMHNYLLRY YLAEHGLDPD
VDVQIRAVPP PEMVANLRAD NIDGYLAPDP MNQRAVYDGV GFIHILTKEI WDGHPCCAFA
ASKEFVTSMP NTYGALLKSI IEATAYAHKP ENRKEIAQAI SPANYLNQPA IVLEQILTGT
YADGLGNVVK QPNRIDFDPF PWQSFAIWIM TQMKRWGQIK GDVDYKTIAE QVYLATDTAK
LMKEAGLAAP DTTSRSFSVM GKPFDGSNPD QYLASFKIKK AS