Gene RPB_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1761 
Symbol 
ID3909748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2014928 
End bp2016088 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID637883655 
Productnitrate transporter component, nrtA 
Protein accessionYP_485380 
Protein GI86748884 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.786473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0141893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC GCCTGCGCAT CGGATTCATC CCGCTGGCCG ACGCCGCGGC GCTGATCGTC 
GCCGTCGACA AGGGGTTCTG CGCCACCGAG GGGCTCGACG TCGAGCTGGT GCGCGAAATC
TCCTGGGCCA ATGTCCGCGA CAAATTCAAC ATCGGCCTAT TCGACGCCGC GCATCTGTTG
GCGCCGATGG CGGTCGCCTC CAGCCTCGGC ATCGGCCACA TCAAGGTGCC GGTGATTTCC
GGCTTCGGCC TCGGCGTCAA CGGCAACGCC ATCACGGTGT CGCCGGATCT GCAGGCTGCG
ATCGCGGCGA TGGCCGAGGG CGACGTCGCC GATCCGCTGG TGTCGGCGCG GGCGCTGGCG
CGCGTCGTCG CCGAGCGCAA GGCGCTGGGG CTGGAGCCGC TGATCTTCGG CATGACCTTC
CCGTTCTCCA GCCACAATTA CGATCTGCGG TTCTGGATGG GCGCTGGCGG GGTCGATCCC
GACGAAGACG TCCGCCTCGT GGTGCTGCCG CCGCCCTACA TGGTCGAGAG CCTCGCCAAC
AAACATCTCG ACGGCTTCTG CGTCGGCGCG CCGTGGAATT CGGTGGCGAT CGATCTCGGC
ATCGGCCACA TCCTGCATTT CTCCTGCGAG CTGTTCCAGC GCGCCGCGGA GAAGATGCTG
GCGGTGCGCG CCTCATGGGC CGAAGGACAT CCGGAGACGC TGGCGCGGCT GATCCGGGCG
CACGATCGCG CCGCGCAATT CATCGAGCAC GAACCCAATC GCGACGAGGT CTGCGCGATT
CTCACCGCGC CGGGCCGGAT CGAAGTGACG CCGGAGCTGA TCCGCCGCAC CCTGGACGGC
CGCCTCAAAG TCTCGCCCGA AGGCCGCATC CGCGAGACCG GCCGCTATCT GCTGGTCGGC
CGCGAAGCCG CGGCACGGCC CGATCCGGTG CAGGGCGCGT GGAACTACGC GCAGATGGTG
CGCTGGGGCC AGGCGCCGCT GTCGGCCGAA CTGCTCGCCG CCGCCAAGGC TGTGTTCCGG
CCCGACCTCT ACGACGCCGC CGTCGGCACG CCGCCGATCC TGCCGATCGC GCCCGCCGAC
GGCATCGGCG AATGCACCGG CACGCATTTC GATCCGGACG ACATCGCCGG CTATCTGTCG
GCGCTGACGA TCCGGCGCTG A
 
Protein sequence
MSERLRIGFI PLADAAALIV AVDKGFCATE GLDVELVREI SWANVRDKFN IGLFDAAHLL 
APMAVASSLG IGHIKVPVIS GFGLGVNGNA ITVSPDLQAA IAAMAEGDVA DPLVSARALA
RVVAERKALG LEPLIFGMTF PFSSHNYDLR FWMGAGGVDP DEDVRLVVLP PPYMVESLAN
KHLDGFCVGA PWNSVAIDLG IGHILHFSCE LFQRAAEKML AVRASWAEGH PETLARLIRA
HDRAAQFIEH EPNRDEVCAI LTAPGRIEVT PELIRRTLDG RLKVSPEGRI RETGRYLLVG
REAAARPDPV QGAWNYAQMV RWGQAPLSAE LLAAAKAVFR PDLYDAAVGT PPILPIAPAD
GIGECTGTHF DPDDIAGYLS ALTIRR