Gene RPB_4685 details


Gene Information

Locus tag: RPB_4685
Symbol:
ID: 3912503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5299921
End bp: 5301207
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 637886590
Product: major facilitator transporter
Protein accession: YP_488279
Protein GI: 86751783
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
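
The start, end, gene length, and protein length values listed above are mutually consistent, which the short Python check below illustrates. It is only a sketch: it assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5299921
end_bp = 5301207

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases:", remainder)
print("protein length (aa):", protein_length)
```

Under these assumptions the computed lengths agree with the 1287 bp and 428 aa reported in the record.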


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGACGGGC ATTCCACGCC GAAGGGCGCC TGGAGGATCA CCTTCCTCCT GTTCCTGTTC 
ATGGTGGTCA ACTTCGCCGA CAAGATCGTC GTCGGCCTCG CCGGCGTGCC GATCACCCAA
GAGCTCGGTC TCACACCCGA ACAGTTCGGC CTGCTCGGCT CGTCGTTTTT CTTCCTGTTC
TCGATCACGG CGGTCGTCGT CGGCTTCGTC GTCAACCGGG TCGACACCCG TTGGGTGCTG
TTGGTGCTGG CCCTGATCTG GGCGGTGGCG CAGTTTCCGA TGGTCGGCAC CGTCAGCTTC
ACCACGCTCT TGATCTGCCG CATCATCCTC GGCGCCGGCG AAGGCCCGGC CTTCGCGGTC
GCGGCGCATG CGATCTACAA ATGGTTTCCC GACCACAAGC GGACGCTGCC CACCGCGATC
CTGTCGCAGG GCTCGGCGTT CGGCGTGATC CTGGCGGTAC CGGCGCTGAA CTGGATCATC
GTCAACCATT CCTGGCACTA CGCGTTCGGC GCGCTCGGCA TTGTCGGGCT GATGTGGGCG
GCGGCGTGGC TCGCGCTCGG CAAGGAAGGC CCGCTGGTGC CGACCGCGGC GATGGCGGCG
GCGGAGACTC GGATCCCCTA TGCGCGGCTG CTGACCTCGC GCACCTTCAT CGGCTGCGTC
GCGGCGACGT TCGGTGCGTA TTGGGCGCTG TCGCTGGGAC TGACCTGGTT CACCACCTAC
ATCATCAGCG GGCTCGGCTT CAGCCAGCAC CAGGCCGGCC TGATCTCGAT CACGCCCTGG
GTGTTCGGCG CCGCCGTCGT GATGCTGACC GGCTGGCTGT CGCAGTTGCT GATGGGGCGC
GGCGCGTCGA GCCGGCTGGC CCGCGGCGTG CTCGGCGCGG CCCCGCTGGT GCTCGGCGGA
CTGATCCTGC TGACGATGCC GTTGATCGAC AATCCGACCG GGCGGATCGC CGCGCTGGTG
ATCGGCTCGG GGTTGTGCGG ATCGATCTAC GTGGTGTGCC CGCCGATGAT CGCCGAGTTC
GCCCCGGTGT CGCAGCGCGG CGCCGCGATC GCGATCTACG GTGCGCTGTA CACGCTCGCC
GGCATCGTGG CGCCGCTGGT GATGGGCAGC GTCGTCCAGC ACGCCGCGTC GCTGAACGAG
GGCTATCTCA CCGGCTACGT GATCAACGGC GCGGTGATGA TCGTCTCCGG CCTGCTCGGC
CTGCTGCTGC TGTGGCCGAA CACCGAGCGC GCGAGGCTGC TGAGCAGCTC GGACGTCGCG
CCTGTCGGCC TTCGCAAGCC GGCGTGA
 
Protein sequence
MDGHSTPKGA WRITFLLFLF MVVNFADKIV VGLAGVPITQ ELGLTPEQFG LLGSSFFFLF 
SITAVVVGFV VNRVDTRWVL LVLALIWAVA QFPMVGTVSF TTLLICRIIL GAGEGPAFAV
AAHAIYKWFP DHKRTLPTAI LSQGSAFGVI LAVPALNWII VNHSWHYAFG ALGIVGLMWA
AAWLALGKEG PLVPTAAMAA AETRIPYARL LTSRTFIGCV AATFGAYWAL SLGLTWFTTY
IISGLGFSQH QAGLISITPW VFGAAVVMLT GWLSQLLMGR GASSRLARGV LGAAPLVLGG
LILLTMPLID NPTGRIAALV IGSGLCGSIY VVCPPMIAEF APVSQRGAAI AIYGALYTLA
GIVAPLVMGS VVQHAASLNE GYLTGYVING AVMIVSGLLG LLLLWPNTER ARLLSSSDVA
PVGLRKPA
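
The gene sequence begins with TTG while the protein begins with M, which is what translation table 11 (the bacterial code) allows for an alternative initiation codon. The Python sketch below shows one way to reproduce that relationship on the first and last lines of the gene sequence. It is a minimal illustration, not the method used to produce this record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and `ALT_STARTS` lists only a subset of the initiation codons permitted by table 11.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (third base varies
# fastest). Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiation codons is listed here.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon in first position is read as methionine.
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "TTGGACGGGC ATTCCACGCC GAAGGGCGCC TGGAGGATCA CCTTCCTCCT GTTCCTGTTC"
last_line = "CCTGTCGGCC TTCGCAAGCC GGCGTGA"

print(translate(first_line))
print(translate(last_line))
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the listed protein sequence and the reported GC content.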