Gene RPB_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4037 
Symbol 
ID3911844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4606775 
End bp4608001 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content69% 
IMG OID637885941 
ProductNnrS 
Protein accessionYP_487641 
Protein GI86751145 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3213] Uncharacterized protein involved in response to NO 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA CGACGATCGA TCCTTCCGCT TCCGGCCCGG CCAAGCGCAA GCCGGTGCCG 
CGCTATCGCG AGCAGGGCGG CTTGACCCTG CTGTCCGCCG GCTTCCGGCC GTTCTTCTTC
TTCGGCGCCG TGTTCGCGGC CGTCGCCGTG CTGCTGTGGC TGCCGGTGTA TTACGGCGAC
CTGACGCTAC AGACCGCGTT CGCGCCGCGC GACTGGCACG TCCACGAGAT GCTGTACGGC
TATCTGCCCG CGGTGATCAC CGGCTTCCTG CTCACCGCGA TCCCGAACTG GACCGGCCGG
CTGCCGCTGC AGGGCAGGCC GTTGCTGGTG CTGGTGCTGA CTTGGCTCGC CGGGCGGCTG
TGCGTGACGT TCTCGGCCGA TACCGGCTGG CTGGCCGCGA TGCTGGTCGA TGCGAGCTTC
ATGGCGCTGG TTGCGCTCGC TGCCGCGCGC GAAATCGCCG CCGGGAAGAA CTGGAGCAAC
CTCAACGTCG TCGCGCTGCT CACGCTGCTG CTCGCCGGCA ACATCGCCTT TCATCTCGAG
GCGCATGTCA ACGGCACCGC CGATTACGGC ATCCGCGCCG GCATCGGCGT GGTGATCCTG
CTGATCTCGC TGATCGGCGG ACGGATCACG CCGAGCTTCA CCCGCAACTG GCTGGTGCGC
GAGAATCCCG GCCGGCTGCC GGTTCCGTTC AACAAGCTCG ACATCGCGAT CGTCGCCTTC
AGCGCCGCGA CGCTGATCCT CTGGACCGTG TTGCCGATCA GCATGGTGAC CGGCACGGCG
CTGGCGCTGG CGGGCGTGGC GCATCTGGTG CGGCTGGCGC GCTGGGCCGG CGATCGCACG
CTGCGGGATC GGCTGCTGCT GGTGTTGCAT GTCGGCTATC TGTTCGTGCC GCTCGGCTTC
CTGCTCACCG CCTGCGCGGC GTTCGGGCTG GTTCCGCCCA GCGCCGGCAT GCACGCCTGG
ATGGTCGGCG GCGCCGGCAT CATGACGCTG GCGGTGATGA CCCGCGCCTC GCTCGGCCAT
ACCGGGCAGG AATTGCGGGC GTCGCTGCCG ACCCAGGCGG TCTATCTCGC CGCCCTGGTC
GCCGTGATGG CACGCGTCGG CGCGGCGCTG CTGCCGTCGT GGAGCGATCC GTTGCTGCAT
CTCTCGGTGC TGGGCTGGTC GGTCGCCTTC CTCGGCTTCG CGCTGAGCTA CGGCCCGACG
CTGCTGGCCC GTAGCAAGCC GCATTGA
 
Protein sequence
MSSTTIDPSA SGPAKRKPVP RYREQGGLTL LSAGFRPFFF FGAVFAAVAV LLWLPVYYGD 
LTLQTAFAPR DWHVHEMLYG YLPAVITGFL LTAIPNWTGR LPLQGRPLLV LVLTWLAGRL
CVTFSADTGW LAAMLVDASF MALVALAAAR EIAAGKNWSN LNVVALLTLL LAGNIAFHLE
AHVNGTADYG IRAGIGVVIL LISLIGGRIT PSFTRNWLVR ENPGRLPVPF NKLDIAIVAF
SAATLILWTV LPISMVTGTA LALAGVAHLV RLARWAGDRT LRDRLLLVLH VGYLFVPLGF
LLTACAAFGL VPPSAGMHAW MVGGAGIMTL AVMTRASLGH TGQELRASLP TQAVYLAALV
AVMARVGAAL LPSWSDPLLH LSVLGWSVAF LGFALSYGPT LLARSKPH