Gene RPB_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1331 
Symbol 
ID3907839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1518009 
End bp1519631 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content73% 
IMG OID637883225 
Producthypothetical protein 
Protein accessionYP_484952 
Protein GI86748456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.239363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATT ACTATCCGCT GATCGCACGC GCCATATCCG GCCTGGACCC CAGTGCTCCG 
GGCGAGCAGC GCCGTGCGAT CTACGAGCGG GCGCGCTCGG CCTTGATCAC GCAGCTGCGC
GGCGTCCAGC CGCCGCTGAC CGAATCCGAA ATCACCCGCG AGCGTCTGGC GCTCGAGGAG
GCCGTGCGCA AGGTCGAGTC CGAGGCCGCG CAGCGGTCCC GCGACGCCGC CCGCGCCGAG
TTGAAGAACC GCCGCCCCGC CGGCGACCCC GCGCGTCCCG GCGATTCGCT GCGGGCGTCG
AGCCGCCCGG CGCCGCGTCC GGGCGATCCG CCGCCGCAGG TCTCGCGCGC GCCGCTGCCG
CCGACCGCGG CTCAGGCCGA CGCCGAGCCG CCGCCGATCC GTCCGCGATC GCAACCGCCG
GCGCCGCCCG CGCCGCCGCG CGACGAGCGG GCGCAGCGCA ATCTTCGCGT CGAGCCGCCG
CCGATTCCGC CGGAGCCGCC GCTGCCCGGC CGCGAGCGCC CGGCGCCGCG CCGGCCCGAT
CAGGGTCCGG CCGCGAATCA GGGCGCCGAC AATGCGGGGC TGCGCGGCTT CCGCGACGTC
ACCGCCGATC TGGCCGATCT CGGCCAGGCC GCCGCGCAGG CCAACCGTTC GGCGCGCAAG
ACCTACGCCA ATGTGGCGCC CTCGACGGAG TTCGACCGGC TCGAACCGTC GATGGAGAAC
CGCACCGATC CGGACGCGCC ATATTCCTAC GACGAATCGG TCGACGAGGC GGCGCGCTAC
CAGCCGCCGC CGGCCGCGGC GCGGACGCGC GTCGAGCCGG ACCGCAAGGG GCCGCCGCGA
AAGCCGACCC GCCCGCCGTC GCGTTTTCCG CTGAAGAGCG CGCTGGTGAT CGGCCTGGTG
CTGGTGCTGG CCGGTGCCGG CATTTTGTGG GGACCGTCGC TGTATACGTC GCTGCGCGCG
ATGATGAGCT CGGCGCCGTC GACCGAGACC GCAACGCCGT CCGCGCCGCC GAGCTCGACC
GAGCGGCCGA AAATCACCGA TCGCGTCGGC CAGCCGTCGA GTTCGGAGGC GATCGCCCCG
GTGGCGCAGC GCGTCGTGCT GTACGACGAG GATCCGTCGG ATCCGAAGGG CAAGCAATAT
GTCGGCACGG TGGTGTGGCG CACCGAGCAG ATCAAGGGCG CCAGCGCCAA GGGCGGCGCC
GATCTGGCGG TGCGCGCCGA CATCGAGGTG CCCGAGCGCA AGTTCAAGAT GACGATGTCG
TTCCGCCGCA ACACCGACAC CTCGCTGCCG GCGAGCCATA CGGCGGAGCT GACCTTCATC
CTGCCGCAGG ATTTCAGCGG TGGCGGCGTG AGCAACGTTC CCGGCATCCT GATGAAGTCG
AACGAGCAGG CGCGCGGCAC GCCGCTGGCC GGGCTCGCCG TCAAGGTCAC CGACGGCTTC
TTCCTGGTCG GGCTGAGCAA TGTCGAGGCC GACCGCGCCC GCAACCTGCA GCTCCTGAAA
GAGCGCTCCT GGTTCGACGT GCCGATCGTC TACACCAACC AGCGCCGCGC CATCATCGCC
ATCGAAAAGG GCCCGCCCGG CGAGCGCGCC TTCGGCGAGG CCTTCGCCGC TTGGGGCGAG
TAG
 
Protein sequence
MADYYPLIAR AISGLDPSAP GEQRRAIYER ARSALITQLR GVQPPLTESE ITRERLALEE 
AVRKVESEAA QRSRDAARAE LKNRRPAGDP ARPGDSLRAS SRPAPRPGDP PPQVSRAPLP
PTAAQADAEP PPIRPRSQPP APPAPPRDER AQRNLRVEPP PIPPEPPLPG RERPAPRRPD
QGPAANQGAD NAGLRGFRDV TADLADLGQA AAQANRSARK TYANVAPSTE FDRLEPSMEN
RTDPDAPYSY DESVDEAARY QPPPAAARTR VEPDRKGPPR KPTRPPSRFP LKSALVIGLV
LVLAGAGILW GPSLYTSLRA MMSSAPSTET ATPSAPPSST ERPKITDRVG QPSSSEAIAP
VAQRVVLYDE DPSDPKGKQY VGTVVWRTEQ IKGASAKGGA DLAVRADIEV PERKFKMTMS
FRRNTDTSLP ASHTAELTFI LPQDFSGGGV SNVPGILMKS NEQARGTPLA GLAVKVTDGF
FLVGLSNVEA DRARNLQLLK ERSWFDVPIV YTNQRRAIIA IEKGPPGERA FGEAFAAWGE