Gene RPB_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3594 
Symbol 
ID3911396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4121747 
End bp4122793 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content69% 
IMG OID637885496 
Producthypothetical protein 
Protein accessionYP_487200 
Protein GI86750704 
COG category[S] Function unknown 
COG ID[COG2855] Predicted membrane protein 
TIGRFAM ID[TIGR00698] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.641351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCGA ACGCCAGAAC CCATCAACCG CACCCCCGCA TCGACCGGCA CCTGGCCGCG 
ATCCTGCCGG GCGTGGTGCT GACGGCCACG ATCGCCGCCG CAGCTCTCGG GTTGCAGCGG
ATCCCCGCGC TCGCGGGCGT CAGTCCGATG CTGCTGGCGA TCCTGATCGG AATCGTCGCC
CACAACGTCG TCGGGACGCC GGGATGGGCG CTGCCCGGGG TCAAATTCAG CGTTCGACGG
GTGCTGCGCT TCGCCATCAT CCTGCTCGGA CTGCAGCTCA CCGTCGCGCA ACTGCTCGAG
GTCGGCGGCG AAGGGCTGGC GATCATCGCC GCGACGCTGC TGGCGACCTT CAGCTTCACC
GTGTGGCTCG GCCGCATGCT CGGTGTCGAT CGCAAACTCG GCGAGCTGAT CGCCGCCGGC
ACCTCGATCT GCGGCGCCTC CGCCATCATC GCCACCAACA CCGTGACCCG CGCCGACGAC
GAGGATGTCG CTTACGGCGT CGCCTGCGTC ACGGTATTCG GGTCGCTGGC GATGGTCGGT
ACGCCACTGC TCCAGAACGC GTTCGGTCTC GACGCCCACG GCTTCGGGCT GTGGACCGGC
GCCTCGATCC ACGAGATCGC CCAGGTCGTC GCCGCGAGCT TCCAGGGCGG GCACGATGCC
GGCGAGTTCG GCACCATCGC CAAGCTGTCG CGGGTGATGA TGCTGGCGCC GCTGGTGATC
GGTCTCGGCA TGCTGGCGCG GGCCCGTGCG CGGCACGAGC CTTCGGCCGC CGGCACCGCT
GCGCCGCCGA TGCCGTGGTT CGTGCTCGGC TTCGTGGCGA TGGTCGGCGT CAACCAGCTG
ATCGCCATCC CGCAGGACGT CAAAGCCCCG ATCGTCGCCG CCACCGCCTT CCTGCTGTCG
ATGGCGCTGG CGGCGATGGG GCTCGAGACG GATCTGCGCA AGCTCGCCGC GCGCGGCCTG
CGTCCGGCCT TGCTCGGCCT GTGCGCGGCC TTGTTCATCT CGGGCTTCTC GCTGAGCCTG
ATCAAACTCA CAGGATGGCA TGGATGA
 
Protein sequence
MTSNARTHQP HPRIDRHLAA ILPGVVLTAT IAAAALGLQR IPALAGVSPM LLAILIGIVA 
HNVVGTPGWA LPGVKFSVRR VLRFAIILLG LQLTVAQLLE VGGEGLAIIA ATLLATFSFT
VWLGRMLGVD RKLGELIAAG TSICGASAII ATNTVTRADD EDVAYGVACV TVFGSLAMVG
TPLLQNAFGL DAHGFGLWTG ASIHEIAQVV AASFQGGHDA GEFGTIAKLS RVMMLAPLVI
GLGMLARARA RHEPSAAGTA APPMPWFVLG FVAMVGVNQL IAIPQDVKAP IVAATAFLLS
MALAAMGLET DLRKLAARGL RPALLGLCAA LFISGFSLSL IKLTGWHG