Gene RPB_3665 details


Gene Information

Locus tag: RPB_3665
Symbol:
ID: 3911467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4206491
End bp: 4207810
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 637885567
Product: adenylate/guanylate cyclase
Protein accession: YP_487271
Protein GI: 86750775
COG category: [T] Signal transduction mechanisms
COG ID: [COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain)
TIGRFAM ID:
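
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, not part of the database record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 4206491
end_bp = 4207810
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive on both ends.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
computed_protein_length = computed_length // 3 - 1

print("gene length from coordinates:", computed_length)
print("protein length from gene length:", computed_protein_length)
print("consistent with record:",
      computed_length == gene_length_bp
      and computed_protein_length == protein_length_aa)
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.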


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.125818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.715111
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTGC CGTGGCGACG GTCGTCGAGA ATCTCCCAGC GCTCGACGCT GTCCGACGAA 
ATCCGCAGCG CGTTGGCGCG GGAAGTTCTG AAGACCGAAC TGCTGCGCGC GAAGGTGCTG
CTGTTCACCG TATCGGCGCT GATATCGACG CTGCTGGTGG CGTACTGGAT TTCCCCCGCG
CAGATCGAGC GGATCTGGCG CGGCCAGTTC TCGCTGATGC CGCTGCTGTT GTCCTATGCG
CCGCTGCTGG CGATCGAAGT CAGCGTCATC ATCATGCTGC GCCGGCGGTT GGCACGGGGG
CACAATGTCC TGCATTGGGG CCGCTATCTC GGCGTGCTGG TCGAGACCAG CCTGCCTTCG
GTCGGCCTGT ATCTGCAGAT GACGACGATG GGCCCGGCCC AGGCGCTCGC TTTCGCCGTC
CCGTTCGCCT ATTTCATCTT CATCATCCTG TCGACGCTGT GGCTCGATTT CTGGCTTTCG
GTCTTCACCG GATTCGTCGC CGCCGCCGAA TTGCTGGCGC TCGCGATGCT CTACCAACCG
CCGGGATTCG TGGGCGAGCC TGCGCCGGAT TTCGCCTTTC ATCTGCTTCG CAGCCTGGTC
ATCCTGATCT GCGGCGTGCT CGCCGGCGGG GTCGGGATGC AGTTGCGCCG ACAATTCGAG
GCTTCGATCG GCGCGGCCGA CGCGCGGGAT CGCATCACCA GCCTGTTCGG CCAGCACGTC
TCGCCGCAAG TGGTCGAGCA ACTGCTCGCG GCGGGCACGG AGGTGACCGG CGAGACCCGG
ACGGTGGTGG TGATGTTCGT CGATTTCCGC AATTTCACCG GCGCCGCTCG GACGCGGTCG
CCGGAGGAGG TGGTCGCGCG GCTTGACGAT GCTTTCGCCG TGCTGGTCGA CATCCTCGAG
CGCCACGGCG GCATCGTGAA CAAATTTCTC GGCGACGGAT TTCTCGCGCT GTTCGGGGCG
CCGATCGATG ATCCGCGCGC GGCGTCGTGC GCTGTGGCCG CGGCGCGCGA GATGCTGGCC
GCGATGGATG ACGACAACGC CGGACGGGAC TGGCCGTTGC GGATCGGCAT CGGCATTCAC
ACCGGCGATG CCGTGGTCGG CACTGTCGGC TCACCGCGGC GCAAGGAGTA CACGGTGATC
GGCGACACCG TGAATTTCGC GTCGCGGCTG GAGTCCCTCA ACAAACACTT CGGCACTCAG
CTTCTGATCT CGACGGCGAT CCGCGACGAA CTCGGCGATG CCGCGAGCGA CGCCGTCCTG
CTCGGCAACG TCGCGATGCG CGGCTACGCC GAGCCGATGG CGGTCTGGCG GCTGGGCTGA
 
Protein sequence
MQLPWRRSSR ISQRSTLSDE IRSALAREVL KTELLRAKVL LFTVSALIST LLVAYWISPA 
QIERIWRGQF SLMPLLLSYA PLLAIEVSVI IMLRRRLARG HNVLHWGRYL GVLVETSLPS
VGLYLQMTTM GPAQALAFAV PFAYFIFIIL STLWLDFWLS VFTGFVAAAE LLALAMLYQP
PGFVGEPAPD FAFHLLRSLV ILICGVLAGG VGMQLRRQFE ASIGAADARD RITSLFGQHV
SPQVVEQLLA AGTEVTGETR TVVVMFVDFR NFTGAARTRS PEEVVARLDD AFAVLVDILE
RHGGIVNKFL GDGFLALFGA PIDDPRAASC AVAAAREMLA AMDDDNAGRD WPLRIGIGIH
TGDAVVGTVG SPRRKEYTVI GDTVNFASRL ESLNKHFGTQ LLISTAIRDE LGDAASDAVL
LGNVAMRGYA EPMAVWRLG
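
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the tool used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for all non-start codons. Only the first 60 bases of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGCAACTGC CGTGGCGACG GTCGTCGAGA ATCTCCCAGC GCTCGACGCT GTCCGACGAA"
protein_start = "MQLPWRRSSR ISQRSTLSDE"

print(translate(gene_start))
print(translate(gene_start) == clean(protein_start))
print(round(gc_fraction(gene_start), 2))
```

Applying `gc_fraction` to the complete gene sequence gives a value that can be compared with the GC content field, and `translate` on the complete sequence can be compared with the full protein sequence.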