Gene RPB_3661 details

Gene Information

Locus tag: RPB_3661
Symbol:
ID: 3911463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4202981
End bp: 4204237
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 68%
IMG OID: 637885563
Product: hemagluttinin-like protein
Protein accession: YP_487267
Protein GI: 86750771
COG category:
COG ID:
TIGRFAM ID:
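
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based
    and inclusive, and the last codon is an untranslated stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the terminal stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(4202981, 4204237))  # prints (1257, 418)
```

Under these assumptions the result matches the Gene Length (1257 bp) and Protein Length (418 aa) fields.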


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0725058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCCC GGATGAACGA CAGAGTTGGC GCGGCGTGGA TTGGCGCATC TCTCATTCTC 
GCGGCGTTTC TGCTGCAGCC GGGCCGCGCG ATGGCGCAAA CCACCGACCC GGTTCAGGTC
GTGTCGGGTT GCGTCGCCAA TGTCGCAGAC CGGCAATTGG CCTGCGGCCC GGGCGCCAGC
ACTGCCGGAA GCAATGATCG GTCGACGGCC ATCGGCAGCA ACGCTCAATC CGACGGCAGC
TCTGTCGCCA TCGGTAGCAG TTCCATAGCA ACCGGCAATA ACTCGACCGC TATCGGCGAC
AACGCCAACG CGGTTGGATT TGGGGATTCG ACAGTGATCG GCTCCGGCGC CGGATCTGGA
GGAGCCCGCA GCACGGTTAT CGGCAGCGGG GCGGCCACCG GCAACGAAGG AGCCATCGCC
GTGGGACATC GCGCCGGCGT CGGACTGGGC TCGGGTCAGT ACAGCATCGC GATGGGCGCG
GGCGGCGATA CCGCGCAGTC CGCGTCGCAT GCAATCGGCA ATTTCAGCAT CGCGATCGGC
GGCGGCGACG GCCTATCGGC CAACGGTGCC ATTTCCAATG CGGCGTTCGG CACCGCGGTC
GGCGCATCCA GCATCGCGGC GAACCAGTTC GACGCGGCGT TTGGCGCCTT CTCGATCGCC
AGCGGGGCGC GTAGCGCCGC ATTCGGCGCC AACAGCGTCG CCGCCGGCGC GTCGTCGGTG
GCGCTGGGAG ACGGCTCGTT TGCCCAAGGG ACCCACGCTG TGTCCACCGG CTTCAATTCA
GCGGCCACCG GTGTCAACAG CGTGGCGCTC GGTGCCGAGG CTTCGGCGAC GGCTTCCAAC
TCGGTGGCGA TCGGCTCGCG TTCGGTCACG AGCGCGCCCA ACACGGCATC GTTCGGGACG
CCCGGCAATG AGCGCCGGCT CACCAACGTG GCCGCCGGCA TCAGCCAGAC CGACGCGGTC
AATGTCGGGC AGCTCGCGGC AGTCACCAGC GGCCTTCAGT CGCAGATCAC CAACAACCGC
TCCGAAGCGC GGCGCGGCAT CGCCGCGGCG GTCGCCACCG CCAGCGCGCC GATGCCCTCG
GCGCCAGGCA AGACGACGTG GCAGATCCGC GGCTCCACCT TCCAGAACGA ATATGGCATC
GGCGTCGGTT TTGCCCACCA ACTGCGGACG GCGATGCCGC TCAACATCGT CGGCGGCTAC
GGCAATGGCG GCGGCGCCGA GCACACCGCC TATGTCGGCG TCGGCGGCGA GTTCTGA
 
Protein sequence
MTARMNDRVG AAWIGASLIL AAFLLQPGRA MAQTTDPVQV VSGCVANVAD RQLACGPGAS 
TAGSNDRSTA IGSNAQSDGS SVAIGSSSIA TGNNSTAIGD NANAVGFGDS TVIGSGAGSG
GARSTVIGSG AATGNEGAIA VGHRAGVGLG SGQYSIAMGA GGDTAQSASH AIGNFSIAIG
GGDGLSANGA ISNAAFGTAV GASSIAANQF DAAFGAFSIA SGARSAAFGA NSVAAGASSV
ALGDGSFAQG THAVSTGFNS AATGVNSVAL GAEASATASN SVAIGSRSVT SAPNTASFGT
PGNERRLTNV AAGISQTDAV NVGQLAAVTS GLQSQITNNR SEARRGIAAA VATASAPMPS
APGKTTWQIR GSTFQNEYGI GVGFAHQLRT AMPLNIVGGY GNGGGAEHTA YVGVGGEF
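
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before analysis. The following minimal sketch shows one way to strip the whitespace, compute a GC fraction, and translate a coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 permits, and it stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate seq codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean_sequence(
    "ATGACAGCCC GGATGAACGA CAGAGTTGGC GCGGCGTGGA TTGGCGCATC TCTCATTCTC"
)
print(translate(first_line))  # prints MTARMNDRVGAAWIGASLIL
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to the same helpers would give the whole-gene GC fraction and translation for comparison with the GC content and Protein Length fields.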