Gene Rsph17029_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3359 
Symbol 
ID4898941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp409800 
End bp411290 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content72% 
IMG OID640113958 
Productlambda family phage portal protein 
Protein accessionYP_001045227 
Protein GI126464114 
COG category[R] General function prediction only 
COG ID[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.591009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGGA CCTTCCTGCG CCGGCTCGGC GCCTGGGTCG GCGGGTTCGA TGCGGGCCTC 
GCCAACCGGC GCCTGCGCGG CTTCCGCCCC GCACGCGCCC ATGTGAATGC GCTTCTCGCC
GCGGCCGGCC CCGACATGAA CGCCCGCGCG CGCTACCTCG TGCGCAACAA CGGCTATGCC
CAGGGCGCGC TCGACAGCTG GGCCGCGAAC ACGGTCGGCA CCGGGGTGAA GCCCTCCTCG
CTCATCGCGG CGCCGGCGCG GAAGGCAGCC CTCCAGCGGC TCTGGCAGGA CTGGACCGAC
GAGGCGGATG CCGAGGGCGT GACCGACTTC TACGGCCTGC AGCGCCGCAT CGCGCGCGAG
TTCTTCCTCA CGGGCGAATG CTTCGTGCGC CTGCGCGCGC GGAGGCCCGG CGACGGGCTC
ACGGTGCCGC TCCAGCTCCA GTGCCTGCCC TCCGAGATGC TGCCGATCGG CCGGACCGAG
GTGCTGGGCG GCGGGCGCGC GATCCGGCAG GGGATCGAGT TCGACGCGGT GGGCCGGCGG
GTGGCCTATC ACTTCCATCG CCGCCATCCG GGCGATCCGA CCGAGCCGGG GCTTGCGGGC
GAGACGGTGC GCGTGCCGGC CGAGGATGTG CTCCACATCG TCGATCCGGT CGAGAGCGGC
CAGCTCCGCG GCGTCTCGCG CTTCGCGCCC GCCATCGTGA AGCTCTTCCT GCTCGATCAG
TACGACGATG CCGAACTCGA CCGGAAGAAG GTCGCGGCCA TGTATGCGAT GTTCATCACC
TCGAACGATC CGGATGCGGC GCCGCTCGAG GGCGAGCTGG GCGATCAGGT GGCGCCGGGG
CAGATCGTGC GTCTCGACCC GGGCGAGGAC ATGAAGGTGG CCGATCCCGC GGACTCGGGC
GCGACCTACG AGCCGTTCCA GTACCGCACG CTCCTGCAGG TCTCGGCCGC GCTCGGGATC
CCCTACGCCC ATCTCTCGCA GGACATGGTG AAGGCGAACT ATTCCAATGC CCGCACCGCG
CTCATGGAAT TCCGCCGCCG GGTCGAGGCC TTCCAGCATT CGGTCCTCGT CTATCAGCTC
TGCCGTCCGG TCTGGGCGCG CTTCACCGAT CTCGCGGTGC TGACCGGAGC GGTGCGGCTG
CCGGGCTATG AGCGCCGGAG GCGGGACTAT CTCGCCTGCG AGTGGCTGCC GCCGAAGTGG
CAATGGGTCG ATCCGCTGAA GGACATCCGC GCCGAGATCG AGGAGATCGG CGCGGGCCTC
AAAAGCCGGT CGCAGGCGAT CGGGGAGCGC GGCTACGACG CCGAGGAGGT CGATCGCCAG
ATCGCCGCCG ACCGCAAGCG CGAGGGGCGG CTCGGGCTCG ACTTCCGCCG CAGCGCGCAG
GGCTCCTCCG CACCTGCGGC GCAGGACGGG GCGCGCGCCG ACGAGGAGGA CGACGAGGAT
GACGACGGCC GCGCGGCGGA CCGCGACGCC GGCAGGAGGG CAGAGCCATG A
 
Protein sequence
MAGTFLRRLG AWVGGFDAGL ANRRLRGFRP ARAHVNALLA AAGPDMNARA RYLVRNNGYA 
QGALDSWAAN TVGTGVKPSS LIAAPARKAA LQRLWQDWTD EADAEGVTDF YGLQRRIARE
FFLTGECFVR LRARRPGDGL TVPLQLQCLP SEMLPIGRTE VLGGGRAIRQ GIEFDAVGRR
VAYHFHRRHP GDPTEPGLAG ETVRVPAEDV LHIVDPVESG QLRGVSRFAP AIVKLFLLDQ
YDDAELDRKK VAAMYAMFIT SNDPDAAPLE GELGDQVAPG QIVRLDPGED MKVADPADSG
ATYEPFQYRT LLQVSAALGI PYAHLSQDMV KANYSNARTA LMEFRRRVEA FQHSVLVYQL
CRPVWARFTD LAVLTGAVRL PGYERRRRDY LACEWLPPKW QWVDPLKDIR AEIEEIGAGL
KSRSQAIGER GYDAEEVDRQ IAADRKREGR LGLDFRRSAQ GSSAPAAQDG ARADEEDDED
DDGRAADRDA GRRAEP