Gene Rsph17029_0642 details

Gene Information

Locus tag: Rsph17029_0642
Symbol:
ID: 4897247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 656750
End bp: 657961
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 66%
IMG OID: 640111225
Product: phage major head protein
Protein accession: YP_001042527
Protein GI: 126461413
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
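
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TGA). Both are assumptions that happen to match the reported numbers, not something the record states. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
gene_len = gene_length_bp(656750, 657961)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under those assumptions the computed values agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.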


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGG AAATCGCCGC GATCCTCGAG CGGGTCGTGA AGGACGTCGC TCGTATCGAC 
AACGAGCTTT CGAAGAAGGC CGAGGCCGCC TTCGCCGAGG TCAAGAACAT GGGCAACCTG
TCGACCGAGA CGAAGGCTTC GGTCGATCAG CTGCTCACGG CGCAGACCAC GCTTGTCAGC
GTCGTGGATG ACCTGAAGGC CCGCCTGGGC GAGGTCGAGC AGAAGGGCGC TCGTCGCTCG
GCGCCGACTT CGGCGCAGTC GTGGGGCCAG CAGGCGGTGC AGGCCGAGAA ACTGATCGCC
TTTGCGGCGG CGGTGGAAGG CGGCCGGCGC GTCTCGGTGC CCGTGGTCAA GAACGTGGTC
ACCTCGGCGG ATGTGGCCGA AGGCGTCGTC GAGCCCCAGC GCCTGCCGGG CATCGACGTC
GCGCCGAAGC AACGGCTGTT CATCCGCGAC CTGATCGCGC CGGGCAGCAC CGAGTCGCCC
GCGATCTTCT GGGTGCAGCA GACCGGCTTC ACGAATGCTG CCCGGGTGGT GCCCGAGGGC
ACGGCGAAGC CCTACTCGGA TATCGAGTTC GCGACCAAGA TCACGCCGGT CGTGACCGTT
GCGCACATGT TCAAGGCGTC GAAGCAGATC CTCGACGACT TCCGCCAGCT GCAGTCCATG
ATCGATGCCG AGATGCGGTA TGGCCTGAAG TATGTCGAGG AGCAGGAGAT CCTGTTCGGC
GCGGGCGGCG CGGGCAACAT CGAGGGCATC GTCCCGCAGG CGTCGGCCTT CGCTCCCGCC
TTCGCGCCGG AAATGCGGAC GCCGATCGAC GATCTTCGCC TCGCGCTCCT GCAGGCGCAG
CTGGCCCGTC TGCCGGCCTC GGGCTTCGTG CTTCACATGA TGGATTGGGC CAAGATCGAG
CTCACGAAGA ACACGGTTGG CGATTACGTC CTTGCCAACC CGCTCCGCCT CGCCGGGCCG
ACGCTCTGGG GTAAGCCCAT CGTCGAAACG GAGATCCCGG AGTTCGAGGG CGAGTTCCTC
GCGGGCGCCT TCTCCACCGG CGCGCAGATC TTCGATCGCG AGGACGCGAA TGTCGTCATC
TCGACCGAGA ACGCCGACGA CTTCGAGAAG AACATGATCT CGATCCGCTG CGAGGAGCGT
CTGGCGCTCG CCGTGAAGCG TCCGGAGGCG TTCGTCACCG GCGAGTTCGG CACCGCAGTC
GCCGCGCCGT GA
 
Protein sequence
MDKEIAAILE RVVKDVARID NELSKKAEAA FAEVKNMGNL STETKASVDQ LLTAQTTLVS 
VVDDLKARLG EVEQKGARRS APTSAQSWGQ QAVQAEKLIA FAAAVEGGRR VSVPVVKNVV
TSADVAEGVV EPQRLPGIDV APKQRLFIRD LIAPGSTESP AIFWVQQTGF TNAARVVPEG
TAKPYSDIEF ATKITPVVTV AHMFKASKQI LDDFRQLQSM IDAEMRYGLK YVEEQEILFG
AGGAGNIEGI VPQASAFAPA FAPEMRTPID DLRLALLQAQ LARLPASGFV LHMMDWAKIE
LTKNTVGDYV LANPLRLAGP TLWGKPIVET EIPEFEGEFL AGAFSTGAQI FDREDANVVI
STENADDFEK NMISIRCEER LALAVKRPEA FVTGEFGTAV AAP