Gene Rsph17025_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1761 
Symbol 
ID5083667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1795011 
End bp1797776 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content68% 
IMG OID640483321 
Productphage tape measure protein 
Protein accessionYP_001167959 
Protein GI146277800 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family
[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.337369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACTG ACGTCGAAAA GCTGGTCGTC CAGCTCTCCG CGGACATCAA GCAGTATCAG 
CGCGAGATGA ACAAGGCAGT GGGCGTTTCA AACAAGCAGG CGAGGGCTGT CGAAAATCGC
TTTCGTCAGA TGAACCGGAA CCTCGACAGC ATCGGCCGCA CCGCGGCCCG GTCGCTGGTC
ACGCCCCTGC TCGGCATCGG GGCTGCCCTG TCGGTGCGCG AGGTGGGGCG CTACGCGGAC
GCCTGGACCA ATGCCAAGAA CAGCCTGGCC GTGGCTGGCG TGACCGGCAG GGCGCAGGCC
GATGTGCTCG ACAGGCTCTA TCAGTCGGCG CAGAGCAACG CGGCTCCGGT GGGCGCGCTG
GCCGATCTCT ATGGCAAGGC CTCGCAGGCC TCGGACGTTC TGGGCGCCTC GCAGGCTGAC
CTGATCAAGT TCTCCGACGG CGTCGCGACG GCGCTGCGCG TTCAGGGCGG GTCGGCGGCC
CAGGCGTCCG GCGCGCTGAC GCAGCTCGGC CAGCTGCTCG GGTCGGCTCG CGTGCAGGCC
GAAGAGTTCA ACTCGGTGAA CGAAGGCGCC CGGCCGATCC TGATGGCGGT CGCCGCTGGT
CTTGATGAGG CGGGCGGCTC CGTCTCGAAG CTCAAGACGC TGGTGAATGA CGGCAAGGTC
AGCGGACAGC AGTTCTTTCA GGCTTTCCTG CGCGGTCTGC CCACCATCCA GGGGATGGCG
GCGAACGCCA CCACCACGAT CGAGCAGGGC ATCACGAAGG TCGAGAACGC GTTCACGCGC
TACATTGGTC AGACCGATGA GAGTCTCGGT GCCTCTCAGC GTCTAGTGCA GGGCCTCTCT
GCCCTTGCAG ACGATTTCGA GAGCATCGCC GACATCACGC TCAAGGTGGC CGCGCTGATC
GGGGCCGGGC TGCTTGGCCG CTCCATCGCG GGCATGGTCA AAAATCTCGG CATCGCCGGA
GTGGCCGCCA CGGAGTTCGT CGCCGCGATC CGGGTGGCAT CGTCCGTCTC CGGCGTCGCG
AAGGCCATCG GCGGCTTGAG CGCGGCGGCG GGGCCGATCG GGGCCGTGGT GGGCGTGGCC
GCGGTGGGCG CGCTGATGCT GTACTCGGCC AGCACGGCCG ACGCGCGCGA GGCTTCGGAG
GCCTTCGAGC AGCGGCTCGA CCGCATTCGC GCCGCGGCGC CTCAGATGGC CACGGCCGTC
GAGGACGGCG CTCGCCGCGC GAAGGATGCG ATGGAGAGCC TGAAAAACCC GGCGGTGATC
AAGCTGGAGA TGGAGAGCGC AGAGGCTGAC GCGAACCTCG ATGCCATCCG CACAAACCTA
CAGCGGGCAA TGGAACAGCT CGATCACCTG GCGGGCATGA AGGTCTTCAC CGAGGCTCAG
CAGGAAGCGA TCAGGGACTT TAACGACAAG GTGCTCCAAG GCACGGCCAG CGCCGAAGAA
ATGGCTGAGG TGCTGAAGGA TCTCGGTAAG ATCGACGGGG TGGGCCAGAG CCTGATCGAT
TCCGTCACGA GCCTCTGGCG CGCGCTCAAC CTGACCTCGG CCGCGGCGCG GCAGGTCCAG
TCGGACATTC AGGCCGCGCT CGGGGCCTAT GCGCCTGACC TGGGCCGGTT CGGTCAGGTC
CAGGATGGGA TACTTTCCCG GCTCTACGCC GATGAAGCCG CCAAGCGCGC GGGCCGCGCC
TATCTCGACG AACAGCAGCG CCTGCAAGGC CTCACGCGGG AGCAGCTCGC GCTTGAGACG
GAAATCTCGC GCCTGCGGAA GAGCATGCCC GACGGCGCGA CCGTCAGCGA CGAGGATCTT
GCAGAGACGG CGCGCGGCAA CCTCGCCGCG GCCGCTCGTC GGGCTGAGGA AGGCCGGAAG
GGCGGCGGTG GCGGCGGTCG CGAGGCCAAG GCCGTCGAGC GCTTCGATGA CCGGATCCTG
AAGGAGACCG AGGGGCTCAA GGCCGAGACC GCCGCGCTCA ATCAGCTGGA GCTCGGACAG
GACCAATACG GCACCGCAGT GGCCCGCGCC CGCAAGGAAG CGGAGATGTT GCAGGACCTG
CACAACAAGG GGCTCACGAT CACGCCGGCC CTGCGCGCGC AGGTGCAGGG GCTGGCCGCG
GACTGGCAGC ATGCAGCCGA GGCGAACGCC ATGGCAACCG AGCGGCATGA GGAGTTCCAG
AGCAACCTGC AGGACACGAA GGCCACCATG AGCCAAGCGT TCAGCGGGCT GATCAAGGGG
GCTCACGGCT TCTCCGACGC GCTGGGGATG GTGATCGACA AGCTCGCCGA CATCGCCCTG
AACGCGGCCT TCAACGGCCT CTGGAACGGC ATTCTCGGGG GAGCTGCGTC GGGCCTCGTG
TCCGGGCTTG GGGGCTTCTC CAAGGGGGGC TACACCGGGC CGGGCGGCAA GCACGAGCCC
GCCGGCGTCG TCCACAAGGG CGAGGTGGTC TGGAGCCAGG CCGATGTGGC GAAGGCCGGC
GGCGTGGCCG TGGTCGAGGC GATGCGGCGC GGCGCCAAGG GCTATGCGTC GGGCGGGGTC
GTGATGCCCG AGATGCCGCG CGTCCCGGCG CCGAAGGCGC TCGCGGCCAT GGCGGCCAAG
GCGCAGGTCC CGCGCGTGGT CTATGTCCCG CAGCCTTACG TCGCCCAGGT CTCGGCCGAC
GATGATGGCC GGATCATCGG CACCATGCGG CGGGTCGCCC TGGCCACGTC TGCGGCCACA
GCGGCCGGCC AGCAGCGGCA ACTCGGATCG TCCATCAATA GCTACAGCGC GCGGGGCACG
ACCTGA
 
Protein sequence
MATDVEKLVV QLSADIKQYQ REMNKAVGVS NKQARAVENR FRQMNRNLDS IGRTAARSLV 
TPLLGIGAAL SVREVGRYAD AWTNAKNSLA VAGVTGRAQA DVLDRLYQSA QSNAAPVGAL
ADLYGKASQA SDVLGASQAD LIKFSDGVAT ALRVQGGSAA QASGALTQLG QLLGSARVQA
EEFNSVNEGA RPILMAVAAG LDEAGGSVSK LKTLVNDGKV SGQQFFQAFL RGLPTIQGMA
ANATTTIEQG ITKVENAFTR YIGQTDESLG ASQRLVQGLS ALADDFESIA DITLKVAALI
GAGLLGRSIA GMVKNLGIAG VAATEFVAAI RVASSVSGVA KAIGGLSAAA GPIGAVVGVA
AVGALMLYSA STADAREASE AFEQRLDRIR AAAPQMATAV EDGARRAKDA MESLKNPAVI
KLEMESAEAD ANLDAIRTNL QRAMEQLDHL AGMKVFTEAQ QEAIRDFNDK VLQGTASAEE
MAEVLKDLGK IDGVGQSLID SVTSLWRALN LTSAAARQVQ SDIQAALGAY APDLGRFGQV
QDGILSRLYA DEAAKRAGRA YLDEQQRLQG LTREQLALET EISRLRKSMP DGATVSDEDL
AETARGNLAA AARRAEEGRK GGGGGGREAK AVERFDDRIL KETEGLKAET AALNQLELGQ
DQYGTAVARA RKEAEMLQDL HNKGLTITPA LRAQVQGLAA DWQHAAEANA MATERHEEFQ
SNLQDTKATM SQAFSGLIKG AHGFSDALGM VIDKLADIAL NAAFNGLWNG ILGGAASGLV
SGLGGFSKGG YTGPGGKHEP AGVVHKGEVV WSQADVAKAG GVAVVEAMRR GAKGYASGGV
VMPEMPRVPA PKALAAMAAK AQVPRVVYVP QPYVAQVSAD DDGRIIGTMR RVALATSAAT
AAGQQRQLGS SINSYSARGT T