Gene Synpcc7942_0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0750 
Symbol 
ID3775923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp743738 
End bp746590 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content64% 
IMG OID637799165 
ProductPhage tail tape measure protein TP901, core region 
Protein accessionYP_399769 
Protein GI81299561 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.173353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGTA AAGCCTTTCA GGTAACGCTT GAAATTGGGG GGCGCGTTGC AGCGTCCCTC 
GGTTCGTCGA TTGCCCGTGC CCAAGGCCAG CTCAACACAC TGGCCAGGGC CACCAAGACA
GGCTTTATGG GCGTCGTGCG CAACGACGCT TTTCAGGCGT TGGCAGCGGC TAGCGCGGCG
GTTGGCGCGG GCTTGATCTA CTCGACCAAG CAAGCTGTTG CCTTCGAGTC TCAGCTGGCT
GACATCGGCA AGACGGCAGG CTCTAGCGCG GCAGAACTCA AGGCGTTGGG GGCTGACCTA
CTGGCGCTCA GCGCCCGCGA TCGCACGAAC CAATCAGCCA GCAATCTCGC CTCTGGCATC
CAAGACCTAG TGGCCCAAGG CCTCGAACTA AAGGATGCGA TCGCCAGCAT TGAAACCCTA
GGTCGAGTAG CGACGGCGAC CAACTCGAAC CTGACCGACA TCACTAAGAC CGGCTTTCAG
CTCCAGAATG CGCTGAAGAT CAAGCCGACT GAACTCAAGG CGACGTTCGA CGCACTGGCC
TACGCGGGCA AGCAAGGGGC GTTCGAGCTG AAGGACATGG CGCAGTTCAT GCCGACGATC
GCTGCGGCGG CTGGTTCTCT GGGGGTGACA GGCAGAGAAG GCGCGGTCAG TCTGGCCGCC
ATGATGCAGA TGGTGCGACG CGATGCGCCG GACGCTGGGC AGGCTGCCAC GCGGTTGACA
GACGCAATGC TGAAGATGAC CGCACCGGAT GCGGTGAAGC GCTTCAAGAA GTTTGGCGTC
AACATCGAGC AGGTGCTGAA AGACGCCAAG GCCAACGGCA TCAATCCGAT GGAAGCGGCA
GTCGAAACCC TGTTCAAAGT GACGGGTGGC GACACGTTTA AGCTCGGACA GATCTTCGGT
GACAAGGAAG CAAAGCTGGC TCTAATGAGC CTGATGAAGT ATCGCGCCGA GTACACCAAG
CTGCGCGATG ACGCGGGCGG CTCGATTGCG GCAGGCACTG TTGACGCTGA CTATCAGCGA
TCGCTGGGTA CCTTCGCGGA ACAGATGAAG GCGCTGCAAA ACACGGGCGA GCGGCTGGCG
ATCTCGATCG GCACGGCGCT GCTGCCGTCG CTGAACTCAC TGGCTAACGT GGTCACGCCA
GTGATTGAAG GTATGGCCCG CTGGGCTGAA ACCAACCCCG GGCTGATGAA AGGCATCGTG
GCGATCGCGG GGCTGACGGT GGGCTTAACT GCTGCCCTGC CCCTCATTGG TGCCGTGGTG
GCAGCCATCG GTGTGATTGG TGGCCCGATC ACCTTGGCCG TGCTGGGAAT TGGTGCGGCG
ATCGCCCTAG TCATTGCGTA CTGGGACGAC CTCAAGCAGG TTGCTCTCGG ATTCTGGGGT
TCAGTGAAGC AAGCGGGCGC GAGTGACCTG TTCCAAGGCA TCCGGCAGGG ACTCACCGGG
GTGATGACGC TACTCAGTGA GGCCAAGCGC TTCTGGGTGG CTTTGTTCTC CGGCAACGAG
CAGGAAGTCG CGGCCGCAGC CCAGCGGATC GGGCAGACGA TCGGCGCGGT CATCCTGCCA
GCCGTGGCGC AAGTCTGGCT GTCGATCGGG CGCATGGCCG CGATGGGTGT CGTCGCGCTC
GGTCGGTCGT TCATCACCGG GCTAGCCGGT GCGATGAGGG CCATGCCCGG AATCGTGATG
GGCGCCGCTC GCATGGCGGG CACTGTCCTC GTGACGCTCA TGCAAGCAGC GATCGCGACG
GTCGGCTCCC TACTGCAACA GCTACCCGGC ATCGCCGGAT CCGCGTTTGC GGGCATCACG
AGCCAGTTCA GGGCGGTCTG GGATCAAGCG CTCGGTGTAG TGCGATCGTT CCTGCCCCAG
ATGGGCGCGA TCTTGTTCCC ACTGCCCACG TTGGTCATCG GCATCTTCCA GAAGATCGTC
CCTGGCATCG CGTCAGTCTT TGCTCAGATG GTCGCCCAGA TTCAGGGCGC GTTCCAGCAG
GTCGTCGCGT TCATCCGAAG CGTGCCGTCA ATGCTGGCCG GCGTCGGTGA GGCGATCATC
CAAACCATCA TCGACGGGGT GAAGGCAAAG GCTGGCGAGC TGCTGGCGAC CGTGCAGCAG
AGCTTCGCCA GGGTGCGCGA GCTGATGCCG TTCAGCGACG CGAAGCGCGG CCCGTTCTCG
ACCCTCACCA AATCCGGCAT GGCGATCCCC GGCACGCTCG GGATTGGCGT GCGTCGCGGT
GCAGGTTTGC TCCGGCGTCC GCTCGTTGCG GCTGCGACGG CGGCGATGGC CGCGATGGGT
GCGGTGCAAG CCCCAGCGAT CGCTATAGCA GCGCCGACCC TGCCCCAACC CTTGCCAGCA
CTCACTCAGC CCGCGATGGG TGCGCCGAAA ACTGGCCCGG TTCTGGCCCG GCAGGCCGTC
CCGCTGCAGA CCGAGAATGC AGGCTCGCAG CAGCTTGTCG CGTCGATTTC CCCTGCAATA
GCGCTGCAGG TTCCCGCGCC CGCGATCGCG ACTCCGGCTC CGATCAGCGT CCCTGCTCCG
CAGATTGTTT CGCAGCCGTC GATTGCCCTG CCGGCCCCAA CGATCGTGGC TCAAGCCTCT
GTGGCAACGC CCGAGGCCCG CTTTGCCCTG ACTGAGGTCA GGATCCCCGA GGTCAATAGC
ACACCCACGA TCGCCGCGCC GATGCCCGCG CCGGTCGTGG TTTCGCCTGC GCCTCGAAGC
GATCGGCGCG CCCCGATCAA CATCACTGCA CCGATCACAA TCAACGCTGG TCCAGGGCAG
GACGCGCGTA GTATCGCCGC CCAAGTGCGC CAGGTGTTCG ATGACCTGAT GCGCGAGGCC
GAACTCAACC AGCGTGCTGC TCTAAACGAC TGA
 
Protein sequence
MAGKAFQVTL EIGGRVAASL GSSIARAQGQ LNTLARATKT GFMGVVRNDA FQALAAASAA 
VGAGLIYSTK QAVAFESQLA DIGKTAGSSA AELKALGADL LALSARDRTN QSASNLASGI
QDLVAQGLEL KDAIASIETL GRVATATNSN LTDITKTGFQ LQNALKIKPT ELKATFDALA
YAGKQGAFEL KDMAQFMPTI AAAAGSLGVT GREGAVSLAA MMQMVRRDAP DAGQAATRLT
DAMLKMTAPD AVKRFKKFGV NIEQVLKDAK ANGINPMEAA VETLFKVTGG DTFKLGQIFG
DKEAKLALMS LMKYRAEYTK LRDDAGGSIA AGTVDADYQR SLGTFAEQMK ALQNTGERLA
ISIGTALLPS LNSLANVVTP VIEGMARWAE TNPGLMKGIV AIAGLTVGLT AALPLIGAVV
AAIGVIGGPI TLAVLGIGAA IALVIAYWDD LKQVALGFWG SVKQAGASDL FQGIRQGLTG
VMTLLSEAKR FWVALFSGNE QEVAAAAQRI GQTIGAVILP AVAQVWLSIG RMAAMGVVAL
GRSFITGLAG AMRAMPGIVM GAARMAGTVL VTLMQAAIAT VGSLLQQLPG IAGSAFAGIT
SQFRAVWDQA LGVVRSFLPQ MGAILFPLPT LVIGIFQKIV PGIASVFAQM VAQIQGAFQQ
VVAFIRSVPS MLAGVGEAII QTIIDGVKAK AGELLATVQQ SFARVRELMP FSDAKRGPFS
TLTKSGMAIP GTLGIGVRRG AGLLRRPLVA AATAAMAAMG AVQAPAIAIA APTLPQPLPA
LTQPAMGAPK TGPVLARQAV PLQTENAGSQ QLVASISPAI ALQVPAPAIA TPAPISVPAP
QIVSQPSIAL PAPTIVAQAS VATPEARFAL TEVRIPEVNS TPTIAAPMPA PVVVSPAPRS
DRRAPINITA PITINAGPGQ DARSIAAQVR QVFDDLMREA ELNQRAALND