Gene Synpcc7942_2595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2595 
Symbol 
ID3775192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2677378 
End bp2678808 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content56% 
IMG OID637801049 
Producthypothetical protein 
Protein accessionYP_401612 
Protein GI81301404 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCTC TGCCTGCCCC TACCCCTGAC TTGACTGCCG CGATCGCCGC AAGCCCCGAT 
CTACCACTGA GCGCTTGGCT CTGGCGGGGA GCTGCGATCG TGCTGCTAAT TGTCATCAAT
GCCTTCTTTG TGACTGCGGA GTTTGCGATC GTCTACGTCC GGCGATCGCG GATTAATCAA
CTCGCCCAGG AAGGCGACGT TCCCGCTCGC ATGGTGGAAC GACTACAGCG CAGCATTGAT
CGACTGCTCT CGACCACACA GCTAGGAATT ACGCTGGCGA GTTTAGCCCT CGGCTGGGTC
GGAGAATCAA CGATCGCCGT TCTGATTCGT CAGGCACTCG AGCAACTGCC GCTACCCGCG
ATCGGGCCAG AACCGCTCAG CCATGTCCTG GCGATCCCCC TCGCCTTTGC CCTGCTGGTC
TATCTCCAGA TCGTCCTAGG TGAACTCTGT CCCAAGGCAG TCGCACTGAT CTACCCAGAA
CAGATGGCCC GCCTCTTAGG TCCACCAAGC ATTGCGATCG CCCAGATTTT TGCGCCGGTG
ATCAGCCTAT TGAACGGCTC GACCCGATGC CTGCTGGGAC TCTTCGGCAT CGACTATAGC
CAGCAGCGCT GGTATAGCAG CGTCACCCCA GAGGAGCTGC AGTGGATCAT TCAATCTGCA
GCTGAATCGA CAGGCTTAGA AGCAGAAGAA CGGCAGATTC TCAGTAATGT GATTGAGTTT
GGTGAAATCA CCGCTGGCGA AGTGATGGTG CCACGCACCC GGATTGTGGC GCTAGAAGAA
GACGCCACCT TCCTCGATCT TTTGGCTGCG ATTCAGGAAT CCGGCCATGC TTGCTTTCCC
GTGATCAGAG ACAGCCTTGA CCAAGTCTTA GGCCTGATCG ACTTTCGTGC TTTGGCGGTG
CCGATGGCCA GCGGCGAACT TCAGCCCAGC AGTCCTGTCA AAGCCTGGGT GCAACCAGCC
CGTTTTGTCC CGGAAGGCCT CTCCCTAAAA GAGTTACTGC CCCAGATGCA GCGATCGCCC
CTACCGATGG CGATTGTGGT CGATGAGTTT GGCGGCACCG AAGGTCTGGT GACCTTGCAG
GACATTCTGG CGGAAATTCT CGGCGATGAA GAGCAAGACG CTGAGGAGAA TGAACAGTTT
CGGCGGATTG ACGACCAAAC CGTGCTGGTT CAGGCTCAAA CGGACATTGA GACCGTCAAT
GAGCGCTTAG GACTGGATCT TCCCCTCGAA GAGGAGTACA ACACCTTGGG TGGATTTGTC
GTAGCGCAGT TACAGAAAAT TCCCGAAGCC GGTGAAGGCT TTGACTTTCA GGATTGTCAG
ATTCGCGTGG CGATCGCAGA AGGGCCACGG TTGGAATTTA TCGAAATTCG ACAATTGCGA
TCGCCGCAAC CTGCAGCGTC CGATGAGGCA AAACCGCATG CTAACATCTG A
 
Protein sequence
MDPLPAPTPD LTAAIAASPD LPLSAWLWRG AAIVLLIVIN AFFVTAEFAI VYVRRSRINQ 
LAQEGDVPAR MVERLQRSID RLLSTTQLGI TLASLALGWV GESTIAVLIR QALEQLPLPA
IGPEPLSHVL AIPLAFALLV YLQIVLGELC PKAVALIYPE QMARLLGPPS IAIAQIFAPV
ISLLNGSTRC LLGLFGIDYS QQRWYSSVTP EELQWIIQSA AESTGLEAEE RQILSNVIEF
GEITAGEVMV PRTRIVALEE DATFLDLLAA IQESGHACFP VIRDSLDQVL GLIDFRALAV
PMASGELQPS SPVKAWVQPA RFVPEGLSLK ELLPQMQRSP LPMAIVVDEF GGTEGLVTLQ
DILAEILGDE EQDAEENEQF RRIDDQTVLV QAQTDIETVN ERLGLDLPLE EEYNTLGGFV
VAQLQKIPEA GEGFDFQDCQ IRVAIAEGPR LEFIEIRQLR SPQPAASDEA KPHANI