Gene Synpcc7942_2053 details

Gene Information

Locus tag: Synpcc7942_2053
Symbol:
ID: 3774272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2130328
End bp: 2132259
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 60%
IMG OID: 637800498
Product: peptidase
Protein accession: YP_401070
Protein GI: 81300862
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
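
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence includes one terminal stop codon; neither convention is stated on this page. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2130328, 2132259)
print(length)                  # 1932
print(protein_length(length))  # 643
```

Under these assumptions the computed values agree with the listed Gene Length (1932 bp) and Protein Length (643 aa).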


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.58101
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGG CTGCCTACGG TTCTTGGCGA TCGCCGATCA GTGCCGACCT GATTGTGCAA 
GGCAGTGTTG GATTGAGCGG TGTGATGCTG TCAGGGGGCG ATCGCTACTG GCTGGAGTCG
CGCCCCACGG AACGCGGCCG TACGACATTG ATTCGCCAAT CCGCGACAGG ACAGATTGAA
GAACTAACTC CGACACCGTG GAACGTTCGC ACTCGAGCCC ATGAGTATGG CGGCGGCTCC
TACTGCATCG ATCAGGGCGA GGTCTACTTC AGCCACGACA AGGATCAGCG GCTCTATCGC
CTGATTCTGG GGCAAGATCC GCAGCCGTTG ACGCCGGAGC TACCGCTGAA ATTTGCAGAT
GGTCTGATCG ATCGCCAGCG GCAGCGCTGG ATTGGCGTCC GCGAAGATCA CCGACCGGAG
GGCGAAGCGA TCGATGCGAT CGTGGCGATT CCTCTGACTG GCGAACCCAG TGAAGGGCAG
ATTCTCACGA TCGGAGCGGA CTTCTATGCA TCGCCGCGCC TCAGTGCCGA TGGGCAACGG
CTGGCTTGGC TGACTTGGTC GCACCCGAAT ATGCCCTGGG ATGGCACCGA GTTGTGGGTG
GCGGAGTTCC TAGCCGATGG GTCGCTGGCC ACGCCGCAGA AAGTTGCAGG AGGCGATCGC
GAGTCGGTGT TTCAGCCGGA ATGGCTGCCG GATGGGCGTT TGGGCTTTGT CTCCGATCGC
AGCAGCTGGT GGAATCTCTA CAGCTGGGAT GGTCAGAACA CGCAGGCGAT CGCGCCCACT
GAGGCGGAAT TTGGCCTGCC CCAATGGGTG TTTGGTATGC GCACTTGGGC ACCGATCGAT
GGCGATCGCT GGTTGGCCGC TTCTACAAAG GCAGGGCACT GGTCGCTCTC GCTAGTGGAT
CTCGCCACGG GCAGCCTGAA GCCATTTGAT CTGCCGTTCA CGGATATCTC TGGCTTAGTT
GTAGAAGGCG ATCGCGCTTT ATTTACGGCA GCCAATACCG ATCGCCCGGG TGCGGTGATT
GAACTGCAAA TCAGCAGTGG CGAGTGGCAA GTCCTCAAGT CCAGCTCCAG CTTGGATCTC
GACCCGCGCT ATCTCTCGAT TCCCCAGAGC ATTAGTTTCC CGAGTGCCAA TGGTCGGGTG
GCATACGGTC ACTTCTACCC GCCGAATAAT CCGGACTACC GAGCGCCTGC GGGCGAGAAA
CCGCCGCTAC TGGTCAAAAG TCATGGCGGC CCGACGGCGC AAACTCGCAG CAGCCTTAGC
CTTGGCATTC AGTACTGGAC GAGTCGCGGT ATCGCTGTGC TTGATGTCGA TTACGGCGGT
AGCACGGGCT ATGGCCGCCC CTATCGCGAT GCCCTGCAAG GGCAGTGGGG CATTGTCGAT
GTTGAAGATT GCGCCGCTGG TGCCCAGTGG TTAGCCGATC AAGGGCTAGT GGATGGCGAT
CGCCTCTGCA TTGATGGTGG TAGCGCGGGC GGCTACACAA CGCTCTGTGC CCTGACCTTC
ACCGATGTTT TCAAAGCCGG AGCGAGCCGC TATGGCATTG GCGACCTCAA AGCCCTCGCT
GAAGACACCC ACAAATTTGA GTCCCGCTAC CTCGATGGCT TGATTGGCCC TTGGCCCGAG
GCGGCGGATC TTTACCGCGA GCGATCGCCG ATTCACCACG TCGAGCAGCT CAACTGCCCT
GTGATTTTCT TCCAAGGTTT GGAAGACAAA GTTGTGCCAC CGGCGCAGGC AGAAACCATG
GTCGCCGCAC TCAAAGCCAA AGGCCTGCCT GTCGCCTATG TGCTCTTCCC CGAGGAACAG
CACGGCTTCC GGCAGGCTGC TAACATCAAG CGATCGCTGG AAGGGGAGCT GTACTTCTAC
AGCCAAATCT TCGGCTTCGA CCTTGCAGAC GAAATCGAAC CGGTGGCGAT CGCTAACTGG
CCTAAGGCTT AA
 
Protein sequence
MITAAYGSWR SPISADLIVQ GSVGLSGVML SGGDRYWLES RPTERGRTTL IRQSATGQIE 
ELTPTPWNVR TRAHEYGGGS YCIDQGEVYF SHDKDQRLYR LILGQDPQPL TPELPLKFAD
GLIDRQRQRW IGVREDHRPE GEAIDAIVAI PLTGEPSEGQ ILTIGADFYA SPRLSADGQR
LAWLTWSHPN MPWDGTELWV AEFLADGSLA TPQKVAGGDR ESVFQPEWLP DGRLGFVSDR
SSWWNLYSWD GQNTQAIAPT EAEFGLPQWV FGMRTWAPID GDRWLAASTK AGHWSLSLVD
LATGSLKPFD LPFTDISGLV VEGDRALFTA ANTDRPGAVI ELQISSGEWQ VLKSSSSLDL
DPRYLSIPQS ISFPSANGRV AYGHFYPPNN PDYRAPAGEK PPLLVKSHGG PTAQTRSSLS
LGIQYWTSRG IAVLDVDYGG STGYGRPYRD ALQGQWGIVD VEDCAAGAQW LADQGLVDGD
RLCIDGGSAG GYTTLCALTF TDVFKAGASR YGIGDLKALA EDTHKFESRY LDGLIGPWPE
AADLYRERSP IHHVEQLNCP VIFFQGLEDK VVPPAQAETM VAALKAKGLP VAYVLFPEEQ
HGFRQAANIK RSLEGELYFY SQIFGFDLAD EIEPVAIANW PKA
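
The gene and protein sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute a GC percentage for comparison with the fields in the Gene Information section. It is an illustration rather than the method used to generate this page. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in the set of permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line fragment and final block of the gene sequence listed above.
print(translate("ATGATCACGG CTGCCTACGG TTCTTGGCGA"))  # MITAAYGSWR
print(translate("CCTAAGGCTT AA"))                      # PKA
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.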