Gene Synpcc7942_1986 details

Gene Information

Locus tag: Synpcc7942_1986
Symbol:
ID: 3774173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2056331
End bp: 2057653
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 58%
IMG OID: 637800431
Product: processing protease
Protein accession: YP_401003
Protein GI: 81300795
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
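
The length fields in this record can be cross-checked against the coordinates. The short Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2056331
end_bp = 2057653

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1323 bp
print(protein_length, "aa")  # 440 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.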


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.500047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTAT GGGGCAATAG AGTTGACTGC TACGAGGTTG CGCGTTCCAT GGTTGCTGCG 
CTGGCTGCTC CCCGGTTGAA TGCCCCCCAT TGGCAGCAAC TTGAAAATGG ACTAATCATT
ATTGCCGAAC GCCTGCCGGT TCCTGCGGTG ACCTTTGATC TCTGGGTCAA GGTGGGATCA
GCTGTAGAAC CTGACGCTGT TAACGGGGTG GCTCATTTCC TCGAACACAT GGTGTTCAAG
GGCAGTCAGC GCCTCAAAGC CGGTGAGTTT GAGCAGCAGG TGGAAGCTCG GGGGGCGATC
GCCAATGCTG CCACTAGCCA GGATTACACC CACTTTTATT TCACCTGTGC GCCCAGTGAC
TTTACAGATC TCGTTTCGCT GCAAACCGAT GTCGTGCTCA ATCCCCTGCT TGCGGAAGCA
GAATTTGAGC GGGAACGGCG GGTTGTCCTA GAAGAAATTC GCCGTGCGGC GGATAATCCT
CGGCGTCGAG CCTACTACCG CATGATTGAG GCTGCGTTTG AGCGGTTGCC CTATCGGCGG
CCCGTGCTGG GCCCCTACGA CACGATCGCC CAACTCCCGC TCACCGATTT GCAGGCCTTT
CACCGCCAAT GGTACGGCCC CAATCAGCTA GTCGCAGTGG TGGTCGGGGA TCTCCCGGAA
GCGGAAATGA TCGATGCAGT GCGAGCTGCG GTTGCCGATC ATCCCCCTGT CACTGCTCAG
CGATCGCCCC TGTTGCCCGA GCCTGCCTTC AGTCAGCCCC AGCAGCAGAT CTATCACGAT
GCTGATCTGC ACCAAGCCCG GCTGTACCTA ACTTGGCGGG TTCCTGGGCT CAGCCAACTC
TCCCGGACTT ATGCTCTCGA TGCGATCGCG TCGATCTTGG CCAGTGGTCG CACCTCTCGG
CTGGTTGCTC AGCTTCGAGA ACAACAGGGC TTGGTCAGTA ATATTGTGGC GAGCAACTCC
ACCTATCGCG ATCAGGGCCT CTTTGCAATC ACAGCGCGGC TGCCGGTTGC CCATCTCGAC
ACCGTCCGAT CGCAGGTTCT TGCAGAGCTG CAATCCCTGC AGACTGAACC CGTGACCCCA
GCAGAGTTGG AGCGGATTCG GCGGCAGGTC GTCAATCGTT TCATCTTTGG CAATGAGCGC
CCCAGCGATC GCGCCAGCTT GTATGGCTAT TACGCTACGC TCCTCGGTAG CCTGGAACCG
GCCTTCAATT ACGTCGACGA AATCCACGCG CTCAGCGTTG ATGATTTGCA GGCAGCGGTT
CAGACTTACC TTGCTCCCGA AGCTTGCTCC ACGATTCAGA TTTTGCCAGG AGAGCATGGC
TGA
 
Protein sequence
MSLWGNRVDC YEVARSMVAA LAAPRLNAPH WQQLENGLII IAERLPVPAV TFDLWVKVGS 
AVEPDAVNGV AHFLEHMVFK GSQRLKAGEF EQQVEARGAI ANAATSQDYT HFYFTCAPSD
FTDLVSLQTD VVLNPLLAEA EFERERRVVL EEIRRAADNP RRRAYYRMIE AAFERLPYRR
PVLGPYDTIA QLPLTDLQAF HRQWYGPNQL VAVVVGDLPE AEMIDAVRAA VADHPPVTAQ
RSPLLPEPAF SQPQQQIYHD ADLHQARLYL TWRVPGLSQL SRTYALDAIA SILASGRTSR
LVAQLREQQG LVSNIVASNS TYRDQGLFAI TARLPVAHLD TVRSQVLAEL QSLQTEPVTP
AELERIRRQV VNRFIFGNER PSDRASLYGY YATLLGSLEP AFNYVDEIHA LSVDDLQAAV
QTYLAPEACS TIQILPGEHG
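
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. The Python sketch below is illustrative and not part of the record: it builds the codon table for translation table 11 (whose codon-to-amino-acid assignments match the standard code) and translates only the first 60 bases and the last 15 bases of the gene sequence. The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, listed in the
# conventional TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied with its original grouping.
first_line = "ATGTCACTAT GGGGCAATAG AGTTGACTGC TACGAGGTTG CGCGTTCCAT GGTTGCTGCG"
# Last 15 bases of the gene sequence (bases 1309 to 1323).
last_codons = "GGAGAGCATGGCTGA"

print(translate(first_line))   # MSLWGNRVDCYEVARSMVAA
print(translate(last_codons))  # GEHG
```

Both fragments match the corresponding ends of the protein sequence listed above. Translating the full 1323 bp sequence the same way would be the natural next check.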