Gene Synpcc7942_2181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2181 
Symbol 
ID3773738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2259980 
End bp2261191 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content59% 
IMG OID637800626 
Producthypothetical protein 
Protein accessionYP_401198 
Protein GI81300990 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.392153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGTG GCTGGCGGAT TGGCTCCATT TTGGGCATCC CGCTCAGGAT CGATCCTTCT 
TGGTTTGTGA TCGTGGCACT CGTCACGTTC AGCTATTCCG AAACCTTCCG ATCGCAGCAG
CCGACTTGGT CGCCGGGGCT GTTGTGGGGG GCTGCCCTCG TCATGGCCTT GCTGCTGTTT
GCCTCTGTCT TGGCCCATGA GTTGGGGCAC AGTCTGATTG CCCGCGCCCA AGGGATTCGC
GTCAGCTCGA TCACGCTCTT CCTCTTTGGT GGTGTCGCGG CCATTGAGCG CGAGTCGCGG
ACCCCGGGGG GCGCTTTTTG GGTCGCGATC GCGGGGCCGT TGGTCAGCTT TGCCTTGGCG
TTGTTACTGC TGATCAGTCA GCTGTGGTGG CCAGCAGGTT CACCAGCGCA AGTTTTATCT
CTCAATCTGG GGCGACTGAA CTTTATCTTG GCGGTGTTTA ATCTCATCCC GGGGTTGCCC
TTGGATGGTG GTCAGGTGCT CAAGGCGATC GCCTGGAAAG TGACGGGCGA TCGCTATCGG
GCGGTGCATT GGGCTGCGAA CTCAGGTCGG ATTCTCAGTG CGATTGCCAT GGCGATCGGG
CTATTTAGCT GGTTTTTGGG GCCCGGCGGT TTTAGCGGCG TGTGGCTGGC GCTGTTGGGC
TGGTTTGGCT GGCGTAATGC CACGGCCTAC GATCGCACCA CCACCTTGCA ACAGGCGATC
CTGGCGATCG GCGCCAGCGA AGCGATGAGT CGTCGCTATC GGGTGCTGGA AGGATCACTG
ACCTTACGGC AGTTTGCGGA GCTGCTGATC ACTGAAGAGC AGGAAGGATT TGCCTACTTT
GTGGCTAGTG ACGGGCGCTA TCGGGGTCGG ATTAGCTTAG CGACCCTGCG GCAAACTGAG
CGATCGCAGT GGGATCGGCT GACCTTAACG GATTTAGCAG AACCGTTCGA CCGCTTGCCT
GCGCTGCCCG AGACGGCCAA TCTAGCCCAA GCGATCGCTG CTTTGCAAAC AGCCCAGCCC
AGCTACGTCA CAGTTCTGAC TCCCAGTGGC GCGGTCGCCG GCATCATTGA CCATGCCGAT
GTGATTCAAG CCCTCGGCAA AAAACTGGGC TTTCAGCTCC CCCCTGCCGA ACTCCAGCAG
ATTCGGGGCC GTGCCGCCTA TCCCGATGGA CTGCCGCTAG AAATGCTGGC GCAGTCAGTT
CTAAACAACT GA
 
Protein sequence
MQSGWRIGSI LGIPLRIDPS WFVIVALVTF SYSETFRSQQ PTWSPGLLWG AALVMALLLF 
ASVLAHELGH SLIARAQGIR VSSITLFLFG GVAAIERESR TPGGAFWVAI AGPLVSFALA
LLLLISQLWW PAGSPAQVLS LNLGRLNFIL AVFNLIPGLP LDGGQVLKAI AWKVTGDRYR
AVHWAANSGR ILSAIAMAIG LFSWFLGPGG FSGVWLALLG WFGWRNATAY DRTTTLQQAI
LAIGASEAMS RRYRVLEGSL TLRQFAELLI TEEQEGFAYF VASDGRYRGR ISLATLRQTE
RSQWDRLTLT DLAEPFDRLP ALPETANLAQ AIAALQTAQP SYVTVLTPSG AVAGIIDHAD
VIQALGKKLG FQLPPAELQQ IRGRAAYPDG LPLEMLAQSV LNN