Gene Synpcc7942_1462 details


Gene Information

Locus tag: Synpcc7942_1462
Symbol:
ID: 3773634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1514071
End bp: 1515141
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 56%
IMG OID: 637799894
Product: imelysin
Protein accession: YP_400479
Protein GI: 81300271
COG category: [R] General function prediction only
COG ID: [COG3489] Predicted periplasmic lipoprotein
TIGRFAM ID:
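
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(1514071, 1515141)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.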


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0336824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0166245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGTGA CAGGCTCTCA GGTCAGGCAA GGTTTAAACA CTTGGTTTGT GCTCCCGCTG 
CGTAGGACTG CGATCGGCCT GGGCTGCGCC GGAGTTGCAA CGCTCTTCTC TGCCTGTGGT
CAAACCCAGG CATTGATTAC CAATCAGACC ATTCAAGGAT TTGTCGATCA GGTTGTCGTT
CCTAGCTATG TCAGCGTTGC TGCTGGCGCA ACTCAGCTGG AACAAGCCCT CCAAACCTAT
CAGCAGGCAC CGACTGCTGC CAATTTGGAG GCGGCTCGAC AAGCCTGGCG GGTCGCCCGC
GATCGCTGGG AGCAGACTGA ATGTTTTGCT TTTGGGCCAG CGGATAGCGA AGGGTTTGAT
GGGGCAATGG ACACCTGGCC TATCGATCGC CAAGGCTTGA AAACTGCCGC AGCTCAGCCA
GTGGAGCAAC GGGAAGATAG CCGTAAGGGC TTCCACGCGA TCGAGGAGTT GTTGTTTGCC
GCAACGGAAC CGACGCTGAG CGATCGCCAG CATCTTGTGA TCTTGGCGAC GGACCTTACC
AAGCAAGCAC AGGGGTTGGT CACCCGTTGG CAACAAGCGA GTGATCAGCC TGCCTATCGC
TCAGTTTTGC TCAGCGCTGG CTCGACAGAT TCGGCCTATC CCACCCTGAA TGCTGCGGGA
ACCGAGATTG TTCAAGGCCT GGTTGATAGC CTCTCAGAGG TCGCCAGCGA AAAGATCGGC
GGGCCACTCG AGACTCAAGA ACCCGATCGC TTTGAAAGTT TTGTTAGCCG CAATACTCTG
TCTGACCTGC GCAACAACTG GACTGGCGCT TGGAATGTCT ATCGCGGTCA GCGGTCTGAT
GGGGTCGCGG CAGGAAGTCT GCAACAGCGT TTACAGCAAC AACATCCAGT GATCGCTCAG
CAACTCGATC AGCAATTTGC AACTGCCCGC CAAGCCCTTT GGGCTATTCC TGAACCGATT
GAAACCAACC TTGCCAGCCC AAGAGGCAAA GTGGCTGTCC TCACGGCTCA AACTGCGATC
GCAGCAGTCA GCGACACCCT AGAGCGTCAA GTTCTCCCGC TGGTTCAGTA G
 
Protein sequence
MIVTGSQVRQ GLNTWFVLPL RRTAIGLGCA GVATLFSACG QTQALITNQT IQGFVDQVVV 
PSYVSVAAGA TQLEQALQTY QQAPTAANLE AARQAWRVAR DRWEQTECFA FGPADSEGFD
GAMDTWPIDR QGLKTAAAQP VEQREDSRKG FHAIEELLFA ATEPTLSDRQ HLVILATDLT
KQAQGLVTRW QQASDQPAYR SVLLSAGSTD SAYPTLNAAG TEIVQGLVDS LSEVASEKIG
GPLETQEPDR FESFVSRNTL SDLRNNWTGA WNVYRGQRSD GVAAGSLQQR LQQQHPVIAQ
QLDQQFATAR QALWAIPEPI ETNLASPRGK VAVLTAQTAI AAVSDTLERQ VLPLVQ
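
The protein sequence can be reproduced from the gene sequence with translation table 11, whose codon assignments match the standard code. The gene starts with GTG, which table 11 allows as an initiation codon and which is read as methionine at that position. The following is a minimal sketch in plain Python, not the method used to generate this record; the helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical. It assumes the input begins at the start codon and is in frame.

```python
from itertools import product

# Codon-to-residue assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS that begins at its start codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # The initiator codon is read as Met even when it is an alternative
    # start codon such as GTG.
    protein[0] = "M"
    # Drop a terminal stop codon if one is present.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence shown above.
first_line = "GTGATCGTGA CAGGCTCTCA GGTCAGGCAA GGTTTAAACA CTTGGTTTGT GCTCCCGCTG"
print(translate_cds(first_line))  # MIVTGSQVRQGLNTWFVLPL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the complete protein sequence and the GC content field to be checked the same way.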