Gene Synpcc7942_1691 details


Gene Information

Locus tag: Synpcc7942_1691
Symbol:
ID: 3775390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1757155
End bp: 1758315
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 61%
IMG OID: 637800129
Product: hypothetical protein
Protein accession: YP_400708
Protein GI: 81300500
COG category:
COG ID:
TIGRFAM ID:
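
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1757155, 1758315)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```

Both values agree with the Gene Length and Protein Length fields of this record.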


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.187562
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGTA GTGACGTTCC CTCCTTTGCT CCCGATCAGG CACTCCCACA GTGGCGACTG 
TTCTGGGGCA GCAGCGCTGC CGTCATTGCG ATCGCGATCG CAGCGAGTTT ACATGGTGGG
CGGCAGACGA TGTTGTTATT GCTGGGTGCA GCGATCGGGG CCGTTCTGTA TCACGGGCGC
TTTGGGTTTA GCAGCGGGTT TCGCAAACTG CTACAGCGTC AGGATGGCAA TCCTGCCTTG
GCGCAGTTGT GGTTGTTGGC GCTAACCTCG ATCGCTTTTG CTGGCGTTTT TAGCCTGGCG
GAACTCAGGG GTTGGGAGCT ACGACCCGCG ATCGCGCCGG TGGGCTGGGC TAGTCTGGGC
GGAGCTTTCC TATTTGGCAT TGGCATGCAG CTCAGCCGCG CCTGTGGTTG CGGTACTTTG
GCAGCCGTTG GCGGCGGTTC TTACAACCTA TTGATCACCC TGATTGCCTT TGGCATCGGG
GCGTTTGGGG CGACCTTGAC ACGGCCGCTG TGGAGCCAGC TTCCTGCTTG GGAGCCTTGG
TCTTTTGCCA GCCAATTGGG TTGGGGCAGT GCCCTCCTCC TGCAACTGTG GCTGCTCTTA
ATTCTGGCCG TGGCTCTCTG GCGCTGGGCA CCCCTGCCGA TGGTCAAATC TCGGCGGCTC
TGGCTGGCGG CGACAGCGAT CGCCCTGCTT TATACCGGCA CCCTCGTGGT TGACGGCCAA
CCTTGGCGAG TGACCTGGGG CTTGGCTCTG ACGACTGCGC AAGTTGCCCA AAACTTGGGC
TGGGATCCCC AGAGCAGCGT CTTTTGGGGG CGCCAACTGG GACGCCTCAG CAGCAGCTTA
CTGGCAGACC CCAGTGTGAT CACCGATCTA GGTCTGATTT TGGGTGCTTT GACGGCTGCC
GCCTATGAAG GCCGCTGGCG CTGGCAAGGT AGCCTCCAAC CCCATGCAGT TGGGCTCTCG
ATCGCAGGCG GATTGAGTAT GGGCTTTGGA GCATTTCTGG CGGCCGGCTG TAATATCAGC
GCCTATTTGG CCGGTATTGC CTCCACCAGT TTGCACGGCT GGGTCTGGCT GGTTGCGGCA
CTCCTAGGGT CTTGGGTCGG CATTCACTTG CGATCGCGCT GGCAACCGAG TTCGCCACCT
GCTTCAACCA ATCTGAGCTA G
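
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text; the helper names are hypothetical, and how the database rounds the percentage is not stated in this record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(grouped.split()).upper()


def gc_content(grouped: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small illustration on a made-up fragment, not on the gene itself.
print(gc_content("GGCC AATT"))  # 0.5
```

Passing the full gene sequence text to `gc_content` and multiplying by 100 should give a value comparable to the GC content field, up to rounding.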
 
Protein sequence
MASSDVPSFA PDQALPQWRL FWGSSAAVIA IAIAASLHGG RQTMLLLLGA AIGAVLYHGR 
FGFSSGFRKL LQRQDGNPAL AQLWLLALTS IAFAGVFSLA ELRGWELRPA IAPVGWASLG
GAFLFGIGMQ LSRACGCGTL AAVGGGSYNL LITLIAFGIG AFGATLTRPL WSQLPAWEPW
SFASQLGWGS ALLLQLWLLL ILAVALWRWA PLPMVKSRRL WLAATAIALL YTGTLVVDGQ
PWRVTWGLAL TTAQVAQNLG WDPQSSVFWG RQLGRLSSSL LADPSVITDL GLILGALTAA
AYEGRWRWQG SLQPHAVGLS IAGGLSMGFG AFLAAGCNIS AYLAGIASTS LHGWVWLVAA
LLGSWVGIHL RSRWQPSSPP ASTNLS