Gene Synpcc7942_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1842 
Symbol 
ID3774417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1908945 
End bp1910213 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content54% 
IMG OID637800283 
Producttransposase IS605 
Protein accessionYP_400859 
Protein GI81300651 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.119511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATCC GTGAAGCCAA GTTAGCTGGA ACCCCTGAGC AGTTTGCCCA ACTCGATGCA 
GCAATCCGCA CGGCTCAGTG TGTCCACAAC CGCTGTCTTC GCTACTGGCG AGACAACGCT
GGGGTTAGCA AAAACGACTT GCAAAAGTTC TGTGCTGTCT TAGCTAAAGA TGCTGAGCAG
CCTTGGATGG GCTTGCTGAA TTCGCAGGCT CGACAAGCAG CCGCGGAACG GACTTGGCAG
TCGATTCAGT CTTTTTATCG CCGTTGCAAG GAGGGCGCGA AACACAAGGG CTATCCCAAG
TTCAAAAAGC ATTCCCGCTC AGTGGAGTAC AAAACGTCGG GCTGGAAACT CTCGGATGAT
GGGATGTCGA TCACGTTCAC CGACGGTTTT AAGGCGGGTC AGTTGGAGCT GTGGTTGGAC
GGTTCTGCTC GGCAGTTAAT TCTGTCATCC AAAATCAACC GCGTTCGGGT GGTGCGCCGT
GCTGATGGGT ACTACGCTCA GTTCTGTTTG GACGTAGAAC GACAGGAAGC TGGAGTCTAC
AGCGGCAACG TCATTGGCTT GGACTTGGGG CTGAAGTATT TCACCAAGGA CTCTAATGGC
GCTGAGGTTG CTTGTCCCAA GTTTTTCCGC AAGGGAGAAA AGCGGCTCAG ACGGGCCCAA
CGGCGGCTAT CCAAGCGGTT CAAGAAAGGA GCGAAGCCTC AGAGCAAGAA CTACCACAAG
CAACGTCAGC GAATAGGCAA AGTCCACCTC AAAATTCAGC GCCAACGTAA AAGCTGGGCT
ATTGAACAAG CACGGCGCGT AATGGTGTCT AACGACATCG TGGTCTATGA AGATTTGCGG
GTGCCTAACT TAGTTCGCAG CCGACACTTA GCAAAATCCA TCCACGATGC AGGCTGGACG
CAGTTCACCC ACTGGCTGGA CTACTACGGC AAACTTTGGA GAAAAGTCGT CGTCGCGGTT
AATCCGGCCT ACACCAGCCA AGACTGCTCT GGTTGCGGTT ATCGGGTGCA GAAGTCACTC
TCGACCAGAA CCCATGACTG CCCACACTGC GGCTTAACGA TTTGCCGCGA CCAAAATGCC
GCGCTCAATA TCCTCAAGCG AGGGTTAGAA GTCGTCGGCG CGGAATGGAA CAACGGTACG
GCAGGGCATG CCGAAACCGG CTCGCAAGAG CAAACGACTG GGGAGATAAG CACCTCTGCC
GAAGTAGGGC AACCCACTTC TGTAAGTGCT GTCGCTGAAC CAGTAAGAAC AAGCCGCGAG
GCTGTTTGA
 
Protein sequence
MLIREAKLAG TPEQFAQLDA AIRTAQCVHN RCLRYWRDNA GVSKNDLQKF CAVLAKDAEQ 
PWMGLLNSQA RQAAAERTWQ SIQSFYRRCK EGAKHKGYPK FKKHSRSVEY KTSGWKLSDD
GMSITFTDGF KAGQLELWLD GSARQLILSS KINRVRVVRR ADGYYAQFCL DVERQEAGVY
SGNVIGLDLG LKYFTKDSNG AEVACPKFFR KGEKRLRRAQ RRLSKRFKKG AKPQSKNYHK
QRQRIGKVHL KIQRQRKSWA IEQARRVMVS NDIVVYEDLR VPNLVRSRHL AKSIHDAGWT
QFTHWLDYYG KLWRKVVVAV NPAYTSQDCS GCGYRVQKSL STRTHDCPHC GLTICRDQNA
ALNILKRGLE VVGAEWNNGT AGHAETGSQE QTTGEISTSA EVGQPTSVSA VAEPVRTSRE
AV