Gene Cyan8802_3132 details

Gene Information

Locus tag: Cyan8802_3132
Symbol:
ID: 8392464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3204737
End bp: 3206011
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 33%
IMG OID: 644981077
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003138807
Protein GI: 257060919
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
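
The start, end, and length fields above agree with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3204737, 3206011)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1275 bp and 424 aa, matching the Gene Length and Protein Length fields.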


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.474681
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTAG AAGGTTGGAA AGATAGTAGT TTGATTTCCC TCTTGACAAT ACTTAAGTCT 
GGAGGGACAC CAAATACATC ACGAAGCGAT TTTTATAATG GTGATATTCC TTTTGTTGCA
ATTGAAGATA TGAGTGCTAG TAGAAAATAT TTATACAGCA CTGTTAAAAG TTTAACAAAA
GAAGGCTTAA AAAATTCTAA CGCTTGGTTA GTCCCTGAAA ATTCCTTACT GTATTCTATA
TACGCAACTC TTGGACTTGT TCGTATTAAT AAGATACCTG TAGCTACTAA TCAAGCTATA
TTAGCAATGA TTGTAAACGA TGAGGTGGTT GATCAAGATT ATCTCTATTA TTGGTTAGAA
TATATTCGTG ATTCTATTGT TAATTTATCG GCTCAAACAA CACAAAGTAA TTTAAGTGCC
ACTACTGTTA AGCCTTTTTT AGTTCAGCAT CCTAAAGATA AAGAAGAACA AACCCAAATA
GCCACTATCC TCTCAACAAT AGACCGCGCT ATTGAACAAA CCGAAACTTT GATCGCTAAA
CAACAGCGCA TTAAAACGGG ACTAATGCAG GATTTACTAA CAAAAGGTAT TGATGAAAAT
GGTAATATTC GCAGCGAAGA AACCCATCAA TTTAAGGATT CAGTTTTAGG TAGGATTCCT
GTTGAGTGGG AGGTGAAACC TTTAGGTGAA AAAGCAAGGG TAAGATCAGG ATCTACTCCT
TTACGATCTA ATGAAAAATT TTGGATAGGG GGGACAGTTT CTTGGGTTAA AACCTCTGAA
GTTTGTTTTT CCAAAATAAC AGAAACAGAA GAAAAAATTA CAGAGCAAGC ATTAAAATTG
ACCTCTTTGA ATTTAGAACC TATTGGTAGT GTATTGGTAG CTATGTATGG ACAAGGTGGA
ACTAGAGGAA GATGCGCTAT TTTAGGCATT GAAGCAACAA CTAATCAAGC TTGTGCTGCA
ATTCTAGGAC AGCAAGGAGA AATCAATCAA GACTATTTAT TTTATTATTT ATCTTCTAAA
TATAATGATT TACGAACAAT AGGACATGGA TCAAACCAAA CTAACTTAAA CGGTAATTTA
TTAAGATTAT TTCTTATTAA AGTTCCATCC TATAAGGAAC AAGTTAAAAT TGCTGACTCT
TTCAATAAAT TAAAACAGAT GCAAGATCAG CTTTTTTCGG AATTATCAAA GTTAAATAGT
ATAAAAACCG GCCTTATGCA AGATCTTTTA ACTGGCAAAG TTAGGGTAAC AGAATTACTT
AAAGAAACAG ATTGA
 
Protein sequence
MSVEGWKDSS LISLLTILKS GGTPNTSRSD FYNGDIPFVA IEDMSASRKY LYSTVKSLTK 
EGLKNSNAWL VPENSLLYSI YATLGLVRIN KIPVATNQAI LAMIVNDEVV DQDYLYYWLE
YIRDSIVNLS AQTTQSNLSA TTVKPFLVQH PKDKEEQTQI ATILSTIDRA IEQTETLIAK
QQRIKTGLMQ DLLTKGIDEN GNIRSEETHQ FKDSVLGRIP VEWEVKPLGE KARVRSGSTP
LRSNEKFWIG GTVSWVKTSE VCFSKITETE EKITEQALKL TSLNLEPIGS VLVAMYGQGG
TRGRCAILGI EATTNQACAA ILGQQGEINQ DYLFYYLSSK YNDLRTIGHG SNQTNLNGNL
LRLFLIKVPS YKEQVKIADS FNKLKQMQDQ LFSELSKLNS IKTGLMQDLL TGKVRVTELL
KETD
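
The relationship between the gene and protein sequences above can be spot-checked with a minimal translation sketch. It assumes the amino-acid assignments of translation table 11, which are the same as the standard code (the tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their block spacing.
print(translate("ATGAGTGTAG AAGGTTGGAA AGATAGTAGT"))
```

Applied to the first 30 bases, this gives `MSVEGWKDSS`, the first block of the protein sequence. The final 15 bases (`AAAGAAACAG ATTGA`) translate to `KETD` followed by a stop codon.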