Gene Cyan8802_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3989 
Symbol 
ID8393339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4102957 
End bp4104327 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content33% 
IMG OID644981913 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_003139627 
Protein GI257061739 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0737122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTAA ACACCCTAAA ACAATGGAAA CTCTACCCAA ATTATAAACC TTCTGGGGTT 
GATTGGTTGG GGGATATTCC TGATAGTTGG GAGGTTAAAA GATTAAGATA TTTAAGCAAA
AAGATAACAG CCGGTCCCTT TGGTTCTAAT TTGACTAAAA ATATTTATAC ATCTACAGGA
TATAAAATTT ATGGACAAGA ACAAGTAATT GCTTCTGATT TTTCCATAGG TGATTATTAC
ATCTCTAAAG AAAAATATGA CCAAATGAGT CAATATAAAA TAAATTCTGG AGATATATTA
ATAAGCTGTG TAGGAACTTT TGGAAAGGTT GCAGTAGTTC CTAAAAACAT AGAACAGGGT
ATAATCAATC CTCGCCTTAT AAAACTCATC CCTATTACTG AATATATTAA CTCTGTTTAT
TTAGAAAAAT TATTAAAAAG TGTTGTTGCT TTTGAACAGA TGGAAAAATT AAGCAGAGGA
GGAACAATGG GAGTAATTAA CATTGGATTA CTTTCTGATA TTTTACTACC TATTCCCCCC
CTTCCCGAAC AAGAAAAAAT CGCTCAATTT CTGGATAAAG AAACGGCGAA AATAGATAAA
CTCATCACCC TCAAAGAAAG ACTAATTGAA TTATTAAAAG AAAAGCGCAC AGCTTTAATT
AGTCATGCTG TCACCAAAGG ACTTAACCCC GATGTCCCCA TGAAAGATTC TGGGGTAGAA
TGGTTAGGGT TTATTCCTGA ACATTGGGAG GTTAAAAGGT TAAAATATAT AGTCCCTAAT
ATTACCGTAG GTATTGTAGT TACTCCTGCT AAATATTATG TAGAATCAGG AATACCATGT
TTACGTTCTG TAAATATATC TTCGGGAAAA ATTGATAATT CTAATTTAGT TTTTATTAGT
TCTCAAAGTA ACGAACTTCA TCAAAAATCT AAAATCTATA AAGGTGATTT AGTTTTAGTA
AGAACTGGTG TCACTGGAAC AGCTGCGATT GTTACAGATA ATTTTGATGG GGCAAACTGT
GTTGATTTAT TAATTATTCG TAATTCTAGA TTAATTTTAA CACTATATCT ATACTATTAT
CTTAATTCTT CAACAACGTC TTATCAAGTT AATAATTATT CAGTAGGTGC TATTCAAGCC
CACTATAATA CGTCAACATT ATCAGAACTA ATCATTACTT TTCCTCCCCC TCAAGAACAA
CAAAAAATCG CTGAATACTT AGACAGAAAA ACCGAACAAA TAGACCAAAT AATTAACAAA
ACCCGTGAGA GTATTGAATA TTTAAAAGAA TATCGAACCG TGTTAATATC TGCTGCCGTA
ACAGGTAAAA TAGATGTGAG GCAGTGGGGA GGTGAGGAGG TGAGGGAATG A
 
Protein sequence
MTLNTLKQWK LYPNYKPSGV DWLGDIPDSW EVKRLRYLSK KITAGPFGSN LTKNIYTSTG 
YKIYGQEQVI ASDFSIGDYY ISKEKYDQMS QYKINSGDIL ISCVGTFGKV AVVPKNIEQG
IINPRLIKLI PITEYINSVY LEKLLKSVVA FEQMEKLSRG GTMGVINIGL LSDILLPIPP
LPEQEKIAQF LDKETAKIDK LITLKERLIE LLKEKRTALI SHAVTKGLNP DVPMKDSGVE
WLGFIPEHWE VKRLKYIVPN ITVGIVVTPA KYYVESGIPC LRSVNISSGK IDNSNLVFIS
SQSNELHQKS KIYKGDLVLV RTGVTGTAAI VTDNFDGANC VDLLIIRNSR LILTLYLYYY
LNSSTTSYQV NNYSVGAIQA HYNTSTLSEL IITFPPPQEQ QKIAEYLDRK TEQIDQIINK
TRESIEYLKE YRTVLISAAV TGKIDVRQWG GEEVRE