Gene PCC8801_3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3772 
Symbol 
ID7103992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3961178 
End bp3962593 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content28% 
IMG OID643476777 
Producttype II restriction enzyme NspV-like protein 
Protein accessionYP_002373878 
Protein GI218248507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACGA ATCAACAACA TCAAAAATTA GAATATGGAG ATTTTCAAAC TCCTTTAGAT 
CTTGCTTATA AAATCTGCAA AAAATTACAT AGTTTAGGAG TAAAACCTGA TTTAATTATC
GATCCTACTT GTGGCATTGG TAATTTTTTA GAAGCCTCAT CTCAGGTCTT TATTGAAACG
TCTAAAATCA TTGGAATAGA AATCAATTCA ACTTATTTAA GAAGTTTTTC TGATCAAAGA
ATTGAAATGA TTCAAGGTGA CTTTCTAATG TTTAAATGGA ATAGCTTAAA ACAATTATCA
TTTGGTAATA TTCTGATACT TGGTAATTTT CCTTGGATAA CTAATTCCAA ACAAGCAATA
ATTGGAGGAA ATAATTTACC TCCAAAAAAT AACTTTCAAA AGCACAGTGG ATTAGATGCA
ATTACGGGAA AAAGTAATTT TGATATTTCA GAATGGATGT TAATTAAAAC TATTGATCAA
TTACAAAATA ATCATGCTTA TTTAGGGATG TTGTGCAAAA CTTCTGTTAC CAGAAAAATA
TTAAACTATA TCTCTTCTCA AAACTACAAT TTAAAGCAGT TTGCTACTTA CAAGATTGAT
GCTAAAAAAT ACTTTAATGT TGCTGTTGAT GCTAGTTTAT TATTTTGTGA AATTTCTCCT
GATTTTAAGC AGTATTTTTG TTCTGTCTTT GAGAACTTAG AAACCTCATC ATTTGAAACA
ATAGGATATC ATAATAAGAT CTTAGTTAGA AATTTAAACC GCTTCAAAGA GCTAAATTTT
CTATATACTA ACAAAAATCA ACAAAAATGG CGTTCAGGAC TTAAACACGA TTGTTCTAAG
GTTATGGAAT TACGCCAAAT TAACGATAAG TTAACTAATG GGTTTAATGA AACTGTGGAT
CTTGAAGATA CTTATTTATT TCCGCTTCTC AAAAGTTCTG ATATTGCTAA TAATAAAACA
TCTAATACAA AAAAGTATGT TTTAGTTACG CAAAGAAATA TTGGTGAATC TACAGATAAA
ATTGAAAAAA CAGCCCCTAA AACCTGGGAA TATTTACAAT ATTATGCAGA TTATTTAGAT
AGACGAAAAA GTAAGATATA TCAAAAAAAT CCTAAATTTT CTATTTTTGG GATTGGAGAC
TATAGTTTTC TCCCTTGGAA AATAGCGATT TCTGGATTAT ATAAACAGTT TTCTTTTAGA
TTAATTAATA CTATCAATAA CAAACCTGTT ATGTTTGATG ATACGGTTTA TTTTCTTGCC
TTTGAAGATA AAATGATCGC AGAAAAAACC TTCGATTTTT TAACCTCAAA AATTATATTA
GATTTTTATT CATGCTTAAC TTTTTGGGAT GAAAAACGTC CTATAAAAGC ATCAATTTTA
AATCGTTTAG ACCTAATTTA TGCAACAATG ACTTGA
 
Protein sequence
MTTNQQHQKL EYGDFQTPLD LAYKICKKLH SLGVKPDLII DPTCGIGNFL EASSQVFIET 
SKIIGIEINS TYLRSFSDQR IEMIQGDFLM FKWNSLKQLS FGNILILGNF PWITNSKQAI
IGGNNLPPKN NFQKHSGLDA ITGKSNFDIS EWMLIKTIDQ LQNNHAYLGM LCKTSVTRKI
LNYISSQNYN LKQFATYKID AKKYFNVAVD ASLLFCEISP DFKQYFCSVF ENLETSSFET
IGYHNKILVR NLNRFKELNF LYTNKNQQKW RSGLKHDCSK VMELRQINDK LTNGFNETVD
LEDTYLFPLL KSSDIANNKT SNTKKYVLVT QRNIGESTDK IEKTAPKTWE YLQYYADYLD
RRKSKIYQKN PKFSIFGIGD YSFLPWKIAI SGLYKQFSFR LINTINNKPV MFDDTVYFLA
FEDKMIAEKT FDFLTSKIIL DFYSCLTFWD EKRPIKASIL NRLDLIYATM T