Gene PCC8801_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3970 
Symbol 
ID7103463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4157273 
End bp4159567 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content30% 
IMG OID643476967 
Productstem cell self-renewal protein Piwi domain protein 
Protein accessionYP_002374067 
Protein GI218248696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTT TACTTAACGG TTTCAAAATA GAGCTTTCTG CACCAACTTT CACCGCTTAT 
GTGGAGCAGA TGCCAAACAA TAAAAATATA GAGTCAATAA GAGAAAAGCA TCAGGATGAC
TGCTTTATAT ACTGGCATGA AGGTCAACTT TTTGTAATCC CTAAAACACC TGAGTCTAAA
ATCTCAATCG GAGAAAAGAC TACTCTACAA TGTGAAGAAA ATTTAAAGCT TTTAGTTGCT
CGTGTGAATC ATTTATTACC CACAATAGTA CCTGACTATA ATCCTGTTAG ATTTCAACCT
GTTCAATTTT TAGCCAAAAA AACAGAGTTA GTAAAAATGA TTATAGAAAA ACAAAATTTA
ACCAATTATT CTCAAATACT TGAAAATTTT AAAATAATAC CAAAATATAC TCTTGAAGCT
AGATTAATTG AATTACGACC TAATGAAGTT TTTATCAGTA TATTCTTATG CTTAGGAACT
AATTGGAAAA TTACAGCATC CCTTAGTAAT TTACAAGCTC AAGGTGTTAA CTTGAGAGAT
TTATATGTAG TCCGTCGTCA ATGTAAAACA GGAGAACGTC GCTTAGTTGG AAAAATTGAC
TCATTGACTC ATGGAATAAT TAATTTATCA GAATCATACA ATAATTTATC TCAAATTCAT
GAAGACGAAA TATGGTTAGA AGGTTCAAAA GCTTCTTTTT CCCATTGCCT AAAAGCTTTA
CTAGGTAACA AATATTATAG TTTTGAAAGA GAAAGAGAGC GTGAAGAAGG TAATTTGCTA
ATTGGAACAG GACAATATGA AGCATTAAAA AAAATTAAAG ATTACTTAAA AAAATCTCCA
CTTTTTCTGA CACCTGATTT ACAAGGAAAC TTAAAAGAGC AAATAGAAAT TAATAATGAT
GAAAATTATA CAACAATGAC TTCAACCTCT ACAGTAGAAT ATTGTTTTGA TTCAGCAAGA
ACTAAAAAAC ATAAAATAGC TTGGTATGGC ATTAGGGATT ATGGACCTTT TAGTCGAGAA
GTTTTTTCTA AAAAATCACC CAATATTTTA GTATTTTTTC CTGATACAAT CCAAGGAAAG
GTAGAAATAT TTTTAAAATC TTTTCAAGAA GGAATTAGTA TAAAAGAAGG ACAATTTGAA
AAATCAAGCT ATAGTGGTGG TTTTGCCAAA ATCTTTTGTC TTACTAACCC TAAGTTTACT
TTAGAAAGAA TTGCTTGGCA CGAAAATAAA GATCAATCAC CAGCTAAAGT TTATAAAGCA
TACAGAAAAA CAATTGAACA AATATTAAGT AAAATGCAAG AAGATATAGA TGCAGCTATA
GTTATTATTT TAGATGAACA CTCAAATCTC CCAGATTCTA TAAATCCATA CCTCCATTCT
AAATCTTTAT TGCTAACACA TGGAATCCCT GTTCAAGAAA TTCGATATTC AAATATTCAA
AAAGACAAAA AATCTTTACA ATATATTCTT CAAAACTTTA GCCTTGCAAT GTATGCAAAA
TTAACGGGAC AACCTTGGAC AGTTGATCAA GATCAAACTA TTAGTGACGA ACTTGTAATT
GGTATAGGAA CTAGCGAATT ATCTAATAGT CGATTTGAAA CAAGACAAAG ATTTGTAGGA
ATTACAACTG TTTTTAGAGG AGATGGAAAT TATCTTTTAA GTAGTTTGTC AAAAGAATGT
TCTTATGATG AATATCCTGA TGTTTTACGA GAATCAACAA TATCAATTTT ACAAGAATTT
AAGAGACGTA ATGGATGGCA ACCAGGTGAT ACAGTACGTC TTATTTTCCA TTCAGCTAGA
CCTTTAAAAA AGGTTAATAT TGCTAAAATT ATATCTGAAT GTGTCGAAGA AGTAGGTAAA
GAACAAATTG TTGAATTTGC TTTTTTGACA GTTTCTGAAG AGCATCCATT TATAGTTCTT
GATACTTCTC AACAGGGTTA TAAAGGAAAA GGAATTTATG CACCAGAACG AGGTAAAATT
ATACAAATTG GAAAATATAA TCGATTATTG TCTACCAATA GTCCACATTT AATTAAAAAG
GAAACTTCTC CTATACCTAG ACCTCTTCTA ATTCGTTTAC ACCAACAGTC TAATTACCGT
GACTTAACTT ATTTAAGTGA GCAAGTTTTA AAGTTTACTG CACTATCTTG GCGTTCTACC
TTTCCCGCGC CTAAACCAGT GAGTATCTAT TATTCAGAAT TAATTGCTAA TCTTTTAGGA
AGGTTAAAAA ATATAGAAGG ATGGTCATCT ATAATATTAA ATACAAAACT CCGTGCTAGT
AAATGGTTTT TATGA
 
Protein sequence
MSILLNGFKI ELSAPTFTAY VEQMPNNKNI ESIREKHQDD CFIYWHEGQL FVIPKTPESK 
ISIGEKTTLQ CEENLKLLVA RVNHLLPTIV PDYNPVRFQP VQFLAKKTEL VKMIIEKQNL
TNYSQILENF KIIPKYTLEA RLIELRPNEV FISIFLCLGT NWKITASLSN LQAQGVNLRD
LYVVRRQCKT GERRLVGKID SLTHGIINLS ESYNNLSQIH EDEIWLEGSK ASFSHCLKAL
LGNKYYSFER EREREEGNLL IGTGQYEALK KIKDYLKKSP LFLTPDLQGN LKEQIEINND
ENYTTMTSTS TVEYCFDSAR TKKHKIAWYG IRDYGPFSRE VFSKKSPNIL VFFPDTIQGK
VEIFLKSFQE GISIKEGQFE KSSYSGGFAK IFCLTNPKFT LERIAWHENK DQSPAKVYKA
YRKTIEQILS KMQEDIDAAI VIILDEHSNL PDSINPYLHS KSLLLTHGIP VQEIRYSNIQ
KDKKSLQYIL QNFSLAMYAK LTGQPWTVDQ DQTISDELVI GIGTSELSNS RFETRQRFVG
ITTVFRGDGN YLLSSLSKEC SYDEYPDVLR ESTISILQEF KRRNGWQPGD TVRLIFHSAR
PLKKVNIAKI ISECVEEVGK EQIVEFAFLT VSEEHPFIVL DTSQQGYKGK GIYAPERGKI
IQIGKYNRLL STNSPHLIKK ETSPIPRPLL IRLHQQSNYR DLTYLSEQVL KFTALSWRST
FPAPKPVSIY YSELIANLLG RLKNIEGWSS IILNTKLRAS KWFL