Gene PCC8801_4114 details

Gene Information

Locus tag: PCC8801_4114
Symbol:
ID: 7105565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4313300
End bp: 4315645
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 38%
IMG OID: 643477103
Product: ATP-dependent DNA helicase PcrA
Protein accession: YP_002374202
Protein GI: 218248831
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID: [TIGR01073] ATP-dependent DNA helicase PcrA
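
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(4313300, 4315645)
print(length)                  # 2346
print(protein_length(length))  # 781
```

Both values agree with the Gene Length and Protein Length fields listed above.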


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTATCC CCATGACAGT TAATAATGAT TTTCTGGCTC AACTCAACAC TTCACAACGT 
CGCGCGGTTG AACACTTTTG TGGTCCTTTA TTGGTGGTAG CGGGTGCGGG GTCGGGGAAA
ACTCGTGCCC TAACCTACCG AATTGCCCAT CTGATTAGTT ATCATCAAGT TGAACCCGAA
TCGATTTTAG CCGTTACCTT TACCAATAAA GCGGCTAGAG AAATGAAAGA AAGAATTGAG
AAACTTTTTG CTCAAGAAAT GGCTTTCAAA AAGCATGGTA TCAGATTTGA TCTCCTCAAT
GAATATGAAC AAAAACAACT TTTGTCGAAA GTTTATAAAA GTACGACTAA AAAGCTTTGG
ATTGGAACTT TTCACAGTTT ATGTACTCGA ATATTGCGCT ATGATATTAA TAAATATCAA
GACGAGCGGG GACGACAATG GCAACGAAAT TTTTCAATTT TTGATGAATC TGATGCCCAA
AGTTTAGTTA AGAATATTGT CACCAAACAA TTAAACTTAG ACGATAAAAA ATTTGACCCC
CGTTCCGTGC GGTATCAAAT TAGTAATGCT AAAAATTTAG GGCTTTCTCC CGATCAGTAT
TTACAAAAAA ATCCTTCCTA TAAAAGTCGA GTGATTGCTG AAGTTTACAA CGAATATCAG
TCCCAATTAG CGGCTAATAA TGCCTTAGAT TTTGATGACT TAATCCTCAT CCCCGTTAGA
CTATTTCAAC AAAATGAATC AATTCTTGGC TATTGGCATA GTCAATTTAA ACATATTTTA
GTAGATGAGT ATCAAGATAC CAATCAAATT CAATACGAAT TAATCCGCTT ACTGGCTACC
AATGGAGAAA CGAAAAAAAG CGAATGGAAT TGGCAAAATC GCTCTATTTT TGTCGTAGGA
GATGCGGATC AATCCATTTA TAGTTTTCGC ATGGCAGACT TTACCATTTT GCTCAATTTT
CAATCAGACT TTGGGGATGG TTTGCCTGAT GATGACACGC GAACTATGGT TAAACTAGAG
GAAAATTATC GGTCACGGGA AAATATTTTA GAAGCCGCTA ATCATTTAAT TGAAAATAAC
AGTCAGCGTA TTGATAAGAT CCTGAAACCG ACACGGGGAT CAGGAGATTT TATTTACTGT
TACAAAGCGG ATGATGAACA AATTGAAGCC CAATTGGTGA TTGAACATAT CCAAAAATTA
GTCAGGGAAA ACCCCGAATT AAATTGGGGA AGTTTTGCCA TTCTTTATCG AATCAATGCC
CAATCTCGAC CCTTTGAAGA TCGGTTAATC ATGAATAGTA TTCCCTACAA TATTGTAGGT
GGATTTAAGT TTTATGATCG TCAAGAAATC AAAGACGCGA TCGCCTATTT ACGGCTAATT
GCTAACCCAT CCGACACAGT AAGCTTACTG AGAATCATTA ATACTCCCCG TCGTTCCATT
GGCAAAACCT CAATAGAATC TTTATTAAAA GCAGCCCAAG AGTTAGAGAT TCCCCTCTGG
GAAATTATCT CCGATGAAAC CTCCGTTAAT ACCTTAGCAG GACGGGCAGC TAAATCCGTC
AATAAATTTG CCCAAATGAT TAAATCTTTT CAAGAACAAT TAGATAGTTT ATCAGGAGCA
GAAATTCTTA ACCAAGTGAT GGAAGCGTCG GGATATGTTG ATGATTTAAA ACAAAAAGGA
ACCGAAGAAG CGGATAATCG TTTAGCAAAT ATTTTCGAGT TATACAATGC GGTGTTACAA
TTCCAAGAAG ATAGCGAAGA TCAAAGCTTA CAAGGCTTTT TATCTAATGC GTCCCTCGCG
TCTGATCTCG ATGATTTAAA AGAAGGACAA GAGAAAGTTT CTTTAATGAC CCTTCATTCC
GCTAAAGGGT TAGAATTTCC CGTTGTTTTT CTTGTGGGTT TAGAAGAAGG CTTATTACCC
CATGCCCGCA GCGTTAGCGA TCCTTTATCC TTAGAAGAAG AACGGCGGTT ATGTTATGTT
GGGGTCACTC GTGCTCAAGA ACAATTATTT TTGACCTACG CGAGAGAACG GTTTGTTTGG
GGGTCAAAAG AAGCGAAAAT GCCCTCACGA TTTTTACAAG AATTGCCCTC AGATTTAATT
AAGAGTAATG TTCCTCCTAA AGTCGTTAAA TCTCGCCCAT CCTTAACAAC CTCTTCCCCT
AAAATCTTGA AGCAATCTAA TGAAACGTGG TCTGTCGGAG ATCAGGTGAT TCATCAGGTG
TTTGGAGAGG GGGAAATTAC CCGTATTTTA GGATCAGGTA AAAAATCGAG TTTAGCGATT
AAATTTCCAG GGTTAGGACA AAAAATTATC GATCCGACCA TAGCCCCCCT CAAACGCAAC
GAGTAA
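
The gene sequence is displayed in whitespace-separated blocks of ten bases, so it has to be joined before any computation. As a minimal sketch, assuming the sequence has been copied as plain text, the following shows one way to strip the grouping and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGGTTATCC CCATGACAGT TAATAATGAT TTTCTGGCTC AACTCAACAC TTCACAACGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same functions on the full sequence would allow a comparison against the reported gene length and GC content.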
 
Protein sequence
MVIPMTVNND FLAQLNTSQR RAVEHFCGPL LVVAGAGSGK TRALTYRIAH LISYHQVEPE 
SILAVTFTNK AAREMKERIE KLFAQEMAFK KHGIRFDLLN EYEQKQLLSK VYKSTTKKLW
IGTFHSLCTR ILRYDINKYQ DERGRQWQRN FSIFDESDAQ SLVKNIVTKQ LNLDDKKFDP
RSVRYQISNA KNLGLSPDQY LQKNPSYKSR VIAEVYNEYQ SQLAANNALD FDDLILIPVR
LFQQNESILG YWHSQFKHIL VDEYQDTNQI QYELIRLLAT NGETKKSEWN WQNRSIFVVG
DADQSIYSFR MADFTILLNF QSDFGDGLPD DDTRTMVKLE ENYRSRENIL EAANHLIENN
SQRIDKILKP TRGSGDFIYC YKADDEQIEA QLVIEHIQKL VRENPELNWG SFAILYRINA
QSRPFEDRLI MNSIPYNIVG GFKFYDRQEI KDAIAYLRLI ANPSDTVSLL RIINTPRRSI
GKTSIESLLK AAQELEIPLW EIISDETSVN TLAGRAAKSV NKFAQMIKSF QEQLDSLSGA
EILNQVMEAS GYVDDLKQKG TEEADNRLAN IFELYNAVLQ FQEDSEDQSL QGFLSNASLA
SDLDDLKEGQ EKVSLMTLHS AKGLEFPVVF LVGLEEGLLP HARSVSDPLS LEEERRLCYV
GVTRAQEQLF LTYARERFVW GSKEAKMPSR FLQELPSDLI KSNVPPKVVK SRPSLTTSSP
KILKQSNETW SVGDQVIHQV FGEGEITRIL GSGKKSSLAI KFPGLGQKII DPTIAPLKRN
E
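
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is a minimal translator, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the input is an unspliced, in-frame coding sequence. The names `BASES`, `AMINO`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO[16 * a + 4 * b + c]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGTTATCC CCATGACAGT TAATAATGAT"))  # MVIPMTVNND
```

The final two codons of the gene, GAG TAA, translate to E followed by a stop, which matches the last residue of the protein sequence.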