Gene PCC8801_3771 details

Gene Information

Locus tag: PCC8801_3771
Symbol:
ID: 7103991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3958576
End bp: 3961068
Gene Length: 2493 bp
Protein Length: 830 aa
Translation table: 11
GC content: 46%
IMG OID: 643476776
Product: recombination and DNA strand exchange inhibitor protein
Protein accession: YP_002373877
Protein GI: 218248506
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
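
The length fields above follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3958576, 3961068)
print(length, "bp")                      # gene length
print(protein_length_aa(length), "aa")   # protein length
```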


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATTCAAC AAGAAACCTT AGAACTTCTA GAATGGTCAC GGCTATGTCA ACACCTAGCC 
ACCTTTGCAG CGACAAAATT AGGGTCATTA TCGGCTCAAA AACTATCAAT TCCGACAAAT
ATTGAAGAAA GTAAACAGCT TTTAGCCCAA ACTCAAGAAA TTTATCGGTT AGAACAAAAT
TTAGACATAA AATGGTCTTT TGACGGCATT AACGACATTG GAGACTCCCT AGAACGAGCT
CAACTGGGGG GAATGCTATC AGGACAAGAA TTGCTCAACA TCGCTACAAC CCTAGCGGGA
GTAAGACGGC TCAGACGAAT TATCGAAAAT CAAGAAGATT TCCCTATTTT AGCCGAATTG
GTGGAAGATG TTCGGACGTA TCCAGAAATA GAGCAAAATA TCTACCATTG TATCGATGAA
GCCGGAAAAG TGGCCGATCG CGCCAGTGTT AAATTAGGAG AAATTCGTCG TCATCTCAAG
GATATCCGCG ATCGCATCGT CCAGAAACTC CAAAACATTA TCCAACGGCA AGGGGGAGCC
ATACAAGAAC CCGTCATCAC CCAACGGGGC GATCGCTTTG TTATCCCTGT AAAAGCCCCC
CAAAAAGACC AAATTCCAGG GATTATTCAC GACAGTTCTA GTACAGGAGC TACCCTGTAC
ATTGAACCCA ACTCCATTGT GGAATGGGGC AACAAACGCC GTCAATATCT ACGCCAAGAA
CAGGTAGAAG AAGAGGCGAT TTTACGGAAA TTAAGCGCGG AAATTGCCGA AGTTTACGAC
GATCTCGATT ACTTACTCGC CATTGCGACT ATTCTTGACT TAACGACAGC AAAAGCCCGT
TATAGCCTGT GGTTAGAAGG AAATGCCCCC AGATTCATCA ATTTTGACCA AACCGAACTC
ATTACCCTCC GACAATTACG CCATCCCCTC CTCGTTTGGC AACAGAAACA CGAACAGGGA
GTCCCCGTGG TTCCCATCAA CGTCCAGGTT GACCCCAAAA TCCGCGTAGT TGCCATCACA
GGACCCAATA CCGGAGGAAA AACCGTTACC CTCAAAACCA TCGGGTTAGC AGCGTTAATG
GCAAAAGTTG GGTTATTTAT CCCCGCAAAA GAACCCGTAG AAATACCTTG GTTTGAGCAA
GTTTTAGCCG ATATTGGCGA TGAACAGTCC ATTGAGCAGA GTTTATCGAC CTTTTCAGGT
CATATTCGCC GTATCGTTCG GATTACGGAA GGATTAGGCA ATCAGGAAGA TCAAGGCAAC
AGGCAACAGG CAACAGGCAA CAGTGAAGAT CAAGGCAGCA GCGAAAATAT CCCCCCCTCA
CCCCCTCCCC CCCTCCCCCC CTCACCCCCT CCCAATACCC TCGTACTACT CGATGAAGTC
GGTGCCGGAA CCGATCCCGC AGAAGGGAGT GCTCTGGCGA TCGCCCTCCT CAACTATTTA
GCCGATCATG CGCTGCTGAC AATTGCTACA ACCCACTACG GCGAACTAAA AGCCCTGAAA
TACCAAGATT CGCGCTTTGA GAACGCTTCA GTAGAATTTG ACGATCAAAC CCTTTCCCCT
ACCTATCGCC TGTTGTGGGG CATTCCAGGG CGGTCTAATG CCCTAATTAT TGCCCAACGG
TTGGGGTTAA ATTTAGAGAT TGTCCAAGAA GCCAAAACCC GTATCGGGGG CTTTTCTGAG
GAAATTAATC AAGTCATTGC AGGTTTAGAA GCGCAACGGC GAGAACAGGA ACAAAAAGCC
CTAGAAGCTA AGCAATTACT GCAAAAAACC GAGAAATTTT ATACCGAAGT TTCCGAAAAA
GCCACTTCTT TGCAACAGAG GGAACAGGAA TTAAAACGCT ATCAGGAACA GGAAGTTCAA
AAGGCGATCG CCCAAGCAAA AGAAGAAATT GCCCAAGTGA TCCGTCAGTT ACAACAGGGT
TCACAAACCG CGCAAAAAGC CCAACAAGCG ACAGAAGCGT TAGATCAAAT TACTCAACAT
CAGTTACCAA AAACCCCGAA AAAACAAGCG AGTTATCAAC CGAAAGTCGG GGAAAGAATT
AGGCTTTCTA ATTTAGGACA AACCGCAGAA GTGTTGGAAA TTGATGAAGA GGCACAAGCG
TTAACCGTTC GGTTTGGACT GATGAAAATG ACGGTTGCTT TGACAGAAAT TGAGTCCCTT
GATGGCAAAA AAGTTGAGAT TCAAACCCAG AAGAAAACGC CAACAGTTAC GGCAACAAAA
CCCGATAAAA GTGCATCCGT TCCCATTATT CGGACTTCGC AAAATACGGT AGATATTCGA
GGGAGTCGAG TCGCTGAAGC AGAGTCAGAT TTAGAGAAGG CGATCGCGTT AGCCACAGCT
TCAGGAATAT TGTGGATTAT TCATGGCAAA GGAACGGGGA AATTACGCCA AGGAGTCCAT
GAATTTTTGA AGCTACATCC GCAAATTGAT CGCTTTGAAT TAGCCCCCCA AAATGAAGGC
GGATCGGGGG TCACTTTGGC TTATTTAAAA TGA
 
Protein sequence
MIQQETLELL EWSRLCQHLA TFAATKLGSL SAQKLSIPTN IEESKQLLAQ TQEIYRLEQN 
LDIKWSFDGI NDIGDSLERA QLGGMLSGQE LLNIATTLAG VRRLRRIIEN QEDFPILAEL
VEDVRTYPEI EQNIYHCIDE AGKVADRASV KLGEIRRHLK DIRDRIVQKL QNIIQRQGGA
IQEPVITQRG DRFVIPVKAP QKDQIPGIIH DSSSTGATLY IEPNSIVEWG NKRRQYLRQE
QVEEEAILRK LSAEIAEVYD DLDYLLAIAT ILDLTTAKAR YSLWLEGNAP RFINFDQTEL
ITLRQLRHPL LVWQQKHEQG VPVVPINVQV DPKIRVVAIT GPNTGGKTVT LKTIGLAALM
AKVGLFIPAK EPVEIPWFEQ VLADIGDEQS IEQSLSTFSG HIRRIVRITE GLGNQEDQGN
RQQATGNSED QGSSENIPPS PPPPLPPSPP PNTLVLLDEV GAGTDPAEGS ALAIALLNYL
ADHALLTIAT THYGELKALK YQDSRFENAS VEFDDQTLSP TYRLLWGIPG RSNALIIAQR
LGLNLEIVQE AKTRIGGFSE EINQVIAGLE AQRREQEQKA LEAKQLLQKT EKFYTEVSEK
ATSLQQREQE LKRYQEQEVQ KAIAQAKEEI AQVIRQLQQG SQTAQKAQQA TEALDQITQH
QLPKTPKKQA SYQPKVGERI RLSNLGQTAE VLEIDEEAQA LTVRFGLMKM TVALTEIESL
DGKKVEIQTQ KKTPTVTATK PDKSASVPII RTSQNTVDIR GSRVAEAESD LEKAIALATA
SGILWIIHGK GTGKLRQGVH EFLKLHPQID RFELAPQNEG GSGVTLAYLK
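
The gene sequence begins with TTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that translation in plain Python, not the tool used to produce this record. The name `translate` is illustrative, and the sketch assumes the input is already in the coding orientation and in frame.

```python
from itertools import product

# Table 11 shares its amino-acid assignments with the standard code;
# the table is laid out in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    A permitted start codon in the first position is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "TTGATTCAAC AAGAAACCTT AGAACTTCTA GAATGGTCAC GGCTATGTCA ACACCTAGCC"
print(translate(first_line))  # MIQQETLELLEWSRLCQHLA
```

The output matches the first 20 residues of the protein sequence above.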