Gene PCC8801_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4540 
Symbol 
ID7095919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011723 
Strand
Start bp30894 
End bp32288 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content43% 
IMG OID643467520 
ProductRelaxase/mobilization nuclease family protein 
Protein accessionYP_002364816 
Protein GI218203963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.727703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGAC ATATTGAACA CGGCAGTAAC TTTGGGGGAT TATTTCGTTA CCTGTTGGCC 
AGCGATAAAG GGGCGAGAAT TATTGGAGGC AATGCCGCCG GAGAAACCAT CGACCAATTA
ACCCAAGAAT TTAATAATTG TGCCGACCAA CGGCGCACCA CCACCAAACC TGTCAAACAT
TTTATACTCA GTTTTGCCCC TGAAGATGGT TTTGTGTCCG ATGACCTCAA ACAAACCCTT
GCCAGCTTTG CCATTCAACG ATTAGGCTAC ATTGATAACC AATATGTCGT CATTGACCAT
CAGCGACAAG ACCCCGGCCA TGATTGGAAC CATGACCACG ACCATATTCA CATCGTCGTT
AATATGATTA CCCTAGATGG TCAAAGAGTC GATGATTGGC AGGATAAACG GCGATTTGAA
GCCATTATCA GGGAATTAGA ACAAGAACAT CAGCTAACCC CTGTTGCCCC CAGTCGGGAA
AGAAACCGCA AAGCCTTAAC CCACGGACAG GTACAGAAAT ATAAACGAGA ACTGAGGGAT
TTGAGGGCGG GAAAACAGAC AGAACCCCCA GAAATTCCGA TTTCTGTTAA GTTACAGGCA
GCCATTGATG CAGCCAGTCT TGACCAGCCA ACGATGACCA TTTTTATTGG CAGACTGCAA
CAGTTAGGGA TTGATGTTAT GCCCATTGTG ACCGAAACAG GCAGAAAACG CATTAGTTAT
CAAATGCACG GGGCGAAGCC CTTTAGGGGC AGTAAACTGC ATAATGGAAG CTTCCCGAAA
TTAATCAGTC ATCGGGGCAT AGATTTGGAT TTACAACGGG ATAAAAAGGC GATGGATGAT
GCCGTGAACC ATCAACCAGT TATCATTCCT TCAGACCAAT TAATTGAGTG GTCACAGATT
AATTTAATCC CCTATCTTCC TAGTGAATTA CCCCTTGAAG AGTTAGAGAA AGAAACAAAA
GCGGGTCAAG ATTTAATTTC CCAATCCCTT GAGCCGAAAG ATAATCAATT GATGCCTGAA
TTGTTTTCTG ATGTTTTAAA CAAACAACAA GAATATACTT TAATTATTGC CCCCATTGCC
CAACGATGGT TACAAGTCAA CCAAACTGTT AGTTATCAAT CTAAACATCA TGTCATTGAA
TATGAGTCAG GAAAATTGAC GATTAAAGAT AACCAGGGAA ATTATCAAAT GATGGCTGTT
GCCGTGGGAA CTGATCAACA AAACCAAACA ATTTGGCAGT CGGGTAATTT ACCTCTGAAT
AGCCCTGGAC TGACCCAAGA AGATGTAGAA CGGTTTACCT CTTCAAAGAT GGAAAAAGCG
ATTAAAGAAA TTGAAGGTAA ACTGAATCAA GCCACCTCTC AATCATCTCA ACCAAGAAGG
AGGAGAAGAA GATGA
 
Protein sequence
MIGHIEHGSN FGGLFRYLLA SDKGARIIGG NAAGETIDQL TQEFNNCADQ RRTTTKPVKH 
FILSFAPEDG FVSDDLKQTL ASFAIQRLGY IDNQYVVIDH QRQDPGHDWN HDHDHIHIVV
NMITLDGQRV DDWQDKRRFE AIIRELEQEH QLTPVAPSRE RNRKALTHGQ VQKYKRELRD
LRAGKQTEPP EIPISVKLQA AIDAASLDQP TMTIFIGRLQ QLGIDVMPIV TETGRKRISY
QMHGAKPFRG SKLHNGSFPK LISHRGIDLD LQRDKKAMDD AVNHQPVIIP SDQLIEWSQI
NLIPYLPSEL PLEELEKETK AGQDLISQSL EPKDNQLMPE LFSDVLNKQQ EYTLIIAPIA
QRWLQVNQTV SYQSKHHVIE YESGKLTIKD NQGNYQMMAV AVGTDQQNQT IWQSGNLPLN
SPGLTQEDVE RFTSSKMEKA IKEIEGKLNQ ATSQSSQPRR RRRR