Gene PCC8801_4550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4550 
Symbol 
ID7095929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011723 
Strand
Start bp36115 
End bp37203 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content51% 
IMG OID643467530 
Productintegrase family protein 
Protein accessionYP_002364826 
Protein GI218203973 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones85 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.798479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCA ATGCCACCCC TTCCCCTCTC AGTGAGGCTG ACTTATTAGC TCGCTTTGCC 
CAGTTTGTCC GTCTCGATAC CGCCGATGGC AATGCTTCTG ACGATACGGT GAAAACCTAC
GCCTGTACGG TTAAGCAATT TCTCGGTTGG TGCGACGCGC AACGGCTTCA TCCTCTCGAT
GCGACGCGAG ATGACCTCAA ATCCTATCGG CGTTGGTTAG TGGACGGTCA ACAATACAAA
TGTGCCACCA TCGCCTTGAA ATTGACGGTG GTGCGTCGTT TTTATGCAGG GGCGGTGGAA
CGGGGGCTGA TTCTGGCTAA TCCGGCTTTG GGCATTAAAC CCCCCAGGGA AACCATCGAT
CCGGCTGAGC GCATTAATTA CCTCGAAGAA GCCGAAGTAA CGCGGTTATT GGAGAGTTTG
CCCACTGAGA ATACGGTGGG GGCGTTGCGG GATAGGTTTC TGGTGGCCGT CATGGTTTTG
GAGGGATGCA GAACCGTGGA AATGCACCGC GCTTCTATTG GGGATATTGT AAAACGAGGT
GGTGATATCG GCATTCGGGT ATCAGGAAAA CGATCTCGAC GCATTGTGCC GTTAACGCCT
GATTTAGCCA AGCTGCTGAA TAAGTATCTG AATGCTAGGA AGCGGTCAGG GGAGGCATTG
TTAGCGGATA CTCCTTTGTT TATTGCGTTA GATAAAAGGA CGTATGGAGG GCGATTAAGT
CGTCGTTCGA TTCAGCGAGT AATTGATAAG TATTTACAGG CATCAGGGTT GAAAGAGCAG
CCGACAAAAC AAAAAAGCCC AAAACGGGCA TCTAATCAGT CTCATCAACC GTCTAGCGGG
GAGAAACAAC GCTCTTCACA ATCTGCTTCA TCTACGTCTA CTAAGTTTCA ACAACCAGAG
CGACAGTTGA GCGCACATTC TTTGAGGCAT ACGGCAGGGA CATTGGCCAT CAGGGCAGGT
TCGGATTTAA GGCAGGTGCA GGATTTGTTA GGCCATGCTG ATCCCAGGAC GACTGCTTTG
TATGCTCATG TGGCTGATCG GTGGCGCAAT AATCCAGCCT TGAGGTTGGG GGTCAAGGTT
CCGCTTTGA
 
Protein sequence
MLTNATPSPL SEADLLARFA QFVRLDTADG NASDDTVKTY ACTVKQFLGW CDAQRLHPLD 
ATRDDLKSYR RWLVDGQQYK CATIALKLTV VRRFYAGAVE RGLILANPAL GIKPPRETID
PAERINYLEE AEVTRLLESL PTENTVGALR DRFLVAVMVL EGCRTVEMHR ASIGDIVKRG
GDIGIRVSGK RSRRIVPLTP DLAKLLNKYL NARKRSGEAL LADTPLFIAL DKRTYGGRLS
RRSIQRVIDK YLQASGLKEQ PTKQKSPKRA SNQSHQPSSG EKQRSSQSAS STSTKFQQPE
RQLSAHSLRH TAGTLAIRAG SDLRQVQDLL GHADPRTTAL YAHVADRWRN NPALRLGVKV
PL