Gene PCC8801_2868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2868 
Symbol 
ID7105964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2958529 
End bp2960166 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content36% 
IMG OID643475904 
Producthelix-hairpin-helix motif protein 
Protein accessionYP_002373023 
Protein GI218247652 
COG category[I] Lipid transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGAT CGCCTTGGAC ACAATGGATT AGTTCCCTAG TATTAATTGT GGGATTATGG 
GGGGTTTCTT CTTGTCAGTC TCAAGCTAAT CTTGAGCGTC CCAAACCATT ACCCCAAGAC
CCCTTAATTC AGGTTTATTT TAATCATAAT CAGTCTCAAG GATCTGATTA TACTGAACCT
TATCGTAATA TTACTCGTTC GGGAGATAAC CTCGAACAAA TCCTAATTGA TGCCATTAAA
GCTGCCCGTT CTACCATTGA TATTGCGGTT CAAGAATTTC GTTTACCCAA TGTAGCTAAA
GCCTTAGTAG AACAAGCAAA ACGAGGTGTT AAAGTAAGAA TTATTTTAGA AAATACCTAT
ACTACTCCTA TTAGTCAATC GGCTCAACAA ACCATTGATA ATGAAACAGA AAGAGAAGCG
GAAAAATCGG AGGAATATTT TGCTTTTGTT GATGTTAATC AAGATGGAAA ATTGAATTCA
GAAGAAATCA GCGATCGTGA TGCTTTAGTC ATTTTAAATC AAGCGAAAAT TCCAGTCATT
GATGATAGAG ATGATGGCTC AAAAGGAAGC GGGTTAATGC ACCATAAATT TATGGTAATT
GATAATAAAA TCGTCATGAC TGGTTCAATG AATTTTACGC CAAGTGACGT TCATGGAGAT
GTTACTAATT TAGAAACGCG AGGCAATGAT AATAATTTAC TGAAGATCAA TTCCGCTGAA
ATTGCTCAAG TTTTTACCGA AGAATTTAAT CTGATGTGGG GGGATGGAGT AGGAGGAAAT
TTTGATAGTC AATTTGGAGT TAAAAAATCA ATGCGATCGC CCCAAACTTT GACTGTTGGA
AATTCAAGAA TTACGGTAAA ATTTTCGCCT AATTCTAGAC AAGAAAATTG GCAAAATACA
AGTAATGGTT TGATTGAAAC GACTTTAAAT AGAGCAACTA ATTCGATTAA TTTAGCCCTG
TTTGTTTTCA GTGAACAAAC CTTAGTTGAT GACCTAGAAA AAAAGCATGA TCAAGGTGTA
GAAATAAGAG CCTTAATTGA CCCTGAATTT GTTTTTCGTA GCTATAGTGA AGGCTTAGAT
ATGTTAGGGG TTGCTTTGAG TGATAACTGT CGCTATGAAC CGAATAATAA ACCTTGGTTA
AATCCCATTG ATACGGTGGG TATTCCTAAT ATCCCAGACG GCGATAAACT GCATCATAAA
ATGGCTGTTA TTGATCAAAC TATTGTGATT ACGGGTTCCC ATAATTGGTC AGAAGCAGCC
AATCATCAAA ATGATGAAAC ACTTTTAATT ATTGAAAATC CGACAATAGC AGCCCATTAT
CAACGAGAAT TTGATAGATT ATATAGTACC GCCCAATTAG GATTACCCGA TTTTGTCCAG
AAAAAAATTC AAAAAGATAC TGACAATTGT CCGACTTTTT CTACTCGTAA ATCTTCCCGT
CATACCGATG AGATTATTAA TCTTAATACC GCCACTCAAG CAGAATTAGA AAGTTTACCC
GGAATTGGTG AAAAAACTGC TCAGAAAATT ATTGAAGAAC GTCAGAAAAA ACCCTTTACT
TCTTTAGATG ATTTAACGAG AGTATCGGGA ATTGGAGAGG CAAAAATTAA ACGATTACAA
GGTAAAGTAA CTTGGTAA
 
Protein sequence
MQRSPWTQWI SSLVLIVGLW GVSSCQSQAN LERPKPLPQD PLIQVYFNHN QSQGSDYTEP 
YRNITRSGDN LEQILIDAIK AARSTIDIAV QEFRLPNVAK ALVEQAKRGV KVRIILENTY
TTPISQSAQQ TIDNETEREA EKSEEYFAFV DVNQDGKLNS EEISDRDALV ILNQAKIPVI
DDRDDGSKGS GLMHHKFMVI DNKIVMTGSM NFTPSDVHGD VTNLETRGND NNLLKINSAE
IAQVFTEEFN LMWGDGVGGN FDSQFGVKKS MRSPQTLTVG NSRITVKFSP NSRQENWQNT
SNGLIETTLN RATNSINLAL FVFSEQTLVD DLEKKHDQGV EIRALIDPEF VFRSYSEGLD
MLGVALSDNC RYEPNNKPWL NPIDTVGIPN IPDGDKLHHK MAVIDQTIVI TGSHNWSEAA
NHQNDETLLI IENPTIAAHY QREFDRLYST AQLGLPDFVQ KKIQKDTDNC PTFSTRKSSR
HTDEIINLNT ATQAELESLP GIGEKTAQKI IEERQKKPFT SLDDLTRVSG IGEAKIKRLQ
GKVTW