Gene PCC8801_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1079 
Symbol 
ID7102249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1136508 
End bp1137590 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content48% 
IMG OID643474171 
Productrare lipoprotein A 
Protein accessionYP_002371309 
Protein GI218245938 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA AACTTTGGAG TGGACTAACA ACCACAGCAT TAACCACCGC TTTAGGAACC 
TCCGTTCTAC TCTGCGGTTC TTTCAATGGC TCGGTCGCAT CTGAGATGAA AGATAAAGCC
GATGACCTAG GAAAACTTCT AGGAGTGACG ACGACTGCCA ACACAGAAAG TTCCGAGCAG
TTAACAGCTT TAGCCCCAAA AGTTGTTAGT CTGGGTCAAG TAGATCTCCC TCAGGAGAAG
CAAGACCCCG AAAAAGGAAA CATTCAAGGC GCGTTTTTAG GGAATACGGG AGAAAACCTG
CTAGTGGAAC CTGAAAAACC CCTTTCAGCC ATCGCCACTC TCCATCCCCA TCAATGGAAA
TACAGATTAG CCATTACCCT AAGAGTTCGG GAAATTCCCG TCCTAACCTT CGTCGGTTCT
CAAGCGGATC TAGCCCAACT GAGAAATAAT CAAAATAACC CCGATGCTCC TCAAAAAGAC
AGCGAGGTGA TGAAAAAGGC CAAAGCCTTA GCCCAACGGC TCAATGAACT TGCTCAGGAT
GACACCTTTG AAGCCCAAAC CATCACCGTT AGCGAAATTC AAAAAAACAA AACCTATGGC
ATCAAAATCG ATGGAAAAGA ACTCGTTCGA GTTGATGGCC AAACCATTCT ACCCGACACC
ACCAACAATC TAGCAGCCGA TGCGCTACAA GTGACTAACC GTCTCCGTCG GTTGATGGGA
GGTGCATCCC CGTTAACGGC TATTAATCAA GTTCCTGATG GACTCGCTGG CGTTGAAGGA
CGAGTAACCA GCACCCGTAA AGGGATGGCC TCTTGGTACG GACCTGGATT TCATGGACGA
CGAACCGCTA ACGGAGAACG GTACAATCAA AACGGTCTAA CGGCGGCTCA TAAAACCCTT
CCTTTTGGAA CCCAAGTGAA GGTCACTAAC TTAAATAATG GTCGCTCGAT CACCGTTCGG
ATCAATGATC GCGGTCCCTA CGCCCACGGA CGGATTATTG ACTTATCCAA AGGCGCGGCG
CAAATTCTGG GCTTAGTCAG TAGTGGAGTA GCCCCGGTTC AAATTGAAAT CCTAGGGCGT
TAA
 
Protein sequence
MNKKLWSGLT TTALTTALGT SVLLCGSFNG SVASEMKDKA DDLGKLLGVT TTANTESSEQ 
LTALAPKVVS LGQVDLPQEK QDPEKGNIQG AFLGNTGENL LVEPEKPLSA IATLHPHQWK
YRLAITLRVR EIPVLTFVGS QADLAQLRNN QNNPDAPQKD SEVMKKAKAL AQRLNELAQD
DTFEAQTITV SEIQKNKTYG IKIDGKELVR VDGQTILPDT TNNLAADALQ VTNRLRRLMG
GASPLTAINQ VPDGLAGVEG RVTSTRKGMA SWYGPGFHGR RTANGERYNQ NGLTAAHKTL
PFGTQVKVTN LNNGRSITVR INDRGPYAHG RIIDLSKGAA QILGLVSSGV APVQIEILGR