Gene PCC8801_1678 details


Gene Information

Locus tag: PCC8801_1678
Symbol:
ID: 7101645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1761221
End bp: 1762393
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 44%
IMG OID: 643474748
Product: hypothetical protein
Protein accession: YP_002371884
Protein GI: 218246513
COG category: [I] Lipid transport and metabolism
              [R] General function prediction only
COG ID: [COG3240] Phospholipase/lecithinase/hemolysin
TIGRFAM ID: [TIGR02595] PEP-CTERM putative exosortase interaction domain
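
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1761221
END_BP = 1762393
GENE_LENGTH_BP = 1173
PROTEIN_LENGTH_AA = 390


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the reported fields.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 1173 bp, which is 391 codons, or 390 residues plus one stop codon.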


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAAC AACTTTTAGG GGGAATGGTG AGTGCTGTCT TTTCATCAAG TCTCCTTCTT 
GCCACACCGA CAACCCTAGC AGCCAATATC CCCGCTCAGC TTAGTACACT TACCCTAGAA
TTTAATCAAA AGCTCGATCA AACGATTGTT CAATTGAATG GGGAATTAAC GGGTATTAAT
ATTGCTTTAG CTGATGTCTA CACTCTTTAT CGAGATGTGT TACCCAACGA GTTTGCCAAT
AGTGACTCCG AGTGTTTTCA AGGACCTTTG TCTGCACCGA CAAGTATTTG CAATAATCCT
GATTCTTTTA TCTTTTTAGA TAATATTCAT CCAACATCAG CCACTCATAG TCGAATTGCT
GATATTTCCT TTGCCGTCCT TGATAATACG GTCAAAGCGT CGGTAGATGA AGTCTTTATT
TTTGGAGATA GTCTGAGCGA TCGCAATAAC TTCTATAGTT TTAGTAATGG GTTTGTCCCC
CCCACAGTAG CAACTGGTGG AGCGTTCGCT GGAACTCCTC TCTACAGTCC GGGGGCGTTC
ACTGATAACT TGTTATGGTG GGAACACCTG ATTAATGATT TAGGAGTATC CAGTCCCGTC
AGCTACTATG ATGATGTTTT CTTGAACATT TTCCCCACTG ATCCCACTGG AGGGATTAAT
TTTGCGGTTT CTGGCGCAAC AAGCGGACAA GATAATGCGG GTAATGCTAT GAACCCACCC
TTTCCCATCG ACTTACCAGG GTTAGAAGAT CAGGTAACTG CCTTTACTGG GTTATTTGCT
CCCGCTCAAC AGGCTAATTC AGACGCTCTG TACATCATCT GGGTTGGAAC GAATAATTTA
TTAGGGGCGT TTTCCCCCGT AACACCCGAT AATCCTTTTG CTCCCTTTAG CGATTTTACC
ACCAATGCCC AACAACCTGT CGATGATATT GCCGCAGCTA TTACGATTTT GCACGAGTTA
GGGGCTAGAA ATTTCCTGAT AGGTAATTTA TTTGATTTAG GGGATACACG GTTAGCGGCT
GAATTTGAAG CCATTCGAGT GTCTACTCCA GAACCTTCAG CAACTCCACT AATAACCTTT
CTGGCGGCCT TAGGACTGGG AACTCACGGG GTGAGAAAAA GCCTAAAGCG TAGACAGAGG
GTAAGAGGAG GGGTTCAGGA TCAATCAATC TAA
 
Protein sequence
MNQQLLGGMV SAVFSSSLLL ATPTTLAANI PAQLSTLTLE FNQKLDQTIV QLNGELTGIN 
IALADVYTLY RDVLPNEFAN SDSECFQGPL SAPTSICNNP DSFIFLDNIH PTSATHSRIA
DISFAVLDNT VKASVDEVFI FGDSLSDRNN FYSFSNGFVP PTVATGGAFA GTPLYSPGAF
TDNLLWWEHL INDLGVSSPV SYYDDVFLNI FPTDPTGGIN FAVSGATSGQ DNAGNAMNPP
FPIDLPGLED QVTAFTGLFA PAQQANSDAL YIIWVGTNNL LGAFSPVTPD NPFAPFSDFT
TNAQQPVDDI AAAITILHEL GARNFLIGNL FDLGDTRLAA EFEAIRVSTP EPSATPLITF
LAALGLGTHG VRKSLKRRQR VRGGVQDQSI
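
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The sketch below is one possible way to do that and to check the translation; it is not the database's own tooling. It assumes the standard elongation codon assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits, and this sketch does not handle alternative start codons). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as displayed above.
first_line = "ATGAATCAAC AACTTTTAGG GGGAATGGTG AGTGCTGTCT TTTCATCAAG TCTCCTTCTT"
print(translate(first_line))  # MNQQLLGGMVSAVFSSSLLL
```

The 20 residues produced from the first line match the start of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the reported GC content.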