Gene PCC8801_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4140 
Symbol 
ID7104548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4342898 
End bp4344388 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content43% 
IMG OID643477129 
Productprotein of unknown function DUF344 
Protein accessionYP_002374228 
Protein GI218248857 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGATA ACCTAGATTT AAAATTAACC CTTGATAAAG AAACTTATCA ATCCCAACTA 
GAACAGTTGA TGCGTCAACT GCGATCGCTG CAAAAAGCGT GTTGGGACAA TAAACTCCCC
GTCATTATTG TACTCGAAGG TTGGGCAGCG GCCGGCAAAG GAACGTTATT ACAAAAAACT
ATTGGCTATA TGGACCCCCG TGGGTTTACG GTTCATCCGA TTTTAGCAGC GACTCCAGAT
GAGGAAAAAT ACCCGTTTTT GTGGCGATTT TGGCATAAAC TGCCAGCTAA GGGGAGTATC
GGCATTTTTT ATCACAGTTG GTATACCCAT GTCCTAGAAG ATCGTTTGTT TCAAAAGGTG
AATAACAGTG ATATTCCCCT CCTAATGCGC GATATTAACG CCTTTGAACA TCAATTAGTG
GATGATGGGG TAGCTATGGC TAAATTTTGG ATTCATTTGA GTCGAAAGGA GATGAAAAAG
CGACTCAAGA AGTATGAAGC GGATGAACTG GAGTCTTGGC GAGTTCGTCC AGAAGATTGG
CAACAGGCTA ACCGCTATGA TGAATATGCA GCCTTAGCTG AAGAAATGTT AACCTTTACC
AGTACGGGTC ATGCGCCTTG GACGTTAGTG GAAGGAGACT GTCAGCGATG GACTCGTATT
AAAGTATTAT CGCAGATTGT AGCCACCATT ACTCAAGCAT TAGATCTGCG GAAACTCCCT
CAAACCGCTA TTCCCTCCTT ACCTCCCCAA ACGGAATTAC AACCCACAGA GCCCGATTTT
TTGGGTAAGG TGGATTTAAG TCTGCATTTG TCTAAAGACG AGTATCGGCA ACGGTTAGGG
GAAGCGCAGG TTAAACTGCG TCAGTTACAA TTGCGGATTT TTCGGGAAAA TATCCCTGTT
TTAGTGACTT TTGAGGGATG GGATGCAGCC GGAAAGGGAG GGGCAATTAA ACGCCTTACG
GATACTTTAG ACCCGCGAAG TTACAAAGTC AATGCTTTTG CAGCCCCAAG CCAAGAAGAG
AAGCAATACC ATTATTTATG GCGATTTTGG CGATATTTGC CAGGGGGAGG AACAATAGGC
ATTTTTGACC GCAGTTGGTA TGGTCGGGTG TTAGTGGAAA GAATTGAAGG GTTTGCCAAT
GAGTTAGAAT GGCGGCGATC TTATAAAGAA ATTAATGAAT TTGAAGCCCA ATTAACCCAT
GGGGGCTATG TATTAGTTAA GTTTTGGTTA CATATTGGTT TGGATGAACA ATTAAGACGG
TTTGAAGAAC GGCGAGATAA TCCTTTTAAA AATTATAAAT TAACCGACGA AGATTGGCGA
AATCGAGATA AATTTCCGTT ATATTATGTC GCAGTTAATC AAATGATTGC CCGTACCAGC
ACCCCCGCAG CTCCTTGGTA TATTGTCCCT GGCAATGATA AATATTATGC CCGTGTTTTT
GTCATTGAAA CGTTGATTAG TGCTATTGAA ACTGAGTTAA AACGGCGATG A
 
Protein sequence
MLDNLDLKLT LDKETYQSQL EQLMRQLRSL QKACWDNKLP VIIVLEGWAA AGKGTLLQKT 
IGYMDPRGFT VHPILAATPD EEKYPFLWRF WHKLPAKGSI GIFYHSWYTH VLEDRLFQKV
NNSDIPLLMR DINAFEHQLV DDGVAMAKFW IHLSRKEMKK RLKKYEADEL ESWRVRPEDW
QQANRYDEYA ALAEEMLTFT STGHAPWTLV EGDCQRWTRI KVLSQIVATI TQALDLRKLP
QTAIPSLPPQ TELQPTEPDF LGKVDLSLHL SKDEYRQRLG EAQVKLRQLQ LRIFRENIPV
LVTFEGWDAA GKGGAIKRLT DTLDPRSYKV NAFAAPSQEE KQYHYLWRFW RYLPGGGTIG
IFDRSWYGRV LVERIEGFAN ELEWRRSYKE INEFEAQLTH GGYVLVKFWL HIGLDEQLRR
FEERRDNPFK NYKLTDEDWR NRDKFPLYYV AVNQMIARTS TPAAPWYIVP GNDKYYARVF
VIETLISAIE TELKRR