Gene PCC8801_2612 details


Gene Information

Locus tag: PCC8801_2612
Symbol:
ID: 7103603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2701100
End bp: 2702239
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 41%
IMG OID: 643475653
Product: protein of unknown function DUF21
Protein accession: YP_002372772
Protein GI: 218247401
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
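
The coordinate and length fields above are related by simple arithmetic, and a short check makes that explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2701100
end_bp = 2702239
gene_length_bp = 1140
protein_length_aa = 379


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both values should agree with the reported lengths if the assumptions hold.
print(span_length(start_bp, end_bp), gene_length_bp)
print(coding_codons(gene_length_bp), protein_length_aa)
```

Under these assumptions the computed span matches the reported gene length, and the codon count matches the reported protein length.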


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTCTTG ATCCCCCTTT TTACCAGATT ATCTTGCTAA CTCTAGATTA TCATTTCTTA 
GCGAGTACCG AACCTGCTCC TTTTTTAGGG CAGGTTTGGC TCGATTTAGC GGCAATCGTC
TTAATGCTGC TGATGTCTGC TTTTTTTTCC GCCTCAGAAA CCGCTATTAC TGCCTTTGAT
AATTTTAAAC TCAGGGGACT CATTGAGCAT CAAGGAGATC CTTCAGGAAT TTACCGCTTA
GTTCTCGAAA ATCGACGGCG TTTTATTACA AGTCTTTTAG TCGGGAATAA TCTGGTTAAT
AATTTTTCGG CTGTTCTTAC GAGTAATTTA TTTGCCATTT GGTTAGGTAA TGCAGGATTA
GGCATAGCAA CGGCTATTAT TACGGTTTTT ATTTTGATTT TTGGAGAAAT AACCCCAAAA
TCCCTAGCTA TTCTCCATAA TCGTGCTTTT TTTCGCCTTT CCGTTCGACC TGTTTTCTGG
TTGTCTCAAA TACTAACGGC GATCGCCATT GTTCCCATTT TTGAAACCAT TACCCAAAAG
ACCATTCAAA TTTTTCAAGG AAAATCCGAT AAAAATGCCC ATTCTGGAGA ATCTTTGCGG
GATTTACACC TAATGATCAA GATTTTGGGA GGCAAAGGGA CATTAGATTT GTACCGACAC
CAGTTACTGA ACAAAGCGTT AATGCTCGAT CAGTTAATAG CGAAGGATGT GGTCAAACCC
CGTATCGATA TGACTACGAT TTCCCATGAA TCTAGTTTAC AGCAATTCAT CGATTTATCT
CTCGAAACAG GCTATTCTCG CATTCCCGTC CAAGGAGAAT CGAAGGATCA GATAGTTGGG
ATAGTCAATC TTAAACAGGC ACTCCAGAAG CTGCAATCTG TTCCAAAACA AAGACTTTCG
GAGATAGCCG TCATTGAAGC GATGGATGCA CCGATTTATA TTCCTGAAAC TAAGCGGGTC
ACAAATTTGC TCAAGGAAAT GCTCCAACAA CGGTTTCATA TTGTCATTGT CGTCGATGAA
TATGGCGGAA CCGTTGGTTT AGTGACCTTA GAAGACATTT TAGAAGAATT AGTCGGCGAA
ATCTATGATG AAAGCGATTA TCCCTCGGTT CAGGAGTCCT TAGTTCAGCG TGATCCCTAA
 
Protein sequence
MSLDPPFYQI ILLTLDYHFL ASTEPAPFLG QVWLDLAAIV LMLLMSAFFS ASETAITAFD 
NFKLRGLIEH QGDPSGIYRL VLENRRRFIT SLLVGNNLVN NFSAVLTSNL FAIWLGNAGL
GIATAIITVF ILIFGEITPK SLAILHNRAF FRLSVRPVFW LSQILTAIAI VPIFETITQK
TIQIFQGKSD KNAHSGESLR DLHLMIKILG GKGTLDLYRH QLLNKALMLD QLIAKDVVKP
RIDMTTISHE SSLQQFIDLS LETGYSRIPV QGESKDQIVG IVNLKQALQK LQSVPKQRLS
EIAVIEAMDA PIYIPETKRV TNLLKEMLQQ RFHIVIVVDE YGGTVGLVTL EDILEELVGE
IYDESDYPSV QESLVQRDP
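
The sequences above are laid out in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup, using only the first row of the gene sequence as sample input; `clean_sequence` and `gc_fraction` are hypothetical helper names. Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG can serve as an initiation codon and is read as methionine at the start position.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence from this record (sample input only).
first_row = "GTGAGTCTTG ATCCCCCTTT TTACCAGATT ATCTTGCTAA CTCTAGATTA TCATTTCTTA"

seq = clean_sequence(first_row)
print(len(seq))          # bases in this row
print(seq[:3])           # start codon as written in the record
print(gc_fraction(seq))  # GC fraction of this row only, not the whole gene
```

Pasting all rows of the gene sequence into `clean_sequence` instead of the single sample row would allow the result to be compared against the Gene Length and GC content fields.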