Gene Cyan8802_3491 details

Gene Information

Locus tag: Cyan8802_3491
Symbol:
ID: 8392828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3559286
End bp: 3560425
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 41%
IMG OID: 644981424
Product: protein of unknown function DUF21
Protein accession: YP_003139150
Protein GI: 257061262
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
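
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3559286, 3560425)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```

Under these assumptions the computed values match the listed Gene Length (1140 bp) and Protein Length (379 aa).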


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.532364
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.141748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCTTG ATCCCCCTTT TTACCAGATT ATCTTGCTAA CTCTAGATTA TCATTTCTTA 
GCGAGTACCG AACCTGCTCC TTTTTTAGGG CAGGTTTGGC TCGATTTAGC GGCAATCGTC
TTAATGCTGC TGATGTCTGC TTTTTTTTCC GCCTCAGAAA CCGCTATTAC TGCCTTTGAT
AATTTTAAAC TCAGGGGACT CATTGAGCAT CAAGGAGATC CTTCAGGAAT TTACCGCTTA
GTTCTCGAAA ATCGACGGCG TTTTATTACA AGTCTTTTAG TCGGGAATAA TCTGGTTAAT
AATTTTTCGG CTGTTCTTAC GAGTAATTTA TTTGCCATTT GGTTAGGTAA TGCAGGATTA
GGCATAGCAA CGGCTATTAT TACGGTTTTT ATTTTGATTT TTGGAGAAAT AACCCCAAAA
TCCCTAGCTA TTCTCCATAA TCGTGCTTTT TTTCGCCTTT CCGTTCGACC TGTTTTCTGG
TTGTCTCAAA TACTAACGGC GATCGCCATT GTTCCCATTT TTGAAACCAT TACCCAAAAG
ACCATTCAAA TTTTTCAAGG AAAATCCGAT AAAAATGCCC ATTCTGGAGA ATCTTTGCGG
GATTTACACC TAATGATCAA GATTTTGGGA GGCAAAGGGA CATTAGATTT GTACCGACAC
CAGTTACTGA ACAAAGCGTT AATGCTCGAT CAGTTAATAG CGAAGGATGT GGTCAAACCC
CGTATCGATA TGACTACGAT TTCCCATGAA TCTAGTTTAC AGCAATTCAT CGATTTATCT
CTCGAAACAG GCTATTCTCG CATTCCCGTC CAAGGAGAAT CGAAGGATCA GATAGTTGGG
ATAGTCAATC TTAAACAGGC ACTCCAGAAG CTGCAATCTG TTCCAAAACA AAGACTTTCG
GAGATAGCCG TCATTGAAGC GATGGATGCA CCGATTTATA TTCCTGAAAC TAAGCGGGTC
ACAAATTTGC TCAAGGAAAT GCTCCAACAA CGGTTTCATA TTGTCATTGT CGTCGATGAA
TATGGCGGAA CCGTTGGTTT AGTGACCTTA GAAGACATTT TAGAAGAATT AGTCGGCGAA
ATCTATGATG AAAGCGATTA TCCCTCGGTT CAGGAGTCCT TAGTTCAGCG TGATCCCTAA
 
Protein sequence
MSLDPPFYQI ILLTLDYHFL ASTEPAPFLG QVWLDLAAIV LMLLMSAFFS ASETAITAFD 
NFKLRGLIEH QGDPSGIYRL VLENRRRFIT SLLVGNNLVN NFSAVLTSNL FAIWLGNAGL
GIATAIITVF ILIFGEITPK SLAILHNRAF FRLSVRPVFW LSQILTAIAI VPIFETITQK
TIQIFQGKSD KNAHSGESLR DLHLMIKILG GKGTLDLYRH QLLNKALMLD QLIAKDVVKP
RIDMTTISHE SSLQQFIDLS LETGYSRIPV QGESKDQIVG IVNLKQALQK LQSVPKQRLS
EIAVIEAMDA PIYIPETKRV TNLLKEMLQQ RFHIVIVVDE YGGTVGLVTL EDILEELVGE
IYDESDYPSV QESLVQRDP
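
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, along with a simple GC fraction calculation of the kind that could be compared against the listed GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "GTGAGTCTTG ATCCCCCTTT TTACCAGATT ATCTTGCTAA CTCTAGATTA TCATTTCTTA "
seq = clean_sequence(first_row)

print(len(seq))   # 60
print(seq[:3])    # GTG
print(f"GC fraction of this row: {gc_fraction(seq):.2f}")
```

The gene begins with GTG rather than ATG. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon, which is consistent with the protein sequence beginning with M.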