Gene Cyan8802_4032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4032 
Symbol 
ID8393383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4147373 
End bp4148551 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content43% 
IMG OID644981952 
Productprotein of unknown function DUF1239 
Protein accessionYP_003139665 
Protein GI257061777 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.449073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAT CAAACCATCC CTCACCCCCC CAAAATTATT TAAAAGGAGG GGTAATAGTG 
TTATTGTGCT TCCTAGTAGC CTGTCAAGGG TCTAATCAAT CCCAAAATAC CAATAACCAG
CAGCAAGAAA CGGCTGAAGT GGAAAGTGGA TTAATTCTCA ATAATGCCAC CTTAGAACAA
GCCAACCCCA AAGGACAAAT ATTATGGAAA GTTCAGACTG ACGAAGCTGC CTATAGTCCC
GATCGCAAAA AAGCCCAATT AACAGGAGTC AAAGGCAATA TTTACCAAGA TGGCAAAATT
GTCCTGCGGG TTAAAGCCGA TCAAGGAGAA ATTAACCGAG ATGGACAGGA AATCCTTTTG
AAAAACAACG TCGTCGCGGT TGATCCCCGT AACAACACCG TGATCCGTAG CGAAGAAGTC
GAATGGCGGC CACAAGATTC GGTGCTGATG GTTCGCAAAA ACCTGCGGGG TTCCCATCCT
CAACTAGAAG CAACGGCCAA AGAAGCCAAA TACTTCGCCA AAAAGCAACA ATTTGAACTA
ATCGGCAATA TTATCGCTAC AGCCAAAAAT CCTCGCCTAC AACTGAAAAC AGAACACTTG
ATTTGGGATG TGCCCCAAGA TAAAGTAATC GGCGATCGCT TACTCAATGT TGTTCGTTTT
GAGGATAAAA CCATTACCGA TCAACTGGTG GCTAACCAAG CTCAAGTCAA CTTGAAAACG
AAACAAGTCC TAGTGGAAAA AAATATTGAG TTTAAATCCC TCGAACCCCC TTTACAAGTC
GCTACTAACG AAATTCTCTG GAAATATAAG GATCGTCAAG TGACCAGTAG CAAACCTGTA
AAATTGATTG AATATCAACG GGGCGTTACT GTTATTGGCA ATGAAGCGCA GGTCGATTTC
CCTCAGAATA TGGCCTATTT GCGCCGTGGT GTTCAAGGAA GCGGTCGTGC TAACGGGTCC
AAACTTTATT CTAATGATCT AACCTGGAAT ATTAAAGATC AAACCATCGA AGCTCTGGGC
AATGTGATCT ATGAACAAGC GGGTGATCCT CCCTTTAATC TGACGGGAGA AAAAGCAACT
GGAACCTTAC ATAACAACAA TATCGTGGTT CATGGCAACC CACAAGATAG AGTTGTTACT
GAAATTTTCC CCGAAGACCT CAACTCCAAT TCACGTTAG
 
Protein sequence
MTQSNHPSPP QNYLKGGVIV LLCFLVACQG SNQSQNTNNQ QQETAEVESG LILNNATLEQ 
ANPKGQILWK VQTDEAAYSP DRKKAQLTGV KGNIYQDGKI VLRVKADQGE INRDGQEILL
KNNVVAVDPR NNTVIRSEEV EWRPQDSVLM VRKNLRGSHP QLEATAKEAK YFAKKQQFEL
IGNIIATAKN PRLQLKTEHL IWDVPQDKVI GDRLLNVVRF EDKTITDQLV ANQAQVNLKT
KQVLVEKNIE FKSLEPPLQV ATNEILWKYK DRQVTSSKPV KLIEYQRGVT VIGNEAQVDF
PQNMAYLRRG VQGSGRANGS KLYSNDLTWN IKDQTIEALG NVIYEQAGDP PFNLTGEKAT
GTLHNNNIVV HGNPQDRVVT EIFPEDLNSN SR