Gene PCC8801_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3990 
Symbol 
ID7105271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4176337 
End bp4177515 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content43% 
IMG OID643476985 
Productprotein of unknown function DUF1239 
Protein accessionYP_002374085 
Protein GI218248714 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAT CAAACCATCC CTCACCCCCC CAAAATTATT TAAAAGGAGG GGTAATAGTG 
TTATTGTGCT TCCTAGTAGC CTGTCAAGGG TCTAATCAAT CCCAAAATAC CAATAACCAG
CAGCAAGAAA CGGCTGAAGT GGAAAGTGGA TTAATTCTCA ATAATGCCAC CTTAGAACAA
GCCAACCCCA AAGGACAAAT ATTGTGGAAA GTTCAAACTG ACGAAGCTGC CTATAGTCCC
GATCGCAAAA AAGCCCAATT AACAGGAGTC AAAGGCAATA TTTACCAAGA TGGCAAAATT
GTCCTGCGGG TTAAAGCCGA TCAAGGAGAA ATTAACCGAG ATGGACAGGA AATCCTTTTG
AAAAACAACG TCGTCGCGGT TGATCCCCGT AACAACACCG TGATCCGTAG CGAAGAAGTC
GAATGGCGGC CACAAGATTC GGTGCTGATG GTTCGCAAAA ATCTCCGGGG TTCCCATCCT
CAACTAGAAG CAACGGCCAA AGAAGCCAAA TACTTCGCCA AAAAGCAACA ATTTGAACTA
ATCGGCAATA TTATCGCTAC AGCCAAAAAT CCTCGCCTAC AACTGAAAAC AGAACACTTG
ATTTGGGATG TACCCCAAGA TAAAGTAATC GGCGATCGCT TACTCAATGT TGTTCGTTTT
GAGGATAAAA CCATTACCGA TCAACTGGTG GCTAACCAAG CTCAAGTCAA CTTGAAAACG
AAACAAGTCC TAGTGGAAAA AAATATTGAG TTTAAATCCC TCGAACCCCC TTTACAAGTC
GCTACTAACG AAATTCTCTG GAAATATAAG GATCGTCAAG TGACCAGTAG CAAACCTGTA
AAATTGATTG AATATCAACG GGGCGTTACT GTTATTGGCA ATGAAGCGCA GGTCGATTTC
CCTCAGAATA TGGCCTATTT GCGCCGTGGT GTTCAAGGAA GCGGTCGTGC TAACGGGTCC
AAACTTTATT CTAATGATCT AACCTGGAAT ATTAAAGATC AAACCATCGA AGCTCTGGGC
AATGTGATCT ATGAACAAGC GGGTGATCCT CCCTTTAATC TGACGGGAGA AAAAGCAACT
GGAACCTTAC ATAACAACAA TATCGTGGTT CATGGCAACC CACAAGATAG AGTTGTTACT
GAAATTTTCC CCGAAGACCT CAACTCCAAT TCACGTTAG
 
Protein sequence
MTQSNHPSPP QNYLKGGVIV LLCFLVACQG SNQSQNTNNQ QQETAEVESG LILNNATLEQ 
ANPKGQILWK VQTDEAAYSP DRKKAQLTGV KGNIYQDGKI VLRVKADQGE INRDGQEILL
KNNVVAVDPR NNTVIRSEEV EWRPQDSVLM VRKNLRGSHP QLEATAKEAK YFAKKQQFEL
IGNIIATAKN PRLQLKTEHL IWDVPQDKVI GDRLLNVVRF EDKTITDQLV ANQAQVNLKT
KQVLVEKNIE FKSLEPPLQV ATNEILWKYK DRQVTSSKPV KLIEYQRGVT VIGNEAQVDF
PQNMAYLRRG VQGSGRANGS KLYSNDLTWN IKDQTIEALG NVIYEQAGDP PFNLTGEKAT
GTLHNNNIVV HGNPQDRVVT EIFPEDLNSN SR