Gene PCC8801_1533 details


Gene Information

Locus tag: PCC8801_1533
Symbol:
ID: 7104171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1607712
End bp: 1608992
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 35%
IMG OID: 643474606
Product: hypothetical protein
Protein accession: YP_002371743
Protein GI: 218246372
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID:
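
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 1607712
END_BP = 1608992


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1281 426"
```

Under these assumptions the computed values agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.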


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTCCC CTGACACTCA AGCTATTTTA CTTCTTTGTG CAAGTTTTGG TCAAAATCGT 
CAAACCGAAC CCTTTCCTTT AACTCTTGGT GAATATAATA CCCTTGCTGG TTGGTTAAGA
GCAGAAAATA TGTTTCCTCA AGACCTACTT AACCCTAATT TTATCAGTCG TCTTTCTCAG
TTAACCATAG GTAAATTAGA TTCTAAACGA TTAGTTGCAT TGCTACAAAG AGGGGGATTA
TTAGCTTTGA CTGTTGAAAA ATGGCTTAAT CAAGGTTTAT GGATTATTAG TCGTGGAGAT
GCTGACTATC CCCTGCGATT AAAACAACAA TTAAAATATT TAGCTCCTCC TATTTTATAT
GGAATTGGGA ACAAAGATTT ATTATCAAAA GGGGGGTTAG CTGTTGTTGG TTCTCGTAAT
GTGGATCAAG AAGGATTAGA CTATACTTAT CACGTTGTAG AAGCTTGTGC AGAACAAAAT
ATTCAAGTGA TTTCAGGAGG TGCAAAAGGG GTTGATCAAG CTTCGATGTT AGGAACGTTA
AAAGTAGGAG GTACAGTGAT TGGCGTATTA GCTAATAACT TACTTAAAGC ATCTGTTGAT
GGAAAATATC GTACCAGTAT TAAAGAAGGA AAACTAACTT TAATTTCTGC GGTTGATCCT
AATGCTTCCT TTCATGTTGG TAACGCTATG AGACGTAATA AATATATCTA TGCTTTGGCT
AATTATGGGT TAGTTATTAG TGCTGACTAT AACACAGGTG GAACATGGGC AGGAGCAACA
GAAGCTTTAA ATACAATTAA GGATGTCCCT ATTTTAGTGC GAATACAGGG AACAATATCA
GAAGGCAATC AACATTTATT AAAACAAGGT GCAAAACCTT TTCCTGAAAC TCCTTGGAAT
CGTCCGATTA AAGAATTAAT TGAAACTACT GTATCAGAAT ATAAAAGCAT AGAATTTCGT
CAAAATAATA CTCAATTGAA TTTATTTAGT CAGGATAATC ATTCTGTTGT TTCTGACAAT
AAAGATGAAC TTACACCGCA AGATCCTGAT ATCTCTTCCC GTGATGATGC TTTGAAATCG
GCCTCAGAAA GACTTTATTA TGCTGTTTTA CCTATCATTC TTCAAGAACT AAACCAACCA
CAAGATCCGA AATCTTTAGC AACTAATCTA GATGTTCAAG TTGGTCAACT AAGCAAATGG
CTAAAAAAAG CAGTTACAGA TAAAAAAGTT ATCAAACAGA CTAAAAATAA CCAAGTTATT
TATAAATCAA ATAAAGTATA A
 
Protein sequence
MLSPDTQAIL LLCASFGQNR QTEPFPLTLG EYNTLAGWLR AENMFPQDLL NPNFISRLSQ 
LTIGKLDSKR LVALLQRGGL LALTVEKWLN QGLWIISRGD ADYPLRLKQQ LKYLAPPILY
GIGNKDLLSK GGLAVVGSRN VDQEGLDYTY HVVEACAEQN IQVISGGAKG VDQASMLGTL
KVGGTVIGVL ANNLLKASVD GKYRTSIKEG KLTLISAVDP NASFHVGNAM RRNKYIYALA
NYGLVISADY NTGGTWAGAT EALNTIKDVP ILVRIQGTIS EGNQHLLKQG AKPFPETPWN
RPIKELIETT VSEYKSIEFR QNNTQLNLFS QDNHSVVSDN KDELTPQDPD ISSRDDALKS
ASERLYYAVL PIILQELNQP QDPKSLATNL DVQVGQLSKW LKKAVTDKKV IKQTKNNQVI
YKSNKV
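
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do that with the Python standard library only. It assumes the sequences are pasted as strings with the 10-base spacing shown above, and it builds the codon table from the compact NCBI-style amino acid string, whose assignments are the same for translation table 11 as for the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCTTTCCC CTGACACTCA AGCTATTTTA"))  # prints "MLSPDTQAIL"
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the same string can be compared with the GC content field after rounding to a whole percent.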