Gene PCC8801_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1201 
Symbol 
ID7104900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1248584 
End bp1250392 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content48% 
IMG OID643474286 
Productsingle-stranded nucleic acid binding R3H domain protein 
Protein accessionYP_002371424 
Protein GI218246053 
COG category[S] Function unknown 
COG ID[COG3854] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACACA GCAAAGCAAA GAAGGCTTTA ACTCCTGATA CATCCTCATC AACATCTCTA 
CCTTTAATGC CAAATCAAAC CCTTGCCCAA CGGATGCAAA TTACCGATGA TATTGGTAAA
CTTTTAGCAA TTTTACCCGT AAATATTCGA GAACACATTG AAACCCATCC CCAAAAGGAT
CAATTGATCG AAGTGGTGAT GGACTTAGGA CGGTTGCCAG AAGCGCGTTT TCCCAATGAG
TCCGTCTATT TAGGAGAAAC GCCCATTTCT GAGGAAGATT TACAGCATTG TATCGACCGT
GTGGGACATT TTAGCGGGGA CAACCGCGCC GGAATTGAAC GGACTTTACA CCGCATCAGT
GCCATTCGCA ACCGCACAGG GAAAATTATC GGGTTAACCT GTCGCATGGG ACGGGCAATT
TTTGGCACCA TTACCATGAT TCAGGATTTG GTAGAAACGG GCAAATCCTT ATTGCTGTTG
GGTCGTCCTG GGGTGGGAAA AACCACAGCT TTACGGGAAA TTGCACGGGT TTTAGCCGAT
GACTTACATA AACGGGTGGT GATCATCGAT ACCTCTAATG AAATTGCTGG AGATGGCGAT
ATTCCCCATC CGGCCATTGG CCGTGCCCGA CGGATGCAGG TGGCCAGTCC TGAATTGCAG
CATCAGGTGA TGATTGAAGC GGTAGAAAAC CATATGCCCG AAGTCATCGT TATTGATGAA
ATTGGTACGG AACTCGAAGC ATTAGCCGCC CGAACGATAG CCGAACGGGG GGTACAATTG
GTTGGAACTG CCCACGGAAA CCGCATCGAA AATTTGCTCA AAAATCCCAC CTTATCGGAC
TTGGTAGGGG GGATTCAAGC GGTAACACTG GGGGATGAGG AAGCCCGGCG ACGGGGTTCG
CAAAAGACCG TTTTAGAACG AAAAGCTCCT CCTACCTTTG AAATTGCGAT CGAAATGCTC
GAACGTCAGC GTTGGGTAGT CCATGAAGAT GTCTCCCATA CGGTGGACAT TCTCCTGCGT
AATGGGGAAC CGAACCCCCA ACTACGGACG ATTAATGAGC ACGGTGAGGT AGAAATTAGC
CAAGAACCCC CCCAAAAACC GCCGATGGGA TTATTAGGTC ATAGTGCGAG TTTGGAGACG
GTAGACCGTC CCACAGGATG GCGGGCATCA GGACGGATGA GTCCCGTTGC CCCCTTTGGA
ACCCAAAAAG GGACTGTATC GGAGTTTGAT CGCCTGTTGG ATGAGTCTTG GCATCAGCCA
GAAAGTTCAG AAAAAGTGCG GATTCCTGGT CCTAATGGGG AAGATTGGCC GGTCTATGTG
TATCCCTACG GAATTGGGCG ATCGCAGATT GAACAGGTGA TCGAGATTTT AAATTTACCG
ATAGTCTTGA CAAAAGACCT CAGTCATGCT GATGTGGTGG TGGCTCTGCG ATCGCACGTT
AAAAACCACT CTAAGTTACG TCAGATGGCA AAGGTGCGTC AAATTCCTAT CCACGGGGTT
AAATCTAATA CGATTCCTCA AATTACCCGC ACTTTACAGC GCATTTTAGG GATGGATGAG
CCAAAACAAC CAGAAACGGC GGATATTCGT TTGTTTACCC GTGGGGGAGA TGATGATGAG
TTGGAAGCGT TGGAGGAGGC ACGCTTAGCT GTAGAACAGA TTGTACTCCC GAAAGGACAA
CCTGTGGAGT TGTTACCCCG TTCTCCTAAA GTGCGAAAAA TGCAGCATGA ATTAGTGGAA
CATTATCGGT TACAGTCGGA TAGTTTTGGA GAGGAACCTA ACAGGAGATT ACGAATTTAT
CCTGCTTAA
 
Protein sequence
MLHSKAKKAL TPDTSSSTSL PLMPNQTLAQ RMQITDDIGK LLAILPVNIR EHIETHPQKD 
QLIEVVMDLG RLPEARFPNE SVYLGETPIS EEDLQHCIDR VGHFSGDNRA GIERTLHRIS
AIRNRTGKII GLTCRMGRAI FGTITMIQDL VETGKSLLLL GRPGVGKTTA LREIARVLAD
DLHKRVVIID TSNEIAGDGD IPHPAIGRAR RMQVASPELQ HQVMIEAVEN HMPEVIVIDE
IGTELEALAA RTIAERGVQL VGTAHGNRIE NLLKNPTLSD LVGGIQAVTL GDEEARRRGS
QKTVLERKAP PTFEIAIEML ERQRWVVHED VSHTVDILLR NGEPNPQLRT INEHGEVEIS
QEPPQKPPMG LLGHSASLET VDRPTGWRAS GRMSPVAPFG TQKGTVSEFD RLLDESWHQP
ESSEKVRIPG PNGEDWPVYV YPYGIGRSQI EQVIEILNLP IVLTKDLSHA DVVVALRSHV
KNHSKLRQMA KVRQIPIHGV KSNTIPQITR TLQRILGMDE PKQPETADIR LFTRGGDDDE
LEALEEARLA VEQIVLPKGQ PVELLPRSPK VRKMQHELVE HYRLQSDSFG EEPNRRLRIY
PA