Gene PCC8801_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2086 
Symbol 
ID7104324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2157309 
End bp2158463 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content47% 
IMG OID643475143 
Productprotein of unknown function DUF58 
Protein accessionYP_002372274 
Protein GI218246903 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTT TATCTTCTTT GACTGAATGG TTAGAAACCC ATTGGGTGAC CCCTGCTTTT 
AGTGGCTGGT TATTAGCGGG ACTGGCTATC TGTTTTTTTG GGGCAGCTAC TAATACCATG
GCCGGCTGGT TATACGTTCT GAGTGGGACT ATTTTTGCCT TATTGGGGTT AGGGGCAATT
TTACCGATGC GATCGCAACG TCACCTTAAA GTCCATCGTC CTCTCATTTC CCCCGTCAGT
GCAGGAGAAG AGCTTACGAT TGAACTCATC ATCGAAAATA CAGAGAAAAC CGCCAAAACC
CTGCTAGAAG TTAGGGATCT GGTTCCCCAT GTCCTCAGAA CCCCCGTTAA AACCGCTATT
GAAGTGATTC CTCCCCAAAA TAAGTATTCG TGGATCTATT ATCTCCCAAC GCAACGACGG
GGAGTTTATC GTTGGCAAGA GGTGGAAGTG CGAACGGGAA CCCCCCTAGG ACTGTTTTGG
TGTCGTCGTC ACCAAGAAGT CCCGGCTAAG GGTATTGTTT ACCCACAGGT TTTACCCCTT
ACGCAATGTC CTCTAGTGGA TACCATCGGA CAAGAGGACA GTGATACTCT ACAGAGCGAT
CGCCACTATC AAGCTGCCAA CGAAGGGGTA ACAAAAACCC TACGTCCCTA CCGTTATGGC
GATCCTATGC GTCTGATCCA TTGGCGTACC AGTGCCCGTT TTGATGAATT TAAGGTCAGA
GAATTGGAAA TTATCACCGG AGGAGAGGAC ATTCTCATCT GTCTCGATAG TGCTTCTCCA
TGGCAACCTG ATAATTTTGA ACAAGCGGTA ATTGCCGCCG CTTCGTTATA TTTTTATGCC
CTACGTTCAG AACTCAATGT TAAATTTTGG ACGGCTGGAA CGGGGGTTAT TCATGGCAAC
CGTCAAGTAT TAGAAACCTT AGCAGCGATC GCATCAGAAG AAGAGACACT TAATCTACCT
TTTCCCAAGT TACCGACGAT TTGGCTCACC CAAAATACCG CTACCTTAGA CACCCTTTCT
CAGGGAAGTC GTTGGGTGGT TTTTGCTACA GGACAAACCC CAGATGCTCA ACAACTAATA
AACCCTTCTA CCGGTGGTTT AGTCATTGAT CCTGAGCAAC CGTTAGCCCT CCAATTACAA
AAACCGTTAA GATGA
 
Protein sequence
MKTLSSLTEW LETHWVTPAF SGWLLAGLAI CFFGAATNTM AGWLYVLSGT IFALLGLGAI 
LPMRSQRHLK VHRPLISPVS AGEELTIELI IENTEKTAKT LLEVRDLVPH VLRTPVKTAI
EVIPPQNKYS WIYYLPTQRR GVYRWQEVEV RTGTPLGLFW CRRHQEVPAK GIVYPQVLPL
TQCPLVDTIG QEDSDTLQSD RHYQAANEGV TKTLRPYRYG DPMRLIHWRT SARFDEFKVR
ELEIITGGED ILICLDSASP WQPDNFEQAV IAAASLYFYA LRSELNVKFW TAGTGVIHGN
RQVLETLAAI ASEEETLNLP FPKLPTIWLT QNTATLDTLS QGSRWVVFAT GQTPDAQQLI
NPSTGGLVID PEQPLALQLQ KPLR