Gene PCC8801_4342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4342 
Symbol 
ID7102695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4555977 
End bp4557167 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content30% 
IMG OID643477321 
Producthypothetical protein 
Protein accessionYP_002374420 
Protein GI218249049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCTG AAGAAAGACC AAGAGATTTA AGCTATCGAG CTAGAAACGG AATATACTTT 
ACTCTGTTAA GTGGAGAATT TGAAGCTCAA TTTGAAACGA TTATTACAAA ATTATTACAA
TGGATTAAAT CTGACTATGA CTCTAATTTG CCAGTTTCAA ACATTAATCA TGGTGGATTT
AGAATACTTG ATACTTTTTT ATCTGATACT CTTAGCCAAG AATCCTTTAA AAGAATTTTT
GAAATATCTT CTATAAAAAA AATTCCAATA AAAATCTTAT TAGCTAATCC AGATAGTCAA
TTTGCTATTG CTCGACATAA CTCTTTAAGA CACTCAGTAC AACAAGAAAC TCAGCAAGAA
ATGAATCGAA GAAGAGAAAT CAGAGCTAAA ATTGGATTTC AAAAAATTCT TGAGAGTTTT
TTGAAATCAA AAAAAATAAA ATATGATGAT ATAGCAATTC AAGAACTGAG TTATGATAAA
ATGGCTGAAA AGTTTAATCA AATTAAATCA AATAATGACA GTCATATAGA AATACAATTT
TATACTGAGG TTCCTAGCGG ACCAATGCTC TTTTTTCAAG ATGTTCTTTT ATCAGGATTT
TACTGTGCAG GAATTTCTTC AAGAGAATTA CCTTGGTTAG TGATTATTGA TGATCCTAAT
ATTAACAATG ATATGTATGA TGTATTTAAT GCTGAATTTG AGAGGATATG GGAATTGAGT
AGTACAAATA GAGAACGACC GAGTTCTGAG CTTTATAACT ATGCTCATAG CTATTTTATT
AGCTATTCTA GTCAAGATAA AGAAATAGCG GATCACATAG AATTACTTCT TTGGAGAAAA
AATCGTTTAG TGATTAGAGA TGAAAGAGAT CTAACCTCTG GTCAAAATTT ATCAGAAGAT
ATTGAAAGTG TAATTGGTAA GTCACAAACA TTTTTATTTT TATGTAGTCA ATCTTCTAAT
CAAAGCGATT ATTGTAGAGG AGAAATTGAT GTAGCTTTTG AATACAAAAA GCTAAAAGAA
CAACAAGGTA ATAATGGACA AGAAGGAATA CAACGAATTG TTGTTATCTC TTTAGATGGA
CAAAAACCGC AAGATTTACG ACTTAGATCT TATTTACGTT TGCAGGGAGA GAACAGAACT
GAAAGAGAAT CATCAATTCG ACGAATTATA GATGAAGAGG AAAGGATATA G
 
Protein sequence
MFSEERPRDL SYRARNGIYF TLLSGEFEAQ FETIITKLLQ WIKSDYDSNL PVSNINHGGF 
RILDTFLSDT LSQESFKRIF EISSIKKIPI KILLANPDSQ FAIARHNSLR HSVQQETQQE
MNRRREIRAK IGFQKILESF LKSKKIKYDD IAIQELSYDK MAEKFNQIKS NNDSHIEIQF
YTEVPSGPML FFQDVLLSGF YCAGISSREL PWLVIIDDPN INNDMYDVFN AEFERIWELS
STNRERPSSE LYNYAHSYFI SYSSQDKEIA DHIELLLWRK NRLVIRDERD LTSGQNLSED
IESVIGKSQT FLFLCSQSSN QSDYCRGEID VAFEYKKLKE QQGNNGQEGI QRIVVISLDG
QKPQDLRLRS YLRLQGENRT ERESSIRRII DEEERI