Gene PCC8801_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4037 
Symbol 
ID7104613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4230072 
End bp4231865 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content45% 
IMG OID643477032 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_002374132 
Protein GI218248761 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA TTATTGAAAC CTCAACCATC CCTACCACCT TAATTGACTT ATTACGCTTA 
CGCGCAAGCC AAACTCCCCA TAATCACGCC TATACCTTTC TCATTGATGG CAAAAAAGCA
ACCCCACCCC TAACCTACGC CGAATTAGAC CGACAATCAA GAGCGATCGC TGCCTTACTT
CAACAATACC AAGCCAGAGG AGAACGGGCT CTTCTGCTCT ATCCCCAAAG TTTAGAAGTT
ATTGCCGCCT TTTGTGGCTG TTTATACGCC GGAGTTATCG CTATTCCCGT TCCTCCTCCA
GAGTCCGGCC GACTCAAGCG TACTTTACCC AGATTACGCG CTATCGTCAA AGATGCTAAC
GCCAAATTTG CCTTAACAAC CGCTGGAATT TTCGACCTGA TTAACAATTT TAAGTCTGAG
TTTCCCGAAT TTGACCAAAT GAACTGGATA GATACTGCCA AGGTCGACCT ATCCCTCGCA
GATGACTGGC AAGATCCCAA CATCGACAAA GACGAGTTAG CCTATCTTCA GTATACGTCC
GGCTCTACTT CTACGCCAAA AGGGGTCATG CTCAGTCATT TTAACCTAAT GCACCACGCT
CGCTACCTCC AAAGGGCTTG TGGCTACGAA CCCGATAGCG TTACCCATAC TTGGATGCCC
TATTTTCATG ATTATGGGTT AGTTGAAGGT ATAATGGTTC CCCTCTACAA CGGAACCCCC
TGTTATCTGA TGTCTCCGTT CTCGTTTATT AAGCGTCCTA TCCAATGGCT GCACAATATC
ACAAAATACG GTGTTACCCA CTCCCAAGCC CCTAATTTTG CCTATGATTT GTGTATTCGT
CGCGTTAAAG ACAAAGATAT CCCCCAACTC AATTTAAGCT GTTGGCAAGC AGCCGGAAAC
GCAGCAGAAC CGATTAATCC GAGGGTCATG GCGGATTTTG TTGAAACCTT TGCTCCTTGC
GGTTTTTCTT GGGAAACTTT TGCTCCTGCT TTTGGGTTAG CGGAGTATAC GTTACTGGTA
TCGAGTAAAC CCAAGGGAAC TGCTCCTGTT TTTGTTTGTT TGGATAGTTC TGCACTAGAA
AGGGATAAAA TTGTTGAAGC TAACCCGGAT CAAGACCAAG GGGTGAGAAT AATGCCCAGT
TGTGGTCAGT TGGTCTGTGA GACCCAGGTA GCGATTGTTC GTCCTGACAC CTTAACCCGT
TGTGCTTCCG ATGAAGTAGG AGAAATTTGG GTCTCTGACC CCAGTATGTC TCAAGGCTAT
TGGCAACGTC CCCAAGAAAC CCAAGAAACC TTCGGAGCTT ACCTTAAAGA TACGGGAGAA
GGTCCGTTTT TAAGAACCGG AGATTTAGGG TTTCTTAAAG ACGGAGAATT ATATATTACG
GGACGGATGA AAGACTTAAT TATTATCCGA GGGACTAATC ATTATCCCCA AGATATTGAA
TGGACGGTAC AACATCTTAA CTCGGTTTTT CGTCCTGACT ATGGGGCTGC TTTTTCGATT
ACAGATCAGG GGGAAGAAAA GTTAGTCGTG GTTCAAGAAA TAGAACGCCG TAGCAGCGAC
TTGGATACAG AAAAATTATT AGCAGATATT CGTCAAGAAA TTGCTGAAGA ACACGAAATT
TTTACCCATG CCATTGTTTT AGCAAAGTCG GGAACTATCC TAAAAACCGC TAGTGGTAAA
ATTCAGCGTC GTGCTTGTCG TCAAAACTTT CTCAATGGAA CCATCAATAT TATCGCTGCT
TGGAGTGAAA ATCCGGCATT AGTTGCTAAT TTTAAAGAGT CTGAAACTGA CTAA
 
Protein sequence
MTQIIETSTI PTTLIDLLRL RASQTPHNHA YTFLIDGKKA TPPLTYAELD RQSRAIAALL 
QQYQARGERA LLLYPQSLEV IAAFCGCLYA GVIAIPVPPP ESGRLKRTLP RLRAIVKDAN
AKFALTTAGI FDLINNFKSE FPEFDQMNWI DTAKVDLSLA DDWQDPNIDK DELAYLQYTS
GSTSTPKGVM LSHFNLMHHA RYLQRACGYE PDSVTHTWMP YFHDYGLVEG IMVPLYNGTP
CYLMSPFSFI KRPIQWLHNI TKYGVTHSQA PNFAYDLCIR RVKDKDIPQL NLSCWQAAGN
AAEPINPRVM ADFVETFAPC GFSWETFAPA FGLAEYTLLV SSKPKGTAPV FVCLDSSALE
RDKIVEANPD QDQGVRIMPS CGQLVCETQV AIVRPDTLTR CASDEVGEIW VSDPSMSQGY
WQRPQETQET FGAYLKDTGE GPFLRTGDLG FLKDGELYIT GRMKDLIIIR GTNHYPQDIE
WTVQHLNSVF RPDYGAAFSI TDQGEEKLVV VQEIERRSSD LDTEKLLADI RQEIAEEHEI
FTHAIVLAKS GTILKTASGK IQRRACRQNF LNGTINIIAA WSENPALVAN FKESETD