Gene PCC8801_1377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1377 
Symbol 
ID7103068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1440560 
End bp1441720 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content46% 
IMG OID643474456 
Productcitrate synthase 
Protein accessionYP_002371593 
Protein GI218246222 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAT GTGAATATAT ACCCGGGTTA GAAAATATTC CCGCGGCCAA ATCAAGTATC 
AGCTATGTGG ATGGTCAAGA GGGCATACTG GAATATCGAG GCATTCGCAT CGAAGAACTA
GCAACAAAAG GTAGCTTTGT AGAAACCGCC TATCTCCTCA TTTGGGGTGA ACTACCCACC
CAAGAAGAAC TCGATGCCTT TGAAGGGGAA ATTCGTTACC ATCGCCGCAT CAAATACCGC
ATCCGTGACA TGATGAAGTG TTTTCCTGAA ACGGGACACC CCATGGATGC TTTACAAACC
TCAGCAGCAG CGTTAGGTTT GTTTTACGCC CGTCGCGCCT TGGATAACCC GGATTATATT
CGACAAGCGG TCGTTCGTCT ATTAGCCAAA ATTCCGACGA TGGTAGCAGC CGCCCATCAA
ATGCGCCGAG GAAATGATCC CATTCAACCC AACGATAACC TAGATTATGC TGCCAATTTC
CTCTACATGA TGACGGAACA AAAACCTGAC CCCCTAGCAG CAAAAATTTT TGATGTTTGT
CTGACGCTTC ATGCGGAACA CACCATCAAT GCGTCTACTT TCTCGGCCAT GGTAACGGCT
TCTACCTTAA CCGATCCCTA TGCCGTTGTC GCTTCGGCGG TAGGAACCTT AGCCGGTCCC
TTACACGGGG GGGCAAACGA GGAAGTTTTA GCGATGTTAG AGGAAATTGG GTCAGTGGAA
AATGTTCGTC CCTACATCGA AAAGTTGGTA GCCAATAAAC AGAAAATTAT GGGGTTTGGC
CATCGAGTTT ATAAGGTTAA AGATCCTCGC GCTACTATTC TGCAAAATTT AGCGGAACAA
CTCTTTGAAA AAACTGGCCG TGATGAATAT TATGCCATTG CCCAAGAAGT GGAAAAAGTG
GTGGAAGAAA AGTTAGGGCA TAAAGGAATT TATGCTAATG TGGACTTCTA TTCGGGGTTA
GTTTACCGTA AGTTAGGCAT TCCCAGTGAT TTGTTTACGC CCTTATTTGC GATCGCGCGC
GTAGCGGGAT GGTTAGCCCA TTGGAAGGAA CAATTAGCTG TTAACCGTAT TTTCCGTCCT
ACCCAAGTTT ACATCGGCGA ACGCAATCAG CCCTATGTTC CCATGGAAAA ACGGCTCATG
GTTAACCGTA ATGGCTTATA G
 
Protein sequence
MNVCEYIPGL ENIPAAKSSI SYVDGQEGIL EYRGIRIEEL ATKGSFVETA YLLIWGELPT 
QEELDAFEGE IRYHRRIKYR IRDMMKCFPE TGHPMDALQT SAAALGLFYA RRALDNPDYI
RQAVVRLLAK IPTMVAAAHQ MRRGNDPIQP NDNLDYAANF LYMMTEQKPD PLAAKIFDVC
LTLHAEHTIN ASTFSAMVTA STLTDPYAVV ASAVGTLAGP LHGGANEEVL AMLEEIGSVE
NVRPYIEKLV ANKQKIMGFG HRVYKVKDPR ATILQNLAEQ LFEKTGRDEY YAIAQEVEKV
VEEKLGHKGI YANVDFYSGL VYRKLGIPSD LFTPLFAIAR VAGWLAHWKE QLAVNRIFRP
TQVYIGERNQ PYVPMEKRLM VNRNGL