Gene PCC8801_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3137 
Symbol 
ID7102440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3282408 
End bp3284231 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content48% 
IMG OID643476162 
Productdehydrogenase subunit 
Protein accessionYP_002373273 
Protein GI218247902 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAA AAATTGTAAC AGTTGATAGC GAGACTGTCT CTAACACCGT CTACGATGTC 
GTTATTGTCG GAGCAGGGAT CGCGGGAGCC ATTGTCGCCA AACAATTGAG CGAACAAGGA
AAGCGAGTTT TAATCTTAGA AGCGTCCACC AGTGAGGGTC TAACCCTAGA AGGCTTCCAA
GGCTATGTCG AAAGATTCTA CGGCTCTGTG GTCAAACACG GTAACTCCCC CTATCCCCTC
AATGCCAACG CCTTGAGTCC AACGGATGAC GTGATTCGCT ATTTTGAAGA AAAAGGTCCC
TTACCCCTTA GTGGGTCTTA TACACGGGTA TTCGGGGGAA CCACCATGCA CTGGGAAGGC
AAAACCCTAC GGATGTTACC CGAAGACTTC AAATTAAAGA CCAATTACGG TCACGGACTC
GACTGGCCTA TTACCGATCA AGACCTGTGG AAGTATTATC GTCAAGCGGA GTATGAAATC
GGGGTTTCTG GGAACACCGC CGAACAAAGA CAACTCGGCA TTACCTTTGA GTACGATGAT
TACGTCTTCC CCATGAAAGA ACTCCCCCCG TCTTACTTAG ATAAGAAAGT ACGGGAGAAG
ATACAGGGAA CGACCGTGGA CTTCCATGGA GAAACACGGG AATTAGGCCT AAGTACCTTT
CCCCAAGGTC GCAATAGTAT TCCGAACTCA GACTACAAAA CCTACAACGA TGGTTACGAC
TTCGTACCCG ATGGAGTAGC GAGTAAAGTC CCGGTTGAGT ATGGAGAACG GTGTCAAGGA
AACGCCAACT GTGTGCCTAT TTGTCCCGTT CAAGCCAAAT ATGATGCCAG AAAAACCCTC
AATACCATTG CCTTTGGCGA TCGCGTCCAT CTGTTGGCTC AAACCGTCGC TTCTGAAGTA
GAAATTGACC CCCAAACCGG TCGAGTGACT GCCATTCACT ACAAACACTA CCAAGATAGA
AGCATCCCTT CCTATACCGT AGGAACCGCC AGAGGTAAGC TATTTGTCCT AGCGACTAAT
GCCGTCGAAA ACGCTAAATT AATGCTCGCT TCCCACCTAC CGAGCACCAG TGGACTGATG
GGACGCAATT TGATGGATCA CCCCTTCGTT CTAGCTTGGG CACTGATGCC TCAAGTTACC
GGAACCATGC GCGGTCCCTT GGTCACATCG GGTATTGCCA GTTTCCGTCG AGGAGACTTT
CGCAAAGAAC AATGCGCTTT CGGCATCGAT ATCCATAACG ACGGTTGGGG ATGGTCAGGA
ACGGGTGCAA CGGATATCGT TCGAGATGCC GTCGATAACC ACAAAAAATA TGGGTCTGAA
CTGCGTCAGG AATTAATCAG TCGGATCTCA CGGCAGCTAC TGCTGGCCTT TATGTGCGAA
TTACCCGCCG ATCCCAGTAA CCGCGTTAGC ATCGATCCCC ACTATAAAGA TCAAATTGGC
AATTATCGTC CCGTTATTCA CTTTAATATC CCCGATTATT GCAAAAAAAC CATCGCCTAT
TGCCGTGAGC TTTCTAAGAC GATTTTCCAG CGTTTAGGCG CGGAAGATCA CACCCATTAC
GATAAGTCAG ACCCTGCCTA TTTTGAGTAC GAAGGTGAAG GTTACTGGTT CCGAGGAGGG
AATCACTTTT CGGGAACCCA TGTCATGGGG ACGACCAAAT ATAACTCCGT GGTCAATGCT
CAACAGCGTT CTTGGGATCA TGAGAACCTC TACTTAGTCG GTGCAGGAAG TATGCCTTCT
ATTGGTACGT CTAATACAAC CTTAAGTATC GCTGCCTTAG CCTTTTTAGC CTCTGAACAG
ATGCTAAAAG ACTTAAACGC TTAA
 
Protein sequence
MSQKIVTVDS ETVSNTVYDV VIVGAGIAGA IVAKQLSEQG KRVLILEAST SEGLTLEGFQ 
GYVERFYGSV VKHGNSPYPL NANALSPTDD VIRYFEEKGP LPLSGSYTRV FGGTTMHWEG
KTLRMLPEDF KLKTNYGHGL DWPITDQDLW KYYRQAEYEI GVSGNTAEQR QLGITFEYDD
YVFPMKELPP SYLDKKVREK IQGTTVDFHG ETRELGLSTF PQGRNSIPNS DYKTYNDGYD
FVPDGVASKV PVEYGERCQG NANCVPICPV QAKYDARKTL NTIAFGDRVH LLAQTVASEV
EIDPQTGRVT AIHYKHYQDR SIPSYTVGTA RGKLFVLATN AVENAKLMLA SHLPSTSGLM
GRNLMDHPFV LAWALMPQVT GTMRGPLVTS GIASFRRGDF RKEQCAFGID IHNDGWGWSG
TGATDIVRDA VDNHKKYGSE LRQELISRIS RQLLLAFMCE LPADPSNRVS IDPHYKDQIG
NYRPVIHFNI PDYCKKTIAY CRELSKTIFQ RLGAEDHTHY DKSDPAYFEY EGEGYWFRGG
NHFSGTHVMG TTKYNSVVNA QQRSWDHENL YLVGAGSMPS IGTSNTTLSI AALAFLASEQ
MLKDLNA