Gene PCC8801_3777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3777 
Symbol 
ID7103996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3965592 
End bp3967154 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content35% 
IMG OID643476782 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_002373883 
Protein GI218248512 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTT TTTATGATGC TCAATTATTG CAAGAGATTA TTCAACGAAC AGTTGAACAT 
TTAATCTTAG TAAGTATTGC TATGGGGGCA GCGATCGCAG TGGGTATCCC TTTAGGCATT
ATTATTGCTA AACAGCCTAA ATTAGCTGAT CCTATTTTGG GAGTTGCAAA TGCCATTCAA
ACGATTCCTA GTTTAGCCAT TTTTGGCTTT TTAATTACCG TTCCCATCAT TGGAGGAATT
GGTAAAATTC CCGCGATTGT TGCCCTAATT CTCTATGCTT TATTGCCTAT TATTCGTAAT
ACTTATACAG GGATAAAGCA AGTGGATAAA GGAGTGAAAG AAGCAGCAAT AGCCCTAGGA
ATGACAAACC GACAAATATT ACTCTTAATT GAAATCCCCT TAGCGTTAGG TATTATTTTA
GCCGGAGTTA GAGTTTCTAC CGTCATTTGT GTGGGAATTA CGACCATTGC TGCTGCTATT
GGGGCAGGAG GGTTAGGGGT GTTTATTTTT CGGGGAATTT CCATGGTTAA TAACCAAATT
ATCCTCGCTG GTGCTATTCC ATCAGCTATC ATTGCTTTAG CAGCAGATTG GGGAATTGGT
TGGCTGGAAA AATCCCTAAG TCAGCACAAG ACAAATCAAC AAAAATCCAA TAAAAAAAGC
TTAATTATTT TTGGAATATT AGGATTGATT TTATTTAGTT TATTGGGAAT TTTTCATCAA
AAAATATTTT TAACAAATCA AGCCAGTGGT ATCGTAATTA TTGGCTCTAA AAATTTTACT
GAGCAGGTGA TTTTAGGAGA AGTATTAGCC CAAAGTATTG AAGCAAAAAC GAATTTAAAA
GTAGACAGAA AATTTAATTT AGGGGGAACT TTTATTTGTC ATCAAGCATT ACAAGCGAGA
AAAATTGATG GATATGTAGA ATATACAGGA ACCGCTTTTA CCGCAATTCT TGAACAGAAA
CCCATTAATA ATCCTCAAAC TGTTTATGAA AAAATTAAAC AAGTGTATCA TGATCAATTT
AACCTAGAAG TCATGCCATC TCTAGGATTT GAAAATACCT ATGCTATTCT CGTTCGTCAA
AAAGATGCCA AACAATATCA ACTAAAAACT ATCTCAGATG CGTCTCAATA TAGCCCCCAA
TGGCAAGCAG GATTTGGTCA TGAATTTTTA TCAAGAGAAG ACGGTTATCC AGGGTTAGCC
AAAACCTATA ATTTAAACTT TGCGACACTT CCAAAAACCA TGGAACTCGG TTTAATGTAT
CGCGCTTTAG CTAATCAAGA AATCGATTTA GCAGCAGGAA ATTCTACTGA TGGAACAATT
CCTATTTTAA ATTTAACGAT TTTAGAAGAT AATAAAAAGT ATTTTCCTCC CTATGAAGCA
GTCCCGATTT TTAATCAAGA AACTTTCAAA GACTATCCTA ACTTAAGGTC TATTATTGAA
CAATTAGCGG GAAAAATTTC CGCCCAAGAA ATGCAACAAC TCAATTATCA AGTGGATGGG
GAGAAAAAAT CAGTTAATGA AGTAGTACAC AAGTTTTTAA TCAAGAAGGG GTTAACATCT
TAA
 
Protein sequence
MNSFYDAQLL QEIIQRTVEH LILVSIAMGA AIAVGIPLGI IIAKQPKLAD PILGVANAIQ 
TIPSLAIFGF LITVPIIGGI GKIPAIVALI LYALLPIIRN TYTGIKQVDK GVKEAAIALG
MTNRQILLLI EIPLALGIIL AGVRVSTVIC VGITTIAAAI GAGGLGVFIF RGISMVNNQI
ILAGAIPSAI IALAADWGIG WLEKSLSQHK TNQQKSNKKS LIIFGILGLI LFSLLGIFHQ
KIFLTNQASG IVIIGSKNFT EQVILGEVLA QSIEAKTNLK VDRKFNLGGT FICHQALQAR
KIDGYVEYTG TAFTAILEQK PINNPQTVYE KIKQVYHDQF NLEVMPSLGF ENTYAILVRQ
KDAKQYQLKT ISDASQYSPQ WQAGFGHEFL SREDGYPGLA KTYNLNFATL PKTMELGLMY
RALANQEIDL AAGNSTDGTI PILNLTILED NKKYFPPYEA VPIFNQETFK DYPNLRSIIE
QLAGKISAQE MQQLNYQVDG EKKSVNEVVH KFLIKKGLTS