Gene PCC8801_3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3471 
Symbol 
ID7101562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3625358 
End bp3626938 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content34% 
IMG OID643476483 
ProductTetratricopeptide domain protein 
Protein accessionYP_002373592 
Protein GI218248221 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATATC GTCGTTTAAT TCATTATAAG ATCCCTAAAT TAATAGTCAT AGCAATTACT 
CTGACCAATA TTACTCTTCT CAGTTCTCCT AACCCTAGTT TAGCTCAACT TGAACAACCC
AAACCCAACC CCCTAGAACT TCCCCTCGAA GATCCTCTTC TTCCAAATAT TCCCCGTCCC
TTAACTCCTT TTGAAAAGCG TAAACTTCGG CAAGAATTGG ATCAATTAAA TGCTCAAGCA
CAAGCTCAAT TTGACAAAGG AAATATACCC GCAGCGTTTG ACATTTGGTA TCGAGAATTA
AGAGTGAGAA GAGTCTTAGG CCGCTTAGAA GAAATTGCTG CATTAGGACG GGTTGGAGAA
ATTGCTTGGA ATAACACTTT AACCCAAGAT GTCAAAATTA TTAGTAAACG ACTCAAAACC
TTACAAGAAT TATCAGAGAA AGAAGATCCT TTAAGTCCAG AATTATTAAT GGCTTTAGCG
GATGCTTATG AAAAATTACA CGCTTTAGAT GATTCTTTAG GGATTTATAA AAGAGTTCTT
AATAATGCTC GTAAAGCTAA AGATCCTATA GCGGAAGAAA AAGCCTTAAA AAGATTGGGG
GAATTATATT TAGCAAAATT TGATTATCCC CAAGCTGCTC CTATTTATGA GGAATTACTT
GATCGAGCAC AAGCCAGATC AAATACCCTA GAAGAAGGAG TTTATCTGCA AATTTTAGCA
GAAATTTATA CTCGTTCTTT ACAACCAGAA AATGCAACAA AAATCAAGGA AGATCTAGCA
GAAAATTATT TAAAAACAAA AAAAATTCAG TTAATTCCTG CGCTAAAAAT TTCCATTGGA
CAAGATTATG AAGCTTTAAA TCAGCCAGAA ATGGCTAGTA AAAATTATCA AGAAGCCTAT
TCCTTATCTT GGTCATTACA GTTATTTGGT GCAGCAGCAG AAGCTTTAAC AAAATTAGGA
AAACTCTATC AAACTTACAA GCAAGATGAC TATGCCTTAC AAATTTATCA AAACTTGATT
CAAGTCGAAC AACTCTCTTA TAATTATTAT GGATTAATGA GAACCTATGA GACAATTGGT
GAAATTTATT TAGCTTATAA ACAATATGAT GTCGCCTTAG AATTTTTTAA ACAAGGACTA
ATTTTAGCTC AATCTCTTAA CTATAAACAA GACAACTTTC TTTCCCGTAT TGAGCAAACT
AACAGAAAAA TTCGAGGAGA AGATGAGCCT GAACCACCTC AAGTTAACCC CGATGTAACC
AATCCTAATA CGTTAGAAGA AATTCCTGAA ATTAACCCCG ATGTAACCAA TCCTAATACG
TTAGAAGAAA TTCCTGAAAT TAACCCCGAC TTAATCAATC CTAATACGTT AGAAGAAATT
CCTGAAATTA ACCCCGACTT AATCAATCCT AATACGTTAG AAGAAATTCC TGAAATTAAC
CCCGACTTAA TTAATCCTAA TCCTTTATAC GAAATCCCTC AACTTGATCC AGACTTAATT
AATCCCAATA CGTTAGAAGA AATTCCTCAA CTTGATCGAG ATTGGCTGAA AAAGGGCAAT
TTAAAGAATA GGAAAAATTA G
 
Protein sequence
MIYRRLIHYK IPKLIVIAIT LTNITLLSSP NPSLAQLEQP KPNPLELPLE DPLLPNIPRP 
LTPFEKRKLR QELDQLNAQA QAQFDKGNIP AAFDIWYREL RVRRVLGRLE EIAALGRVGE
IAWNNTLTQD VKIISKRLKT LQELSEKEDP LSPELLMALA DAYEKLHALD DSLGIYKRVL
NNARKAKDPI AEEKALKRLG ELYLAKFDYP QAAPIYEELL DRAQARSNTL EEGVYLQILA
EIYTRSLQPE NATKIKEDLA ENYLKTKKIQ LIPALKISIG QDYEALNQPE MASKNYQEAY
SLSWSLQLFG AAAEALTKLG KLYQTYKQDD YALQIYQNLI QVEQLSYNYY GLMRTYETIG
EIYLAYKQYD VALEFFKQGL ILAQSLNYKQ DNFLSRIEQT NRKIRGEDEP EPPQVNPDVT
NPNTLEEIPE INPDVTNPNT LEEIPEINPD LINPNTLEEI PEINPDLINP NTLEEIPEIN
PDLINPNPLY EIPQLDPDLI NPNTLEEIPQ LDRDWLKKGN LKNRKN