Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3471 |
Symbol | |
ID | 7101562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 3625358 |
End bp | 3626938 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643476483 |
Product | Tetratricopeptide domain protein |
Protein accession | YP_002373592 |
Protein GI | 218248221 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATATC GTCGTTTAAT TCATTATAAG ATCCCTAAAT TAATAGTCAT AGCAATTACT CTGACCAATA TTACTCTTCT CAGTTCTCCT AACCCTAGTT TAGCTCAACT TGAACAACCC AAACCCAACC CCCTAGAACT TCCCCTCGAA GATCCTCTTC TTCCAAATAT TCCCCGTCCC TTAACTCCTT TTGAAAAGCG TAAACTTCGG CAAGAATTGG ATCAATTAAA TGCTCAAGCA CAAGCTCAAT TTGACAAAGG AAATATACCC GCAGCGTTTG ACATTTGGTA TCGAGAATTA AGAGTGAGAA GAGTCTTAGG CCGCTTAGAA GAAATTGCTG CATTAGGACG GGTTGGAGAA ATTGCTTGGA ATAACACTTT AACCCAAGAT GTCAAAATTA TTAGTAAACG ACTCAAAACC TTACAAGAAT TATCAGAGAA AGAAGATCCT TTAAGTCCAG AATTATTAAT GGCTTTAGCG GATGCTTATG AAAAATTACA CGCTTTAGAT GATTCTTTAG GGATTTATAA AAGAGTTCTT AATAATGCTC GTAAAGCTAA AGATCCTATA GCGGAAGAAA AAGCCTTAAA AAGATTGGGG GAATTATATT TAGCAAAATT TGATTATCCC CAAGCTGCTC CTATTTATGA GGAATTACTT GATCGAGCAC AAGCCAGATC AAATACCCTA GAAGAAGGAG TTTATCTGCA AATTTTAGCA GAAATTTATA CTCGTTCTTT ACAACCAGAA AATGCAACAA AAATCAAGGA AGATCTAGCA GAAAATTATT TAAAAACAAA AAAAATTCAG TTAATTCCTG CGCTAAAAAT TTCCATTGGA CAAGATTATG AAGCTTTAAA TCAGCCAGAA ATGGCTAGTA AAAATTATCA AGAAGCCTAT TCCTTATCTT GGTCATTACA GTTATTTGGT GCAGCAGCAG AAGCTTTAAC AAAATTAGGA AAACTCTATC AAACTTACAA GCAAGATGAC TATGCCTTAC AAATTTATCA AAACTTGATT CAAGTCGAAC AACTCTCTTA TAATTATTAT GGATTAATGA GAACCTATGA GACAATTGGT GAAATTTATT TAGCTTATAA ACAATATGAT GTCGCCTTAG AATTTTTTAA ACAAGGACTA ATTTTAGCTC AATCTCTTAA CTATAAACAA GACAACTTTC TTTCCCGTAT TGAGCAAACT AACAGAAAAA TTCGAGGAGA AGATGAGCCT GAACCACCTC AAGTTAACCC CGATGTAACC AATCCTAATA CGTTAGAAGA AATTCCTGAA ATTAACCCCG ATGTAACCAA TCCTAATACG TTAGAAGAAA TTCCTGAAAT TAACCCCGAC TTAATCAATC CTAATACGTT AGAAGAAATT CCTGAAATTA ACCCCGACTT AATCAATCCT AATACGTTAG AAGAAATTCC TGAAATTAAC CCCGACTTAA TTAATCCTAA TCCTTTATAC GAAATCCCTC AACTTGATCC AGACTTAATT AATCCCAATA CGTTAGAAGA AATTCCTCAA CTTGATCGAG ATTGGCTGAA AAAGGGCAAT TTAAAGAATA GGAAAAATTA G
|
Protein sequence | MIYRRLIHYK IPKLIVIAIT LTNITLLSSP NPSLAQLEQP KPNPLELPLE DPLLPNIPRP LTPFEKRKLR QELDQLNAQA QAQFDKGNIP AAFDIWYREL RVRRVLGRLE EIAALGRVGE IAWNNTLTQD VKIISKRLKT LQELSEKEDP LSPELLMALA DAYEKLHALD DSLGIYKRVL NNARKAKDPI AEEKALKRLG ELYLAKFDYP QAAPIYEELL DRAQARSNTL EEGVYLQILA EIYTRSLQPE NATKIKEDLA ENYLKTKKIQ LIPALKISIG QDYEALNQPE MASKNYQEAY SLSWSLQLFG AAAEALTKLG KLYQTYKQDD YALQIYQNLI QVEQLSYNYY GLMRTYETIG EIYLAYKQYD VALEFFKQGL ILAQSLNYKQ DNFLSRIEQT NRKIRGEDEP EPPQVNPDVT NPNTLEEIPE INPDVTNPNT LEEIPEINPD LINPNTLEEI PEINPDLINP NTLEEIPEIN PDLINPNPLY EIPQLDPDLI NPNTLEEIPQ LDRDWLKKGN LKNRKN
|
| |