Gene Cyan8802_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2645 
Symbol 
ID8391971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2667687 
End bp2669267 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content34% 
IMG OID644980606 
ProductTetratricopeptide domain protein 
Protein accessionYP_003138342 
Protein GI257060454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.528519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATC GTCGTTTAAT TCATTATAAG ATCCCTAAAT TAATAGTCAT AGCAATTACT 
CTGACCAATA TTACTCTTCT CAGTTCTCCT AACCCTAGTT TAGCTCAACT TGAACAACCC
AAACCCAACC CCCTAGAACT TCCCCACGAA GATCCTCTTC TTCCAAATAT CCCCCGTCCC
TTAACTCCTT TTGAAAAGCG TAAACTTCGG CAAGAATTGG ATCAATTAAA TGCTCAAGCA
CAAGCTCAAT TTGATAAAGG AAATATACCC GCAGCGTTTG ACATTTGGTA TCGAGAATTA
AGAGTGAGAA GAGTCTTAGG CCGCTTAGAA GAAATTGCTG CATTAGGACG GGTTGGAGAA
ATTGCTTGGA ATAACACTTT AACCCAAGAT GTCAAAATTA TTAGTAAACG ACTCAAAACC
TTACAAGAAT TATCAGAGAA AGAAGATCCT TTAAGTCCAG AATTATTAAT GGCTTTAGCG
GATGCTTATG AAAAATTACA CGCTTTAGAT GATTCTTTAG GGATTTATAA AAGAGTTCTT
AATAATGCTC GTAAAGCTAA AGATCCTATA GCGGAAGAAA AAGCCTTAAA AAGATTGGGG
GAATTATATT TAGCAAAATT TGATTATCCG CAAGCTGCTC CTATTTATGA GGAATTACTT
GATCGAGCAC AAGCCAGATC AAATACCCTA GAAGAAGGAG TTTATCTGCA AATTTTAGCA
GAAATTTATA CTCGTTCTTT ACAACCGGAA AATGCAACAA AAATCAAGGA AGATCTAGCA
GAAAATTATT TAAAAACAAA AAAAATTCAG TTAATTCCTG CGCTAAAAAT TTCCATTGGA
CAAGATTATG AAGCTTTAAA TCAGCCAGAA ATGGCTAGTA AAAATTATCA AGAAGCCTAT
TCCTTATCTT GGTCATTACA GTTATTTGGT GCAGCAGCAG AAGCTTTAAC AAAATTAGGA
AAACTCTATC AAACTTACAA GCAAGATGAC TATGCCTTAC AAATTTATCA AAACTTGATT
CAAGTCGAAC AACTCTCTTA TAATTATTAT GGATTAATGA GAACCTATGA GACAATTGGT
GAAATTTATT TAGCTTATAA ACAATATGAT GTCGCCTTAG AATTTTTTAA ACAAGGACTA
ATTTTAGCTC AATCTCTTAA CTATAAACAA GACAACTTTC TTTCCCGTAT TGAGCAAACT
AACAGAAAAA TTCGAGGAGA AGATGAGCCT GAACCACCTC AAGTTAACCC CGATGTAACC
AATCCTAATA CGTTAGAAGA AATTCCTGAA ATTAACCCCG ACTTAATCAA TCCTAATACG
TTAGAAGAAA TTCCTGAAAT TAACCCCGAC TTAATCAATC CTAATACGTT AGAAGAAATT
CCTGAAATTA ACCCCGACTT AATCAATCCT AATACGTTAG AAGAAATTCC TGAAATTAAC
CCCGACTTAA TTAATCCTAA TCCTTTATAC GAAATCCCTC AACTTGATCC AGACTTAATT
AATCCCAATA CGTTAGAAGA AATTCCTCAA CTTGATCGAG ATTGGCTGAA AAAGGGCAAT
TTAAAGAATA GGAAAAATTA G
 
Protein sequence
MIYRRLIHYK IPKLIVIAIT LTNITLLSSP NPSLAQLEQP KPNPLELPHE DPLLPNIPRP 
LTPFEKRKLR QELDQLNAQA QAQFDKGNIP AAFDIWYREL RVRRVLGRLE EIAALGRVGE
IAWNNTLTQD VKIISKRLKT LQELSEKEDP LSPELLMALA DAYEKLHALD DSLGIYKRVL
NNARKAKDPI AEEKALKRLG ELYLAKFDYP QAAPIYEELL DRAQARSNTL EEGVYLQILA
EIYTRSLQPE NATKIKEDLA ENYLKTKKIQ LIPALKISIG QDYEALNQPE MASKNYQEAY
SLSWSLQLFG AAAEALTKLG KLYQTYKQDD YALQIYQNLI QVEQLSYNYY GLMRTYETIG
EIYLAYKQYD VALEFFKQGL ILAQSLNYKQ DNFLSRIEQT NRKIRGEDEP EPPQVNPDVT
NPNTLEEIPE INPDLINPNT LEEIPEINPD LINPNTLEEI PEINPDLINP NTLEEIPEIN
PDLINPNPLY EIPQLDPDLI NPNTLEEIPQ LDRDWLKKGN LKNRKN