Gene PCC8801_1411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1411 
Symbol 
ID7103623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1477119 
End bp1478987 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content39% 
IMG OID643474488 
Productglycosyl transferase family 39 
Protein accessionYP_002371625 
Protein GI218246254 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAAAA TAACCTTACA ACAAGAACCT ATTGAGCCTG AGATTCTTGC TAAAAAATCT 
CCAAAAACAC TTTGGATATT TAGTATTTTA TGGCTATTAT TAATTAGCTG GATTGCCTTT
TTATGGAATT TGGGTGCAAT TGGACTGGTA GATAAAACTG AGCCAATGTT TGTCGAAGCA
TCCCGTCAAA TGGCCTTAAC AGGAGATTGG ATAACACCCT ATTGGAATGG AGAAACTCGT
TTTGATAAAC CTCCCTTGAC CTATTGGTTA ATATCCCTAT CTTTTAGGGT ATTTGGAGTT
AATGAATGGG CTGCTCGTTT CCCCTCTGCT GTATTTGCGA TCGCCCTAGT GTGCCTAGGA
TTTTATACCC TACGTTATTG GGGATTTTCC GATACCAACG TTCTTGGATC TTCAACGCAA
GATAAAAACA ACTCAAATGA GCAGCAATTA TGGTTTTCAG CCATTATAGG ATCGGCAATT
ATTGCCCTTA ATCCTTTTTG GATTGCTTGG GGAAGAACAG GGGTTTCTGA TATGTTTCTC
TCTAGCAGTA TTGGGTTAGC CTTACTATCA TTTTTTATTG GCTATGTACA ACCTAAAACT
GCATTACCTA AACCGTTAGG GTTATCAGTA AAAAATTGGT GGTATGTTGG CTATTGGGTA
TTTATGGCCT TAGGGGTTTT GGCAAAAGGT CCCGTAGCAC TCGTCCTTCC TGGCTTTGTT
GTTATCGCTT TTTTATTATA TGTAGGTCGG TTCCTAGAAG TGGTTAAAGA AACCCCTTGG
TTATTAGGAA TTGGCAGCTT TATCTTAATT GCTGTTCCCT GGTTTATTTT AGTGACCCTA
GAACACGGAC AAGAATATAT TGATACCTTC TTTGGATTTC ACAACGTTCA GCGATTTACT
AGCGTTGTTA GCGTCCATCC TGGGGCATGG TATTATTACT TTCCAGTGGT TCTAGTCGCC
TTAATTCCTT GGTCAATTTT TCTGCCCTTA GCCATTGCTC GTTTACGCTT TTGGAGACGT
AACAATTGGG TTAATTCTGA ACGTTCAACC CATCTAGGAT TATTTGCCTT AGTTTGGTTT
CTTGTTATCT TTGTTTTCTT TTCTAGTTCC GTTACAAAAT TAGCCGGGTA TGTTCTTCCT
TTAGTCCCTG CTGCTGCCAT TCTGATTGCC CTGTTTTGGA GTGATCAAAC GACGCAAAAA
GCAACCCAAA ACCAAGGAAT TTGGCCGCTA TTATTTATGC TGAGTGGATT AGCTAATATA
GGAATTTTAG GCGGTTTAGC GGTAGCGAGT TTTCTCAGTC CTCAATTGGT GGGTAATGAC
CCAATGATGC CCCAATTTAA CCAACTATTA CAAGCATCCG GTTTACCCAT CAGAAGCGGA
ATTATTTGGG CTTTTTCTAC CTTGGGTGTT TTAATTTTAT TGCTTTGGAA AGGTCATAAA
CGGTGGATTT GGGCAGCAAA TCTATTAGGA TTTATGGCAT TTTTTAGTTG GGTTGTACTT
CCTATTGCCC CCATTATTGA CAGCGAACGT CAACTTCCTT TAAGAGAATT AGCTACCTTA
GTTAAGAAAG AACAAAAGCC TGGAGAAAAG TTAGTTTTAC TCGGTTTTAT GCGTCCAAGT
ATGGTTTATT ACACACAACA AACCGTTGAT TCTATTACCG AAACTGACAT CTACAGTGGA
CAAGCAATTG AATACTTTAA ACAGCAACTC AATACCCCAG GAACTTCTCC AACAACCTTA
ATTGTTGCTA AACCAAGATA CTTTGAAAAA CTGCGTTTAA ACCCTTCTGA TTATGAAGTG
ATTGGTGACG GAAAAGTCTA TCAATTAATT CGCGTCTCTA AAGATAAGGT AATCCAGAAG
AACTCATAA
 
Protein sequence
MHKITLQQEP IEPEILAKKS PKTLWIFSIL WLLLISWIAF LWNLGAIGLV DKTEPMFVEA 
SRQMALTGDW ITPYWNGETR FDKPPLTYWL ISLSFRVFGV NEWAARFPSA VFAIALVCLG
FYTLRYWGFS DTNVLGSSTQ DKNNSNEQQL WFSAIIGSAI IALNPFWIAW GRTGVSDMFL
SSSIGLALLS FFIGYVQPKT ALPKPLGLSV KNWWYVGYWV FMALGVLAKG PVALVLPGFV
VIAFLLYVGR FLEVVKETPW LLGIGSFILI AVPWFILVTL EHGQEYIDTF FGFHNVQRFT
SVVSVHPGAW YYYFPVVLVA LIPWSIFLPL AIARLRFWRR NNWVNSERST HLGLFALVWF
LVIFVFFSSS VTKLAGYVLP LVPAAAILIA LFWSDQTTQK ATQNQGIWPL LFMLSGLANI
GILGGLAVAS FLSPQLVGND PMMPQFNQLL QASGLPIRSG IIWAFSTLGV LILLLWKGHK
RWIWAANLLG FMAFFSWVVL PIAPIIDSER QLPLRELATL VKKEQKPGEK LVLLGFMRPS
MVYYTQQTVD SITETDIYSG QAIEYFKQQL NTPGTSPTTL IVAKPRYFEK LRLNPSDYEV
IGDGKVYQLI RVSKDKVIQK NS