Gene Cyan8802_4231 details


Gene Information

Locus tag: Cyan8802_4231
Symbol:
ID: 8393582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4368634
End bp: 4369887
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 38%
IMG OID: 644982143
Product: glycosyl transferase group 1
Protein accession: YP_003139855
Protein GI: 257061967
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
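
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length (the function names are illustrative, not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4368634, 4369887)
print(length)                  # 1254
print(protein_length(length))  # 417
```

Both values match the Gene Length and Protein Length fields listed in the record.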


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00915385
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATTC TTATACTCTC CTGTGCTTTT CCCTATCCTC CCCTGGAAGG AAAGACCCAG 
ATGAGAACTT TTTCTTTTCT CAAATATCTC AATCAGCGTC ATGATATTAC CTTCGTGACC
CAAGCATCTG ATGAAGTCCT TGAAGAAGAT ATAGCAAGTT TAAAACAGGA AATCCCTAAC
TGTATTATCT TTCCTGATGT TACCCAAGAA ACGACTTCTC CTAGATTTTT AGAAAAAGCC
AAACGGTTAG GAGTGTTTTT TCAGCAGGGA ACCCCCCCAA AATTACTATC CCACTATTCT
CTGAAACTTC AAGAATATTT AGATCAAGAA ATTGACTCAG GACAATTCGA TTTATTAACC
TGTGAAGGTA ATAATGGCGA GATTTATATC CGTCCTGAAT GGCAAAAAAA GTTACCTACT
GTTATTAATA TTCATCGGTC ACTATGGGGT GTTTACAAAC ATCAAAGAGA AAACCACTCT
GGGGAATCTG GTTTGCGAAA TCAAATTAAT TTACCGCTAT TACGTCGCTA CGAAAAACAC
TATTGTAGTA AATTTTCTGC CATTGTTACA GCGACTCAAA CTGAACAAAA ATTACTCAAA
AATTTAAAAT TAGAAACGCC GATTACCGTT GTTCCCAACG GTTTAGATTT ATCTGTTTTT
CCTAAACGTC CTGCCAATTC AGGAGGACAA CGCATTATTT TTATTGGGGC AATGGATAAA
CCAGCGAATA TTGATGCAGC GCGGTTTTTT AGTTTAGAAG TCTTTCCTAA AATCCGTCAA
CGTTATCCAG AAAGTATATT AGAATTAGTG GGGATTCGTC CTGTTCATGA AGTATTAGAA
TTAGGAGAAT ATCCAGGGAT TAAAGTAACC GGACCAGTTT CTTCAATGAT TGAATATCTC
CATTGGGCTA CTGTTTGCGT CATTCCCCTT CGTAAAAGCA TGGGAACTAA AATAAGAACC
CTTCAGGCTT TAGCAACAGG AACTCCTTTA GTGGCTAGTG ATTATGGGTT AGAAGGGTTA
TCTGTTGATG GCACAGGAGT CCCTTTGTGT GCGATGAGAG CTAATGAAAT TGATGAATAT
GTTTATGCGA TTGGTCGTCT GTTTGAACAG CCTAAATTGC GGGAAAAACT CTCAATTAAT
GGTCGAACTT TGATCGAAAA TGAATATACT TGGAAACGCA TGGGAGAACG TTATGAACAA
ATCTTGCTGA CGACCTATAA TACCAGTCAT AAGTTAAGTG ATAACTCACC TTAA
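
The GC content field of the record can in principle be recomputed from the gene sequence listed above. A minimal sketch, assuming the sequence is pasted as shown (10-base groups separated by whitespace) and contains only A, C, G and T; `gc_content` is an illustrative helper, not a function of any particular library:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGAAGATTC TTATACTCTC CTGTGCTTTT CCCTATCCTC CCCTGGAAGG AAAGACCCAG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1254 bp sequence instead of the first line gives the value for the whole gene.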
 
Protein sequence
MKILILSCAF PYPPLEGKTQ MRTFSFLKYL NQRHDITFVT QASDEVLEED IASLKQEIPN 
CIIFPDVTQE TTSPRFLEKA KRLGVFFQQG TPPKLLSHYS LKLQEYLDQE IDSGQFDLLT
CEGNNGEIYI RPEWQKKLPT VINIHRSLWG VYKHQRENHS GESGLRNQIN LPLLRRYEKH
YCSKFSAIVT ATQTEQKLLK NLKLETPITV VPNGLDLSVF PKRPANSGGQ RIIFIGAMDK
PANIDAARFF SLEVFPKIRQ RYPESILELV GIRPVHEVLE LGEYPGIKVT GPVSSMIEYL
HWATVCVIPL RKSMGTKIRT LQALATGTPL VASDYGLEGL SVDGTGVPLC AMRANEIDEY
VYAIGRLFEQ PKLREKLSIN GRTLIENEYT WKRMGERYEQ ILLTTYNTSH KLSDNSP
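
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation, assuming the forward-strand sequence as listed and unambiguous bases only; the codon table is built from the compact amino-acid string for the standard code, whose codon assignments table 11 shares (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative:

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGAAGATTC TTATACTCTC CTGTGCTTTT"))  # MKILILSCAF
print(translate("GATAACTCACCTTAA"))                   # DNSP
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.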