Gene Cpha266_0351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0351 
Symbol 
ID4569523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp387683 
End bp390667 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content32% 
IMG OID639764949 
Productglycosyl transferase family protein 
Protein accessionYP_910834 
Protein GI119356190 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.792541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAACA AATCTCTTTT TTTAAAACCA GCAAGAATTA TTGAACCAAT CGGTTGGTTA 
GGGCATATTC CATTCTCATT CTGGATTGTA GAAGCGGCGA AGCCCCAAGT AATTGTTGAA
CTTGGTACTC ATTCAGGAAA TTCATATTTC TCTTTTTGCC AAAGCGTCAA GTATAACAGC
TTGCCTACAC TCTGTTATGC AATATATTCA TGTTTGAATA ATGAACAAAC CGAATTATAT
AATGCCGATA TTATTGATTC ATTATTAACC TATAATAATG AACACTACCA GGCTTTTTCA
AATGTTACTG GTAACACTTA TTATAATGCC CTTTCTTTTT TTGAAGACGC TTCTATTGAC
ATTTTACATA TAAATAGCGA AGGGACTTAT ACATCAGTAG TAAATTTATT TAATGCTTGG
ATTCCAAAAT TATCCTCTTC TGGAATAGTA CTTTTTGATA ATATTAATGC TTCAGAAAAT
AATACCGAAG TTTATAAATT ATGGGGATAT GTCAAAAACA TATACCCAAA CCTTACTATT
GAGCATTCCA ATGGGCTGGG AGTTATTTTT ACTGGTAATA TGTATAATGA TAATATTAAT
GAATTCATTA AAACTATTGA AACAGAGGCT AATAGAAATA TAATAGGAAC TCTATTTGAA
AAGATTGGCA ACACAATAGA AATGGATTTT CAAATAAGAC GCCTATTAAA TTCCTTAAAA
GAAGCTGATA CAATGATCGA TATGCTGAAT CAAACAATCA GTAATCGAGA GACTGTAATC
AACAAACTTA TTCAAACAAT TGAAAAAAAC GATGAAACAA TAAAAAGTTT TAGTAAAAAA
CATGATTTTG TGAAAGTGAA ACCTTTGCGA AAACTAAAAA AATCAATAAG AAAAAGAGCT
CATACACTAA GTAAGCTTTT TTTCAAAGTC GAAGATAAAA ATCATCTCAG TGTAAATACG
CATAACAGTA ATGCAGAATG GTTTCTTAAA GAATATAAAA ATATAGCATG TTCAGATAAT
AAACATTGTT TTCATTATAA ATTCGGCATT ATTGATCATC TTGCCGAAAA CAATTCTTTC
TTTAAAGAAT CGAAACCGAC TGCTTCTATA ATTATTCCTG TTTATGGAAA TATTTCATAC
ACAATAAAAT GTATTGAGTC ACTATTAAAT TTACCTGACT CAACATCCTT TGAAATCATA
ATAATCGATG ATCACTCCCC AGATAACTCA TATGAAACAC TTAATAAAAT CAGTCAAATA
AAATTAATAA GAAACGACTG CAATAAAGGA TTTATACATT CATGTAATTC AGGCGCATCT
CTTGCAACTG GTGAATATCT GATATTTCTC AATAATGATA CAGAAGTACT AAATGCCTGG
CTCGATAGTC TTATAGCTCC ATTTATAATA CATGATAATG TCGGCTTAGT AGGTTCTCAA
ATCATATATC CTGATGGGCG TTTACAGGAA GCTGGCGGAG TAATCTTGTC AGATGGCAGT
GGATTAAATT ACGGAAGATT AAGTGATCCA AATAAACCAG AATACAATTT CCTGAGGGAA
GTAGATTATT GCTCGGGATG CTCAATTGCA ATAAAAAAAT CTTTATTCGA TAGCATTGGA
GGATTTGACA CTCTATTTAT CCCTGCTTAT TATGAGGATA CAGACATAGC ATTTACAGTT
CGGAAAATGG GATATAAAGT ACTATATCAA CCAGCATCAA AGGTAATTCA TCATGAAGGG
ATTACTTCTG GTACTGATTT GAAAAAAGGA GTTAAAAAAT TTCAAAATAC AAATAAAATT
AAATTTTACA ACAAATGGGA AACACAATTA AAATCTCACA ATATCTTACC AGATAATTAT
AGCTTAGCTA AGTGCAAGTA TTATAAAGAT ACTATATTAT TTATAGATGC ATGCACTCCA
ACGCCTGATA AAGATTCTGG CTCAGTTGAT GCTTTTTTTC ATCAATATAT TTTCACAAAA
TTAAATTTTA AGGTAACATT CATACCAGAT AACCTGATTT TTCTTGATGG TTACACACAG
GTTTTACAAC AACTTGGTAT TGAATGCCTG TACAAACCAC ACATAACATC TATAGAAAAT
TTTTTAAAAA CCAGAGGTGC TGAATTTAAA TACGTTGTAC TTTCAAGAGT AGGTATTGCA
GTAAAACACA TTAACGCAAT AAAGAAATAC TGCCCTAACA GCATTTTAAT ATTTAACCTT
GTTGATATTC ACTATATTCG AGAAGCACGA CAAGCAAGAA CGCTGAATTC CATCGAACTT
CTTCATAAAG CAAAAAAAAC CAAAGCTACT GAACTCAGTA TCATGAAAAG ATCAGATGTG
AATATCATTA TTAGTGAGTC GGAAGTCAAG CATCTTAGCA AGATCGATCC AGAATTAAAA
CTTTTTAATC TTCCGTTAAT TCTTGACATG CATCAGCGCA CCAATAATTT TGAAAACAGA
AAAAACATCA TGTTTATTGG CGGATTTCAA CATTCCCCCA ATGTAGATGC AGTACTGTAC
TTTTCCAGGG AGATCTGGCC ATTGGTCAAA ACCAGGTTAG TTGATGCTCA CTTTATTATC
ATCGGTAGTG AAATGCCAAA TGAAATAATA AATCTGCATA ATCAAAATGG AATTCTAAGC
ATTGGATATA TTGAAGATTT ATCGCCTTTC TTCAACTCAT GTAAATTTTC TATAGCTCCC
TTGAGATATG GAGCAGGGCA AAAAGGTAAG CTTGCTCGAA GTGGCAGCTA TGGACTTCCC
TCAGTTGCAA CAAGTATAGC AGTTGAAGGA ATGGGCCTTA AACATGAAAA GCATATACTG
ATTGCAGATA AACCTGATGA TTTTGCAAAT TCCATTGAAA GACTTTATCA TGACAGTATA
CTTTGGGAAA AGCTTTCAAA AAATATATAT GATTACACTA ATAGTGAGTA CTCAATTACA
AAAGGAATAA GCAGAATAAG CAATTTAATT CACGCTTTAA ATTAG
 
Protein sequence
MINKSLFLKP ARIIEPIGWL GHIPFSFWIV EAAKPQVIVE LGTHSGNSYF SFCQSVKYNS 
LPTLCYAIYS CLNNEQTELY NADIIDSLLT YNNEHYQAFS NVTGNTYYNA LSFFEDASID
ILHINSEGTY TSVVNLFNAW IPKLSSSGIV LFDNINASEN NTEVYKLWGY VKNIYPNLTI
EHSNGLGVIF TGNMYNDNIN EFIKTIETEA NRNIIGTLFE KIGNTIEMDF QIRRLLNSLK
EADTMIDMLN QTISNRETVI NKLIQTIEKN DETIKSFSKK HDFVKVKPLR KLKKSIRKRA
HTLSKLFFKV EDKNHLSVNT HNSNAEWFLK EYKNIACSDN KHCFHYKFGI IDHLAENNSF
FKESKPTASI IIPVYGNISY TIKCIESLLN LPDSTSFEII IIDDHSPDNS YETLNKISQI
KLIRNDCNKG FIHSCNSGAS LATGEYLIFL NNDTEVLNAW LDSLIAPFII HDNVGLVGSQ
IIYPDGRLQE AGGVILSDGS GLNYGRLSDP NKPEYNFLRE VDYCSGCSIA IKKSLFDSIG
GFDTLFIPAY YEDTDIAFTV RKMGYKVLYQ PASKVIHHEG ITSGTDLKKG VKKFQNTNKI
KFYNKWETQL KSHNILPDNY SLAKCKYYKD TILFIDACTP TPDKDSGSVD AFFHQYIFTK
LNFKVTFIPD NLIFLDGYTQ VLQQLGIECL YKPHITSIEN FLKTRGAEFK YVVLSRVGIA
VKHINAIKKY CPNSILIFNL VDIHYIREAR QARTLNSIEL LHKAKKTKAT ELSIMKRSDV
NIIISESEVK HLSKIDPELK LFNLPLILDM HQRTNNFENR KNIMFIGGFQ HSPNVDAVLY
FSREIWPLVK TRLVDAHFII IGSEMPNEII NLHNQNGILS IGYIEDLSPF FNSCKFSIAP
LRYGAGQKGK LARSGSYGLP SVATSIAVEG MGLKHEKHIL IADKPDDFAN SIERLYHDSI
LWEKLSKNIY DYTNSEYSIT KGISRISNLI HALN