Gene Cpha266_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0354 
Symbol 
ID4569526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp394874 
End bp396850 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content44% 
IMG OID639764952 
Productglycosyl transferase family protein 
Protein accessionYP_910837 
Protein GI119356193 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGAG CTGACCAGAA AGCACAGAAC GATTTTGGCG CAGGGGAATG CCTGGATACA 
GAGCTTATAA CCAGAAATAA CGACCTGCAG GCACTTCTTT CTGAGCAGGA ACGGAGAATA
TATGATCTCC TTCAGCAATA CCACAAAAAC GAGTCAGAAT TACGCTCCGT CTACAATACC
CCCATAGGAA AAATTGTCAG ATACTATAAA ACCATCAAGC AAAAAAAGAA AACGCAGAAA
AACGCAGAAA AAAACGACTA CAATATCTGG CTGAACAAGT ATGACCTTTT AACTGAAAAG
AGACGAAAAA ACATCATTGC AGAAATCGCT ACCAAACGTG AGTCTCCCCT GATTTCGGTA
TTCATGCGTC TTAATAAACG ATCACTCTCC TCGCTTGAAC AATCCGTTCA GTCAGTCAAA
GCCCAACTTT ACCCACATTG GGAACTCTGC ATCATTGCAG AGGCATCGCA ACATACAGAA
GCAGAAACTG CGATCAGAAA ATTTGTGGAG AACGATGCAC GAATAACCCG CCACGTCAAA
AAAGAGACCA ACACCATATC GGAGGCAAGC AATGCAGCTC TGGAACTTGC TGGCGGAGAA
TTTTTCGCTC TTCTTGAGAG TGGCGACACC ATTCATCCTC TCGCCCTTTA TCATGTTGCA
CAAGAGGTCA TGCGCTATCC TGAAGCGGGG TTGCTCTATT CCGATGAGGA CTCATTGGAC
AACAACAATA AAAGAGTAAA CCCTTTTTTC AAGCCCGATT TCAATTATGA TCTGTTTCTT
TGCCAAAACA TGGTAGGTAA TCTTGCGGTT TTCAAAACAT CATTGGCCAG GCAAACGGGG
GGGTTTAAAA GAGAGCTTGA TGGAGCTCAG GATTACGACC TCGCTCTAAG GTTTTATGAA
AAACTGAAAC CCGAGCAGAT CCGTCACATT CCCAGAGTGC TTTATCACAA GCGAATTTCC
CGATCCGGAA CCATAGCAGC AACAGAGACT CAAATATCCG GCAATCAGGA AGCAGCGCTG
CTTGCTGTCA ACCATCATCT CAAACGAACC GGAATAGAGG CAACCGTTGA AAAAGCGCCG
GAATATCCGG AATGCAACCG AATACGATAC ACCATCCCCA ACACTCCACC ATCGGTTGAC
ATCATCATAC CGACTAAAGA CATGGCGAAC CTTCTGAAAA TCTGTGTTCT GTCAATTCTC
GCCAAAACAA CCTATAACAA CTATTCGATA ACCATCATTG ACAATGGGTC AAAAGAGCAG
AATACTCTTG ACCTCCTGAA ACAGTGGAAA AATGACTCCC GTATCAGAAT AATACGCGAC
GACGAAACAC CATTTAACTA TTCAAAACTT AACAACAGAG CAGTACACTC TTCTTCTGCG
GATTTTATCT GCCTGATGAA CAATGATATT GAAATCATAA CGCCTGAATG GCTTAACGAA
ATGATGGGGC ATGCAATACA ACCTGGAGTT GGCGCGGTTG GCGCAAGACT CTGGTATCCA
AATGCAACGC TACAACACGC TGGTGTCATT ACGGGGATGT ATACAGGAAC TGGCCATGCA
CATAAAAAAT ACCCTAAAGG AAATCCGGGA TATTTCGGAC GAGCCTGCCT GCAACAGGAA
TATTCAGCCG TTACAGGCGC CTGTCTCTTG ATCAACAGAA TAAATTACCT GCATGTTGCT
GGTTTAAATG AACAAGAGCT CACCGTTGCA TTTAACGATA TTGAGCTCTG CCTGAAACTG
AAAAAAAAGG GACTGCGAAA TATCTGGACA CCCTATGCGG AAATGTTTCA TCACGAATCT
TTGACAAGGG GCCGCAACGA CACCCCTGAA AAAAAAGAAC TTGCAGGAAA AGAACTTGCA
TACATGCAAA ACACATGGGG AATTGATAAA AATCATGATC CTGCCTACAA CCCGAATCTG
TCCATTACAA GCGATGATTT TTCACTGTCC TGGCCTCCTC GAATACTTGA TCGATAA
 
Protein sequence
MIRADQKAQN DFGAGECLDT ELITRNNDLQ ALLSEQERRI YDLLQQYHKN ESELRSVYNT 
PIGKIVRYYK TIKQKKKTQK NAEKNDYNIW LNKYDLLTEK RRKNIIAEIA TKRESPLISV
FMRLNKRSLS SLEQSVQSVK AQLYPHWELC IIAEASQHTE AETAIRKFVE NDARITRHVK
KETNTISEAS NAALELAGGE FFALLESGDT IHPLALYHVA QEVMRYPEAG LLYSDEDSLD
NNNKRVNPFF KPDFNYDLFL CQNMVGNLAV FKTSLARQTG GFKRELDGAQ DYDLALRFYE
KLKPEQIRHI PRVLYHKRIS RSGTIAATET QISGNQEAAL LAVNHHLKRT GIEATVEKAP
EYPECNRIRY TIPNTPPSVD IIIPTKDMAN LLKICVLSIL AKTTYNNYSI TIIDNGSKEQ
NTLDLLKQWK NDSRIRIIRD DETPFNYSKL NNRAVHSSSA DFICLMNNDI EIITPEWLNE
MMGHAIQPGV GAVGARLWYP NATLQHAGVI TGMYTGTGHA HKKYPKGNPG YFGRACLQQE
YSAVTGACLL INRINYLHVA GLNEQELTVA FNDIELCLKL KKKGLRNIWT PYAEMFHHES
LTRGRNDTPE KKELAGKELA YMQNTWGIDK NHDPAYNPNL SITSDDFSLS WPPRILDR