Gene Clim_0502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0502 
Symbol 
ID6354849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp568652 
End bp569719 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content54% 
IMG OID642668135 
Productglycosyl transferase family 2 
Protein accessionYP_001942574 
Protein GI189346045 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.925562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGACG ATTTTCAGCT CCTTCCTCCG GTTGATATTA TCATTCCCCA TTACCGGGGT 
GAGGAGCATC TCGAACGCTG CCTTCGCTCT CTGGCAAATA CCCGTTACCC GTCGATGGGC
ATAGTGGTCG TCGATAATGC AAGTCAAACT CCGGGACTGC AAGAGCTGAT CGAAAGGTTC
GCCGGCGTCC GTCTGCTTGC ACTGCCGCAG AACAGAGGTT ATCCGGGTGG CTGCAACGCG
GGTTTCAGCG CAACGAAAGC CGAATTTCTT GTGTTCATGA ACGACGATAC CCGACACGAT
CCGAACTGGC TCGAACCGCT TGTTACGGCA GCACGTCGGG ATGGGTGCAT TGCTGCCCTG
CAGCCGAAAA TTCTCTCTTT GCGGGAATTC GAACAAGGGA ATAACCGCTT CGACTATGCC
GGGGCTGCGG GAGGGATGCT CGACAGGCTC GGCTATCCCT GGTGCTATGG CCGGACTTTT
TCCGGAGTTG AAACGGATAA TGGCCGGTAC GACACCCCGC GGAATATTTT CTGGGCCTCG
GGCGTAGCCA TGTTCGTCCG TCGGAGTGTG TTTGAAGAGC TTGGCGGGTT TGACGACTCT
TTTTTCATGC ACATGGAAGA GATAGATCTT TCATGGCGTA TGCAGCTTTC CGGATACACG
GTCCGGTCGG TACCTTCATC GGTGGTTTAT CATGAAGGCG CCTCTTCGCT TGCATACGGC
TCCCCTGAAA AAACCTATTA CAATCACCGA AACAATCTTC GTATGATGCT CAGGAACATG
AGTGTCGGGT CACTGATGGT GGCTTTTTCC GCCCGTTTGT TGCTCGAACC CGCAGCGGCC
CTGTTTTATC TCACGAAGGG GCGCAGAGGG TATCGCAACG CTTTTGCCGT CCTGAAAGCG
TTACGGGATT TTCTGATGGA GCTGCCTGAA ACGCTGAGAA CTCGAACGCG GGTGCAGGCT
TTACGGAAAA GAACCGACAA AGCACTGTTC AAAGGGCTGC CGTTCAGTAT TTTTTACCCT
TGGCGGAAAA GTTTTTTTAA TCACGCCGGT CAAGATGGCC TTTGCTGA
 
Protein sequence
MRDDFQLLPP VDIIIPHYRG EEHLERCLRS LANTRYPSMG IVVVDNASQT PGLQELIERF 
AGVRLLALPQ NRGYPGGCNA GFSATKAEFL VFMNDDTRHD PNWLEPLVTA ARRDGCIAAL
QPKILSLREF EQGNNRFDYA GAAGGMLDRL GYPWCYGRTF SGVETDNGRY DTPRNIFWAS
GVAMFVRRSV FEELGGFDDS FFMHMEEIDL SWRMQLSGYT VRSVPSSVVY HEGASSLAYG
SPEKTYYNHR NNLRMMLRNM SVGSLMVAFS ARLLLEPAAA LFYLTKGRRG YRNAFAVLKA
LRDFLMELPE TLRTRTRVQA LRKRTDKALF KGLPFSIFYP WRKSFFNHAG QDGLC