Gene Clim_1873 details


Gene Information

Locus tag: Clim_1873
Symbol:
ID: 6355214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2066204
End bp: 2067490
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 55%
IMG OID: 642669474
Product: glycosyl transferase group 1
Protein accession: YP_001943888
Protein GI: 189347359
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
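
The coordinates and lengths listed above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2066204, 2067490)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both values match the Gene Length and Protein Length fields of this record.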


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000113427
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGGCC GGGAAGGGCT TTTTTGGGGT GTCATGAATT TTTTGTTTGT CCACCAGAAT 
TTTCCGGGTC AGTTTCCTCA TGTTGCAAGG GCTTTGGCCG GGATGCCCGG TAACCGTGTG
GTTGCAATTG CTGAGGAGAA GAATGTTGTT CAGCGGTTGC CGGTGCATCC GAACGTAGTG
GTTAAAACCT ACCGGCAGGA GAAAGGCAGC GGTCGCGAAA CGCATCACTA CATTCGAGAT
TTTGAAAGCG CTGTGCGAAG GGGTCAGACG GTTGCGCGTC TGGCGATTGA AATCAGGAAG
TCGGGGTTTC ATCCTCATGT TGTCGTGGGA CATCCAGCCT GGGGTGAAAC CCTGTTTTTG
AAGGATGTGT TTCCGAATGC GCGGCATATA TCGTATTTCG AGTTTTTTTA CCGGGCAGAC
GGGGGCGATG TGGGTTTTGA TCCGGAGTTT CCTTCGGTGT TCGATGACCG GCTGCGGATA
AGGGTCAAGA ATACTACCCA GCTGCTCAGC CTGGAGGCTG CCGATGCGGG GATCTCTCCT
ACCCTCTGGC AGCAGAGCCG GTTTCCTGAA GAGTTCCATT CGAAAATCAG GGTGATTCAT
GAAGGTGTCG ATACCGCATT CGTTCGCCCT GACCCTGATG CCGCAGTCGA GCTTGACGGT
ATGACGCTGA AGAGGTGCGA TAAGGTGGTG ACGTTTCTCT CGCGGAACCT CGAGCCGTAC
CGGGGGTTTC ATGTTTTTAT GAGGACACTG CCGTTGATCC AGAAGGCTTG TCCCGAAGCG
AGGATCGTCA TTATCGGCGG CGATGGGGTG AGTTACGGCA GGAGGCTTCC TGAAGGGCAG
ACGTACCGTG CGATGTATGC TGCAGAAGCT GGTGACAAGG TTGACTGGTC GAAGGTGCAT
TTTACCGGCA GGGTTCCGTA TAACCGGTAT CTTTCGCTTC TGCAGGTTTC TTCGGCGCAT
ATCTACCTGA CCTACCCGTT CGTGCTTTCG TGGTCGATGA TCGAGGCGAT GTCGCTCGGT
TGCGCGCTGA TCGCTTCTGC GACGCCTCCG GTGCAGGAGG TGGTCGAGCA GGGTGAAAAC
GGCATTCTTG TGGATTTTTT CGATCGGGAT GGCCTTGCCG CTGCGGTAGC CGATGCCCTC
GACAATCCGG GAGCTTACGA GCCGATGCGG CAGAGAGCAC GCGAGACTGC TGTGGAGCGG
TACGATTTGC GTTCGAAGTG CCTTCCGGAA ATGCTTCGGT ATCTGAGTGG GGAAGATGAT
TGTGGGCTGT ATGCGGTTAG CGGTTAG
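
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is the percentage of G and C among the unambiguous bases; `gc_percent` is a hypothetical helper, and how the database rounds the value is not stated in this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used on this page) and any
    other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_percent("GTGAACGGCC"))  # 70.0
```

Passing the full 1287 bp sequence to the same function gives the gene-wide figure to compare with the GC content field.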
 
Protein sequence
MNGREGLFWG VMNFLFVHQN FPGQFPHVAR ALAGMPGNRV VAIAEEKNVV QRLPVHPNVV 
VKTYRQEKGS GRETHHYIRD FESAVRRGQT VARLAIEIRK SGFHPHVVVG HPAWGETLFL
KDVFPNARHI SYFEFFYRAD GGDVGFDPEF PSVFDDRLRI RVKNTTQLLS LEAADAGISP
TLWQQSRFPE EFHSKIRVIH EGVDTAFVRP DPDAAVELDG MTLKRCDKVV TFLSRNLEPY
RGFHVFMRTL PLIQKACPEA RIVIIGGDGV SYGRRLPEGQ TYRAMYAAEA GDKVDWSKVH
FTGRVPYNRY LSLLQVSSAH IYLTYPFVLS WSMIEAMSLG CALIASATPP VQEVVEQGEN
GILVDFFDRD GLAAAVADAL DNPGAYEPMR QRARETAVER YDLRSKCLPE MLRYLSGEDD
CGLYAVSG
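
The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this with a small hand-written translator; `translate_cds` and its `first_is_start` parameter are hypothetical names, and the start-codon set is the one listed for table 11.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering. Table 11 uses
# the same amino-acid assignments as the standard code; it differs only in
# which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str, first_is_start: bool = True) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    If first_is_start is True and the first codon is an allowed start
    codon, it is translated as M regardless of its usual meaning.
    """
    dna = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: GTG is read as M at the start position.
print(translate_cds("GTGAACGGCC GGGAAGGGCT TTTTTGGGGT"))  # MNGREGLFWG
# Last 27 bases, already in frame, ending in the TAG stop codon.
print(translate_cds("TGTGGGCTGT ATGCGGTTAG CGGTTAG", first_is_start=False))  # CGLYAVSG
```

The two outputs match the first and last blocks of the protein sequence listed above.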