Gene Clim_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2081 
Symbol 
ID6355059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2295929 
End bp2297074 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content51% 
IMG OID642669677 
Productglycosyl transferase group 1 
Protein accessionYP_001944089 
Protein GI189347560 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.212135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG CCTTATATGC AGGGACGTAT GTCAAAGACA AGGATGGGGC TGTACGGTCG 
ATCTATCAGC TTGTCGCCTC GTTCAGGAAA CATGGTCACG AGGTTATCGT CTGGTCTTCA
GACGTATCCG AGCAGGACAA TCATGGATCT CTGAAAGTAC TCCGTCTCCC TTCGGTACCG
ATTCCGCTTT ATCCCGACTA TAAGCTCGGA TTTTTCAGTG CCGTTACAAA GCGGCAGCTC
GATGCGTTCG CTCCGGATAT CGTTCATATA TCCACTCCGG ATATTGTGGG GCGCAGGTTT
CTTCTCTACG CAAAAAACAA GAAGCTTCCG GCGACATCGG TCTATCATAC CGATTTCCCT
TCGTACCTCA GTTATTACCG TCTGGGATTT GCTTTGGGAC CGGTCTGGAA GTACCTGAAA
TGGTTCTACA ATACCTGTGA CCTTGTGCTT GCACCCAATG AGATCGTTCA ACGCAAACTG
ACAGACAAGA GTATCAGAAA CGTTGAAATC TGGTCGAGGG GGATCGACAG GGAGTTATTT
GATCCATCCC GGCGTTCGGA GCTGCTTCGA CAGGAGTGGC ATGCCGTTGA GCGAACGGTG
TTTGTCTATG CCGGCCGTTT TGTGCTCTAC AAGGATATCG AAGTGGTCAT GAGCGTTTAT
GAACGCTTTA TGAGAGAGGG CTTTATCGAT AAGGTTCGTT TCGTCATGAT CGGTTCAGGT
CCAGAAGAAG AACAGATGCG AAGGCGCATG CCGCAAGCGG TTTTTACCGG TTATCTTATC
GGTACGGCGC TGCCGGAGGC GTATGCAAGC GGGGATGTTT TTCTTTTTCC CTCTACTACC
GAGGCGTTCG GAAATGTTGT GCTGGAAGCT TTCGCAACCG GATTGCCTGC TGTCGTCTCC
GACGTTGGCG GTTGCATGGA GCTGGTTAAC GCATCGGAAG CCGGCCTGGT GGCAAAAGCG
GGTGATATCG ATCAGTTTTA TGCCCATTGC CTTAAATTGC TCGATGATGC TCATACCCGC
TCCTCGATGC GCAGGAAGGG GGTCCTTTTT GCCGAAAAAA AGTCTTGGGC TTCGGTAAAC
GGAGCCCTGA TAGCCAGATA CCTTGAACTG ATTGCTGCAG GCCGTTCTGA AGCGGCGACA
GGCTGA
 
Protein sequence
MKIALYAGTY VKDKDGAVRS IYQLVASFRK HGHEVIVWSS DVSEQDNHGS LKVLRLPSVP 
IPLYPDYKLG FFSAVTKRQL DAFAPDIVHI STPDIVGRRF LLYAKNKKLP ATSVYHTDFP
SYLSYYRLGF ALGPVWKYLK WFYNTCDLVL APNEIVQRKL TDKSIRNVEI WSRGIDRELF
DPSRRSELLR QEWHAVERTV FVYAGRFVLY KDIEVVMSVY ERFMREGFID KVRFVMIGSG
PEEEQMRRRM PQAVFTGYLI GTALPEAYAS GDVFLFPSTT EAFGNVVLEA FATGLPAVVS
DVGGCMELVN ASEAGLVAKA GDIDQFYAHC LKLLDDAHTR SSMRRKGVLF AEKKSWASVN
GALIARYLEL IAAGRSEAAT G