Gene Clim_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1872 
Symbol 
ID6355213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2064180 
End bp2065823 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content56% 
IMG OID642669473 
Productglycosyl transferase group 1 
Protein accessionYP_001943887 
Protein GI189347358 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000487215 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCTG CGATTTTATT TGAACCGGAT GGTTATGTGC TTTCCGGGGA AAAACTGATG 
GGTCGGCATG CTGCCGGTCA TGCTTTTCTT CGTGCTGCGG TTTCGGGCAG GGATGGCCTG
CCGTTGTTGG CTTATACCCC GCATCGGGGT TCGTTCGATG TTTTTACCCG TCTTGTGCAT
GCTTTCGATC CATCGGCTGA AACCCGGTGG ATTTCCGCAA ACAGGCTGGA TCTGCTGGAG
CGGTCCGGTA CCCTGTATAT TCCCGGACCA GGCCTCGATA CTCAGGCCTG GCTGAGATTG
CGCAGGGGTA TTACGGCTTA CAGCGTCTGC GGGGTGACGC ATACGACGGC TTCGCATGGT
GCCATGGATT CGATTGCGGG TCTGCTGGAG GCTCCGGTTA TGGAGTGGGA TGCCTTGATC
TGTACTTCGG AAGCCGTGCG GGAGAGTGTC CGGCTGGTAC TGGATGCCGG GCGCGATTAT
CTGCAATGGC GGTTCGGTTC CGTCAGGCAA CTGACCATTC CGAAACTTCC GGTTATTCCG
CTTGGAGTGC ATTGCGATGA TTTCCGCTTC GATGAAGCGG AACGCAAAGC TGCCCGTGAA
GCTCTGGGGA TTTCTGACAG CGCTGTTGTT GCGCTGTTTG CCGGGCGCCT TTCTTTTCAT
GCCAAAGCTC ATCCTTTCGC TATGTATGCC GCATTGCAGC AGGTTGCGGA GAGGAGCGGC
AGGGAACTGG TGCTGGTGCA GTCGGGTTGG TTTGCAAATG ATCATATCGG CAACGCATTC
TCGTCGGGGT CAGAGCTGTT TTGTCCCGGG GTGAGAGTGC TCTGCACCGA TGGACGAAAG
CCTGAAGAGC GTCGCAGGAG CTGGGCCGCC GCTGATCTTT TTATTTCGCT TTCGGATAAT
ATTCAGGAAA CGTTCGGGTT GACCCCGATC GAGGCGATGG CGGCCGGCCT TCCCTCTCTG
GTGACTGATT GGGATGGGTA CAGGGATACG GTCAGGGATG GTATCGACGG GTTCAGGATT
GCGACCCGCA TGCCTGAAAA GGGATGCGGA AGTTTTCTTG CCGAGGCGCA TGAGAGCGGT
TCGATGGGCT ACGATATGTA CTGCGGATAT GCATGTCAGC TGGTGTCGCT CGACATCTCC
GCACTTGTTT TGCGGCTTTC CGAGCTATGC GGCAACCCTG AATTGCGGTT TTCGATGGGT
ATTGCTGCAA GAAAACGGGC AGAAGAGGTG TTCGACTGGA GGGTGATTTT CAGTCGCTAC
AAGGAGCTGT GGCAGGAACT GGATGCGGTT CGTGCCGCCG CGGTCGGTCG GTCGGGTGCA
GTTCCGGCAT GTTCTCCTGC ACGGATGGAT CCGTTCACCG TGTTTCAGCA CTACAGCACC
TTTTCGGTCA ACAGGCTTTC AGCGGTTTCT CTGCAGCCGG GTTCAGGCAT GCAGCACTAT
CGACAGCGGC TTGCTCACCC GCTTTTCAGC TATGCTGCCG GGCTGCTTCC CAAACCGGGA
GAGATGGAGC GTTTTTTTCT GTTTTTAATG GCCAGGGGTA CCTGTATTAT CGGTGATATT
GCCCGGGAAA TCGGTCTCGA TGAGTCGAGT ATTATCAGAG CGGTCGTCAT GCTTGAGAAG
ATGGATATTG TTACGATTTC GTGA
 
Protein sequence
MNSAILFEPD GYVLSGEKLM GRHAAGHAFL RAAVSGRDGL PLLAYTPHRG SFDVFTRLVH 
AFDPSAETRW ISANRLDLLE RSGTLYIPGP GLDTQAWLRL RRGITAYSVC GVTHTTASHG
AMDSIAGLLE APVMEWDALI CTSEAVRESV RLVLDAGRDY LQWRFGSVRQ LTIPKLPVIP
LGVHCDDFRF DEAERKAARE ALGISDSAVV ALFAGRLSFH AKAHPFAMYA ALQQVAERSG
RELVLVQSGW FANDHIGNAF SSGSELFCPG VRVLCTDGRK PEERRRSWAA ADLFISLSDN
IQETFGLTPI EAMAAGLPSL VTDWDGYRDT VRDGIDGFRI ATRMPEKGCG SFLAEAHESG
SMGYDMYCGY ACQLVSLDIS ALVLRLSELC GNPELRFSMG IAARKRAEEV FDWRVIFSRY
KELWQELDAV RAAAVGRSGA VPACSPARMD PFTVFQHYST FSVNRLSAVS LQPGSGMQHY
RQRLAHPLFS YAAGLLPKPG EMERFFLFLM ARGTCIIGDI AREIGLDESS IIRAVVMLEK
MDIVTIS