Gene Clim_1167 details


Gene Information

Locus tag: Clim_1167
Symbol:
ID: 6353683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1261160
End bp: 1262176
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 50%
IMG OID: 642668783
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_001943214
Protein GI: 189346685
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
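
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic under two assumptions that are not stated in the record itself: the coordinates are 1-based and inclusive, and the gene length counts the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(1261160, 1262176)
print(gene_len)                  # 1017
print(protein_length(gene_len))  # 338
```

Both results agree with the Gene Length and Protein Length fields above.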


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000444503
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCATTA TAACCGGAGG AGCCGGATTT ATCGGCAGCG CCTTGCTCTG GGAGCTCAAT 
ATGCAGGGTC GCCAGGATAT TGTCGTCGTC GACACTCTTG GCTCTACTGC TACCGGACAG
TGGCGGAATC TTTCCGGCCT CAGCTTTGCG GACTTCATCC CCAGGGATAA TTTTCTCCCG
CTCCTCGAAA GCGGAGCATT TACCGGCATA ACAGCTGTCA TTCATATGGG CGCTATCAGC
GCCACAACAG AAACCGATGC AGATCTGCTC ATCGAGCGAA ATTTCGCTTA CTCCAAAAGC
ATAGCGGCAT ACTGCATGAA AAAAAACATC CGGCTGATCT ATGCCTCAAG CGCCGCGACT
TATGGTGACG GGACAGCCGG CTACGAAGAC GATGAAACCA GAATCGACAG CCTTCGTCCG
CTTAATATGT ACGGCTACTC AAAGCAACTG TTCGACCGAT GGGCGCTGAA ACAAGGCATT
CTCGAACATG CCGCAGGTCT GAAATTCTTC AATGTGTACG GACCGAACGA ATATCACAAA
AGCGATATGA CCAGCGTCGT CTATAAAGCC TTCAATCAGA TAAGGGAAAA TGGACAGGTT
AGTCTATTCA AGTCACACCG GCCCGATTAC CATGACGGCG AACAGATGCG CGATTTCGTC
TACATACGTG ACTGTACCGC AATCATGATC TGGCTGCTCG ATCATCCGGA CATATCCGGC
ATTTTCAACA TCGGAAGCGG AGAGGCAAGA AGTTTCAACG ATCTTGTCAA TGCAACGTTT
GCCGCACTGA ATCTGAAACC CGCGATCAAT TATGTACCGA TGCCCGAGCA CCTCCAGGGG
CGTTACCAGT ATCACACACG TGCTGAAATG GCAAAGCTGC GTACTGCAGG ATTTCACAAG
CCTATGACCC CTCTCGAAGA AGGCGTGGGC GACTATGTAC GCAACTATCT TGACGCCGAA
ACGCCTTATT ACGACCTGCA GAGCATACAT AAAAAGATTA ATCACAAGGA CTTTTGA
 
Protein sequence
MIIITGGAGF IGSALLWELN MQGRQDIVVV DTLGSTATGQ WRNLSGLSFA DFIPRDNFLP 
LLESGAFTGI TAVIHMGAIS ATTETDADLL IERNFAYSKS IAAYCMKKNI RLIYASSAAT
YGDGTAGYED DETRIDSLRP LNMYGYSKQL FDRWALKQGI LEHAAGLKFF NVYGPNEYHK
SDMTSVVYKA FNQIRENGQV SLFKSHRPDY HDGEQMRDFV YIRDCTAIMI WLLDHPDISG
IFNIGSGEAR SFNDLVNATF AALNLKPAIN YVPMPEHLQG RYQYHTRAEM AKLRTAGFHK
PMTPLEEGVG DYVRNYLDAE TPYYDLQSIH KKINHKDF
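
The protein sequence can be checked against the gene sequence by removing the 10-character block spacing and translating codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def translate(seq_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq_text)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq_text)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "ATGATCATTA TAACCGGAGG AGCCGGATTT ATCGGCAGCG CCTTGCTCTG GGAGCTCAAT"
print(translate(first_line))  # MIIITGGAGFIGSALLWELN
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be compared with values computed directly from the nucleotides.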